Gene PHATRDRAFT_29702 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_29702 
Symbol 
ID7194881 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011686 
Strand
Start bp383713 
End bp386719 
Gene Length3007 bp 
Protein Length878 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183086 
Protein GI219125646 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GAGTATCTCG AGCGCTATTG ACATTGATAG CGCTCAGAAA TACCAACTAC AGACTGAAGT 
TTACTGAAGG TTTGAGTTAA AAGGACAGTA AGATGCTCCG CTTAATCCAA AGGACCGCTT
TCTCCCGCTT GGTGTCGCGA AAGCCGTCGG CGTCACCCAC AGTTTCGTTA CGCACGGCAG
TCCCTGTCGT CGGTACCACC ACGACAACAT TTCGTCGTGA ACTGCACTTA TCGCCCCGTG
AAGCAGAGCA CTTGCAACTC CACCAAGTCG GTCGGCTGGC TCAATACCGT CTCGCTCGAG
GCGTTCGTTT AAACTACGTG GAAGCCGTCG CGCTTATTAG CATGCAAATG ATGGAAAAGA
TTCGGGACGG ACAGGATTCG GTTGCTGATC TGATGACGAT GGGACAGTCA CTCATCGGAC
GGAATCAAGT AATGCCGGGT GTCGCTAAAA TGATTGGGCA AGTGCAAGTG GAAGCGACCT
TTCTGGATGG GACAAAGCTG TTGACGATAC ACAACCCTGT CTCGGCGCAA GATGGAAATC
TCGAACGGGC TCTGGACGGA TCCTTCCTAC CCGTTCCCAA TCTCAGCATC TTTACCAAGG
GAACCGACGA GGAAGAAAAT CTTGTTCCCG GTCAAGTCAT GACGCAAGGG GATCCCATTA
CCATCAACGC CAATCGTGAG TTGATTGAAC TGTCCGTCAC CAACACGGGC GATCGCCCTA
TTCAGGTTGG CTCGCACTAT GCCTTTGTTG AAACCAACAA GGCGTTGTCC TTCGACCGCT
CCGCGTCCAT CGGTAAGCGC TTGAACGTAC CGTCGGGAGC GTCTGTACGT TTCGAACCAG
GCGAGCGCAA AACGGTCACC CTCTGCGCGC TCGGTGGAAT CCAACGGGTC GTTTCTGGCA
ATCGACTCAC AGACGGGGAT GCCCGAGACC CTGCCCGACA CGCCGCTATT CTCGAACGCG
TCACGTCCCA AGGATTTCAG CACGAACCCG TCGACCCGGC GGATATACCC AAGGGGCGTG
CGTACGTCAT GGAGCGATCG TCCTACGCCG ACATGTACGG CCCTACAGTC GGGGATAGAA
TCGCGCTCGG CGACACTGGC CTGGTCGTTC GTGTCGAACG GGACTATACC GTCTACGGCG
ACGAATGCAA ATTCGGCGGA GGCAAGACAT TACGGGAAGG AATGGGACAG GCAACGGGAC
CAACATCCGA CGATGCATTG GATGTGGTTA TTACCAATGC TTTGATTATC GATCCCTGTA
TTGGTATCGT TAAGGCCGAT GTGGGCATAA AGGGTACTTC TATAGTGGGT ATAGGCAAGG
CCGGCAATCC CGACATGATG GACGGAGTGA CGCCCAATAT GATCGTGGGA AACACCACAG
ATGTCATTGC CGGCGAAAAG CTAATTTTGA CAGCCGGTGG CATCGATACA CACGTTCATT
ACATTTGCCC CCAACAGATT GAGGAAGCGA TTTCGAGTGG GGTGACGACC ATGTTTGGAG
GGGGCACTGG ACCGGTACGT TGCTAAAATT TGTCCAGCTC ATTATCCGAT CTTCCGCAAG
GTAGGATACT GACACTTGTT TCTCTCCTAC TTTGGAATTT GCATGTTAAT ACGTTTCAAT
TCCATGTTGT TGACAGTCTG CCGGATCGAA TGCTACAACC TGCACTCCGG CTCCGAGTCA
AGTTGAAATA ATGCTCAAAG CGACCGATAA ATACCCTTTG AATTTTGGAT TTTCCGGCAA
GGGGAACACG AGCGATACAA AAGCTTTAGA GAACGTACTC AAGGCTGGCG CGGCAGGGTT
CAAACTTCAC GAAGATTGGG GCACCACTCC GAGTTCTATT GACGCCGCCT TAGACTTTGC
CGACGAGCAC GATGTGGCGA TCACAATCCA TTCCGATACA CTCAACGAGT CCGGCTTTGT
GGATGATTCC ATCGCAGCCA TGAAAGGCCG CACTATCCAT ACGTATCATA CTGAAGGGGC
CGGTGGTGGT CACGCTCCGG ACATTATCAA AATTGTCGGC GAAAACCACG TGTTGCCGAG
TTCCACGAAT CCGACGCGTC CGTTTACTGT GAACACGATT GACGAACATC TTGACATGCT
CATGGTATGC CATCACCTCG ACAGTAGCAT TCCGGAAGAT GTGGCGTTTG CGGAATCTCG
CATTCGTGCC GAAACAATCG CTGCCGAAGA CATTTTACAC GATACCGGTG CAATCAGTAT
GATCTCGTCC GATAGTCAAG CCATGGGCCG AGTCGGCGAG GTCATTACTC GCACCTGGCA
AACAGCCGAC AAGATGAAAG CTCAGCGTGG CGCGCTACCA GAAGATTCTG CTGGCGACGA
CAATGTACGC GTCAAGCGAT ACATAGCGAA GTATACAATT AATCCAGCGA TAACCCATGG
GATGAGTCAT ATGATCGGCT CCATCGAAGT GGGTAAGATG GCTGATCTAG TCTTGTGGAA
GCCCTGTATG TTTGGTGCCA AACCGGAAAT GATCGTCAAG GGCGGAACTA TCGCGTACGC
CCAAATGGGG GATCCCAATG CCTCTATACC AACGCCGCAG CCCGTCAAGA TGCGCCCCAT
GTTCGGTAAC ACATCAGCCG GTATGAATTC GGTTGTTTTT GTATCACAGG CCGCCATTCA
TGCTGACACT GCCGGCAAAT TGGGTTTGCA GAAAGCCGCA GCGGGCGTCG TGCGGTGCCG
GGCGGTAACG AAGAAAGACA TGGTCTGGAA CGATCATACA CCAAACATCA CTGTAAATCC
CGAAACCTTC GAGGTGGTAG TAGACGGGGA ATTGCTGCGC TGTGATCCGA TTGACAAGGT
TTCTTTGGGA CAACGTTTTT TCCTTTTTTA AAGCAAGGCA AATGTGGCTC TGGAGAAGGA
CGGATCAGCC TACAAAAATC GAACTTACAA TTTAACGCGC CGCCGCAATC ATGCTCCGGT
TCTGCATCAA CACAATACAC TTACGAGTTC CAACTAACCA TAAGAAATAC CTTATTAATG
TAACACA
 
Protein sequence
MLRLIQRTAF SRLVSRKPSA SPTVSLRTAV PVVGTTTTTF RRELHLSPRE AEHLQLHQVG 
RLAQYRLARG VRLNYVEAVA LISMQMMEKI RDGQDSVADL MTMGQSLIGR NQVMPGVAKM
IGQVQVEATF LDGTKLLTIH NPVSAQDGNL ERALDGSFLP VPNLSIFTKG TDEEENLVPG
QVMTQGDPIT INANRELIEL SVTNTGDRPI QVGSHYAFVE TNKALSFDRS ASIGKRLNVP
SGASVRFEPG ERKTVTLCAL GGIQRVVSGN RLTDGDARDP ARHAAILERV TSQGFQHEPV
DPADIPKGRA YVMERSSYAD MYGPTVGDRI ALGDTGLVVR VERDYTVYGD ECKFGGGKTL
REGMGQATGP TSDDALDVVI TNALIIDPCI GIVKADVGIK GTSIVGIGKA GNPDMMDGVT
PNMIVGNTTD VIAGEKLILT AGGIDTHVHY ICPQQIEEAI SSGVTTMFGG GTGPSAGSNA
TTCTPAPSQV EIMLKATDKY PLNFGFSGKG NTSDTKALEN VLKAGAAGFK LHEDWGTTPS
SIDAALDFAD EHDVAITIHS DTLNESGFVD DSIAAMKGRT IHTYHTEGAG GGHAPDIIKI
VGENHVLPSS TNPTRPFTVN TIDEHLDMLM VCHHLDSSIP EDVAFAESRI RAETIAAEDI
LHDTGAISMI SSDSQAMGRV GEVITRTWQT ADKMKAQRGA LPEDSAGDDN VRVKRYIAKY
TINPAITHGM SHMIGSIEVG KMADLVLWKP CMFGAKPEMI VKGGTIAYAQ MGDPNASIPT
PQPVKMRPMF GNTSAGMNSV VFVSQAAIHA DTAGKLGLQK AAAGVVRCRA VTKKDMVWND
HTPNITVNPE TFEVVVDGEL LRCDPIDKVS LGQRFFLF