Gene Tery_0977 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_0977 
Symbol 
ID4245426 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp1540186 
End bp1543833 
Gene Length3648 bp 
Protein Length1215 aa 
Translation table11 
GC content41% 
IMG OID638106219 
ProductTPR repeat-containing protein 
Protein accessionYP_720831 
Protein GI113474770 
COG category[N] Cell motility
[S] Function unknown
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3063] Tfp pilus assembly protein PilF
[COG4995] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.153699 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.773463 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTACAA AAATCCTTGG CAAAAATTAC TTAGTCAAAA CTTGGTTGAT AGCTATTCTT 
GTTATTAATG GGTTAGAAGT TTTAGTTATG CAGGTTGGAA GGGCTGAGAA TTTTGTCTTA
TCACAACAAC CATTAACAAC AGAGGCAACT GAAGAGCAGA TAATTAATGG GACTCGCAAT
GAAAAAGGTA AACAGGGAAG TTATAAGCTA AGTTGGCGAC AGGCTTCAGC TAAAGAAGAG
TCTCTTACTT TTGCTACAGA ACTTAATGAC GAAGCTTTTG AACTATATAA ACAAGGTAAA
TATGATGAAG CAGTTCCTTT ATTAGAGCAG TCTTTGAAAA TTAGGCTGCA GTTGCTGGGT
GCTGAACATC CTGATGTTGC CACCTCCCTC AATAATTTGG CTTTTCTTTA CCAACGTCAA
GGAAGGTATA CAGAAGCGGA ACCATTGTAT ATACAAGCAT TAGATATGAT AAAGAAGCTC
CTTGGTGCTG AACATCCATT AGTTGCGACC TCCCTCAACA ATTTGGCAGA ACTTTACAGA
GTTCAAGGAA GGTATACAGA AGCGGAACCT TTGTATCAAC AAGCATTAAA GATGAGAAAG
AAGCTCCTTG GTGCTAAACA TCCTGATGTT GCCACCAGCC TCAACAGTTT GGCTTTACTT
TACAAAGATC AAGGAAGGTA TACAGAAGCG GAACCTTTGT ATATACAAGC ATTAGAGATG
AGAAAGAAGC TCCTGGGTGC TGAACATCCT GATGTTGCCT CCAGTCTCAA CAATTTGGCA
GAACTTTACA GATCTCAAGG AAGGTATACA GAAGCGGAAC CTTTATATCT ACAAGCATTA
GAAATGACAA AGAAGCTCCT GGGTGCTGAA CATCCTGATG TTGCCTCCAG CCTCAACAAT
TTGGCAGGAC TTTACAAAGA TCAAGGAAGG TATACAGAAG CGGAACCTTT ATATCTACAA
GCATTAGAGA TGAGAAAGAA GCTCCTGGGT GCTGAACATC GATATGTTGC CTCCAGCCTC
AACAATTTGG CTTTACTTTA CAGAGTTCAA GGAAGGTATA CAGAAGCGGA AGCATTGTAT
ATACAAGCAT TGGAAATGGA TAAGAAGCTC CTGGGTGCTG AACATCCTGA TGTTGCCACC
AGCCTCAACA ATTTGGCTTT ACTTTACTCA GATCAAGGAA GGTATACAGA AGCGGAACCT
TTGTATATAC AAGCATTAGA GATATTTAAG AAGCTCCTTG GTGCTGAACA TCGATATGTT
GCCTCCAGCC TCAACAATTT GGCAGGACTT TACAGAGTTC AAGGAAGGTA TACAGAAGCG
GAACCTTTGT ATGTACAAGC ATTGAAAATG TGGAAGAAGC TCCTGGGTGC TGAACATCCT
GATGTTGCCA CCAGCCTCAA CAATTTGGCT TTACTTTACA AAGATCAAGG AAGGTATACA
GAAGCGGAAC CATTGTATCA ACAAGCATTA GATATGAGAA AGAAGCTCCT TGGTGCTGAA
CATCCTGATG TTGCCACCAG CCTCAACAAT TTGGCAGGAC TTTACAGAGT TCAAGGAAGG
TATACAGAAG CGGAACCTTT GTATGTACAA GCATTGAAAA TGTGGAAGAA GCTCCTGGGT
GCTGAACATC CTGATGTTGC CACCAGCCTC AACAATTTGG CTTTACTTTA CAAAGCTCAA
GGAAGGTATA CAGAAGCGGA ACCTTTGTAT ATACAATCAT TAGAGATGAG AAAGAAGCTC
CTTGGTGCTG AACATCCTTC TGTTGCCCAA AGCCTCAACA ATTTGGCAGC ACTTTACTAT
TATCAAGGAA GGTATACAGA CGCGGAACCA TTGTATCAAC AAGCATTAGA GATGAGAAAG
AAGCTCCTTG GTGCTGAACA TCCTGATGTT GCCATCTCCC TCAACAATTT GGCTTTACTT
TACTCTGCTC AAGGTAACAT TGCCTCAGCA GTCCAATACC TTGAACGCGG CTTGGAAGTT
CAAGAAAAAA ACCTAACTTA CAACTTAGCC GCCGGTGCTG AACCTCAAAA AGAAAAATAT
CTCGAAACTA TCTCAGGAGC CAAGGATAGA TCCATCTCTC TCCATCTGCA AATAGCACCA
AATAATCCAG CCGCCACCAC TCTAGCATTA ACCACCGTCC TACGCCGCAA AGGTCGCCTC
CTCCAATTCT TAACCGCCAG CAGAAAAATA CTCCGACAAC AACTTGACCC TCAAGGACTA
CAATGGCTAG ATGAACTTGA TAGTATTAAT AGTCAACTTT CCACTCTACT CTATAACCGA
CCGGAAAATC TCCCTCTAGA AACCTATCGT GATAATTTTG CCAAATTAAA ACAACAGGCA
AATGAACTAG AAAACAAAAT TAGTCGTCGT AGTAGGGAGT TTCGGACTTC GACCCAACCT
GTAACCTTAG AAGCTATTCA GCAGTTGATA CCCGCCAACG CTGCCTTAGT AGAGTTTATA
CAGTATTATC CTTTTGACCC CAAGACAGAG ACATGGGATG ATCCACGCTA TGGGGTTTAC
GTTCTGAATG CAGAGGGAGA ACCCCAAGGC ATCGACCTGG GAACAGTTGA AGAAATTAAA
TCAGACCTAG ACAAATTTAG AGTTCTTCTA AAAAAGAAAA GAGCCCCTTT AGAGAAGCTT
AAAAAAACCG CTAGAGAACT TGACGAAAAA CTAATGCAAC CTGTACGTCA ATTAATAGGT
TCAAAGGAAC AAATTCTAAT TTCTCCTGAT AGTCATCTTA ACCTAATACC TTTTGAAGCT
CTAGTAGATG AAAATAATCA GTATTTAGTA GAAAACTATA GTATTACCTA TCTCAGCTCA
GGACGTGACT TACTTCAATT GACTACTAAA GCCAGAAAAA CATCACCAGC GTTATTGTTA
GGAGACCCAA ATTATGAAAA AAAAGACAAA ATTGCTACTG AGCGTGGTTT CAATAAAACA
TCTTCTAATA TAGTATTAGG AAGACTATTA AAAACTGCTG ATGAAGTCAA AGCCATAGGA
AAATTACTAG GAGTTAAACC TTTGCTACGA GGAGCAGCCA CTGAAAAAGC TATCAGGCAA
GCACAAAATC CTTTTATATT ACATATAGCT ACCCATGGTT TATTTCAAGA ATTTGAAGAA
AAAGCTCAAA ACCCTGGAGA ACTGCCTATC ATAGGGAGAA AATCCCTACT ACGATCAGGT
TTAGCTTTGG CTGGGTTTGA GGAAGAAAAT ATAGTAGGAG ATAATAGTGT TCCGCCAGAA
TTAGAGCCAA AAGAAACAGA CGAAGACAAT GGTTTTTTGA CTGCTTTAGA AGCCACAGGA
TTAAAATTAC TAGGCACAGA GCTCGTAGTG TTATCAGCTT GTGACACAGG GAGGGGGGGA
ATTAGTCCCG GAGAGGGAGT TTATGGACTA CGAAGAGCCT TTTTTATTGC CGGTTCCCAG
AGTCAGGTCA TTAGTTTATG GCAAGTTGAT GATGAAGGCA CAAAAGACTT AATGGTCAAG
TATTATCAAC GTCTGTTAGA TGGAAATATA GGACGAACAG AAGCCTTAAG GAAAACTCAA
CTCGAGATGC TGAGGGGAGA AGCAGGAGAA AACTACAGTC ATCCCTATTA TTGGGCTAGT
TTTATTCCTT CTGGAAATTG GCAACCAGTT CCTCCAAGAT TAAAATAG
 
Protein sequence
MFTKILGKNY LVKTWLIAIL VINGLEVLVM QVGRAENFVL SQQPLTTEAT EEQIINGTRN 
EKGKQGSYKL SWRQASAKEE SLTFATELND EAFELYKQGK YDEAVPLLEQ SLKIRLQLLG
AEHPDVATSL NNLAFLYQRQ GRYTEAEPLY IQALDMIKKL LGAEHPLVAT SLNNLAELYR
VQGRYTEAEP LYQQALKMRK KLLGAKHPDV ATSLNSLALL YKDQGRYTEA EPLYIQALEM
RKKLLGAEHP DVASSLNNLA ELYRSQGRYT EAEPLYLQAL EMTKKLLGAE HPDVASSLNN
LAGLYKDQGR YTEAEPLYLQ ALEMRKKLLG AEHRYVASSL NNLALLYRVQ GRYTEAEALY
IQALEMDKKL LGAEHPDVAT SLNNLALLYS DQGRYTEAEP LYIQALEIFK KLLGAEHRYV
ASSLNNLAGL YRVQGRYTEA EPLYVQALKM WKKLLGAEHP DVATSLNNLA LLYKDQGRYT
EAEPLYQQAL DMRKKLLGAE HPDVATSLNN LAGLYRVQGR YTEAEPLYVQ ALKMWKKLLG
AEHPDVATSL NNLALLYKAQ GRYTEAEPLY IQSLEMRKKL LGAEHPSVAQ SLNNLAALYY
YQGRYTDAEP LYQQALEMRK KLLGAEHPDV AISLNNLALL YSAQGNIASA VQYLERGLEV
QEKNLTYNLA AGAEPQKEKY LETISGAKDR SISLHLQIAP NNPAATTLAL TTVLRRKGRL
LQFLTASRKI LRQQLDPQGL QWLDELDSIN SQLSTLLYNR PENLPLETYR DNFAKLKQQA
NELENKISRR SREFRTSTQP VTLEAIQQLI PANAALVEFI QYYPFDPKTE TWDDPRYGVY
VLNAEGEPQG IDLGTVEEIK SDLDKFRVLL KKKRAPLEKL KKTARELDEK LMQPVRQLIG
SKEQILISPD SHLNLIPFEA LVDENNQYLV ENYSITYLSS GRDLLQLTTK ARKTSPALLL
GDPNYEKKDK IATERGFNKT SSNIVLGRLL KTADEVKAIG KLLGVKPLLR GAATEKAIRQ
AQNPFILHIA THGLFQEFEE KAQNPGELPI IGRKSLLRSG LALAGFEEEN IVGDNSVPPE
LEPKETDEDN GFLTALEATG LKLLGTELVV LSACDTGRGG ISPGEGVYGL RRAFFIAGSQ
SQVISLWQVD DEGTKDLMVK YYQRLLDGNI GRTEALRKTQ LEMLRGEAGE NYSHPYYWAS
FIPSGNWQPV PPRLK