Gene Tery_3497 details


Gene Information

Locus tag: Tery_3497
Symbol:
ID: 4244497
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 5391012
End bp: 5394998
Gene Length: 3987 bp
Protein Length: 1328 aa
Translation table: 11
GC content: 43%
IMG OID: 638108471
Product: peptidase-like
Protein accession: YP_723060
Protein GI: 113476999
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
              [S] Function unknown
              [N] Cell motility
COG ID: [COG3063] Tfp pilus assembly protein PilF
        [COG4995] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
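
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of Python. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(5391012, 5394998)
print(length, protein_length_aa(length))  # 3987 1328
```

Both values agree with the Gene Length and Protein Length fields in the record.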


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTACCA GAATCTTCAG CATCACTTAT TTGGCCCAAA CTTGCCTGGC AGCTATTGTT 
ATTATCAATG GCCTAGAAGC TTTGGTTGTG CAGCCTGTAT GGGCACAGAC TTATGTCTTT
CCACAAGAAC CATCAACAAC CGAAGCAACT GAGGAGCTGA TAATTAATGG GACTCTCAAT
GAAAATAGTG AGCTGTTGGA AGATGGGAGT TACTTTCAGT GGCACACTTT TACGGGCAAG
GCGGGTGAAG CTATTACTAT TGAGTTGAGT AGTTCTGAGT TTGATGCTTA CTTAATGTTA
GTAGACCCAA GATCCAATAA AATTGCTGAG AATAATGATG GAGGGGAAAA TAATAATGCC
AAAATTACAA TAACTCTACC TACAACAGGA ACTTATACAG TTATTGTCAA TACCTATGAA
AAAGGTCAAC AAGGAAGTTA CAAACTGAGT TGGCGACAGG CTTCAGCTAA AGACCAGTCC
CTAGCTTCTG CCACAGAACT TAATGACAAA GCTATTGAAC TATATAAACA AGGTAAATAT
GATGAAGCAG TTCCTTTATT AGAGCAGTCT TTGAAAATTA GGCAGCAGGC GCTGGGGGCT
GAACATCCTG ATGTTGCCGC CAGCCTCAAC AATTTGGCTT TTCTTTACAA TGCTCAAGGA
AGGTATACAG AAGCGGAACC TTTGTATATA CAAGCATTAG AGATGAGAAA GAAGCTCCTG
GGTGCTGAAC ATCCTGATGT TGCCTCCAGC CTCAACAATT TGGCTTTTCT TTACAAAGCT
CAAGGAAGGT ATACAGAAGC GGAACCTTTG TATATACAAG CATTAGAGAT GAGAAAGAAG
CTCCTTGGTG CTGAACATCC TGATGTTGCC ACCAGCCTCA ACAATTTGGC TTCTCTTTAC
GAATCTCAAG GAAGGTATAC AGAAGCGGAA CCATTGTATA TACAAGCATT AGAGATATTT
AAGAAGCTCC TGGGTGCTGA ACATCCTGAT GTTGCCACCA GCCTCAACAA TTTGGCTTTT
CTTTACAATG CTCAAGGAAG GTATACAGAA GCGGAACCTT TGTATATACA AGCATTAGAT
ATGACAAAGA AGCTCCTGGG TGCTGAACAT CCATCAGTTG CCACCAGCCT CAACAATTTG
GCTTTACTTT ACGAGGATCA AGGAAGGTAT ACAGAAGCGG AACCTTTGTA TATACAAGCA
TTAGAGATGA GAAAGAAGCT CCTGGGTGCT GAACATCCTG ATGTTGCCAC CAGCCTCAAC
AATTTGGCAG GACTTTACAA TGCTCAAGGA AGGTATACAG AAGCGGAACC TTTGTATATA
CAAGCATTAG AGATGAGAAA GAAGCTCCTT GGTGCTGAAC ATCCTGATGT TGCCACCAGC
CTCAACAATT TGGCTTCTCT TTACGAATCT CAAGGAAGGT ATACAGAAGC GGAACCATTG
TATATACAAG CATTAGAGAT ATTTAAGAAG CTCCTGGGTG CTGAACATCC TGATGTTGCC
TCCAGCCTCA ACAATTTGGC AGGACTTTAC AAGGATCAAG GAAGGTATAC AGAAGCGGAA
CCTTTGTATA TACAAGCATT AGAGATGAGA AAGAAGCTCC TGGGTGCTGA ACATCCTGAT
GTTGCCTCCA GCCTCAACAA TTTGGCAGCA CTTTACAAGG ATCAAGGAAG GTATACAGAA
GCGGAACCTT TGTATATACA AGCATTAGAG ATGAGAAAGA AGCTCCTGGG TGCTGAACAT
CCTGATGTTG CCTCCAGCCT CAACAATTTG GCAGGACTTT ACAATGCTCA AGGAAGGTAT
ACAGAAGCGG AACCTTTGTA TATACAAGCA TTAGAGATGA GAAAGAAGCT CCTGGGTGCT
GAACATCCAT TAGTTGCCTC CAGCCTCAAC AATTTGGCAG GACTTTACAA TGCTCAAGGA
AGGTATACAG AAGCGGAACC TTTGTATATA CAAGCATTAG AGATATTTAA GAAGCTCCTG
GGTGCTGAAC ATCCATTAGT TGCCACCAGC CTCAACAATT TGGCAGGACT TTACAATGCT
CAAGGAAGGT ATACAGAAGC GGAACCATTG TATATACAAG CATTAGAGAT GAGAAAGAAG
CTCCTGGGTG CTGAACATCC TTATGTTGCC ACCAGCCTCA ACAATTTGGC TTTACTTTAC
TATGCTCAAG GAAGGTATAC AGAAGCGGAA CCATTGTATA TACAAGCATT AGAGATGAGA
AAGAAGCTCC TGGGTGCTGA ACATCCTGAT GTTGCCTCCA GCCTCAACAA TTTGGCAACA
CTTTACAATG TTCAAGGTGA CATTGCTTCA GCAGTGCAAT ACCTTAAACG CGGCTTAGAA
GTGCAAGAAA AAAACCTAAC TTACAACTTA GCCGCCGGGG CTGAACCTCA AAAAAACAAA
TATCTCCAAA CCATCTCAGG AGCAAAGGCT AGAGCCATCT CTCTCCATCT GCAAACAGCA
CCAAATGACC CAGCCGCGAC CACTCTAGCA TTAACCACCA TCCTACGCCG CAAAGGTCGC
CTCCTCGAAT TCTTCACCAC CAGCAGACAA ATACTCCGAC AACAACTCGA CACTCAAGGA
CTACAATGGC TAGACGAACT TAATAATATT TATAGTGAAC TTTCCACTCT GCTCTATAAC
GGACCAAAAG ATCTTCCTCT AGAAACCTAC AAGGAAAATT TTGCCCTCTT AGAACAACAG
GCAAAAGAAC TAGAAGACAA AATTAGTCGC CGCAGTAGCC AGTTTCGGGC AGCCACCCAA
CCTGTAACCT TAGAAGCTAT TCAGGAGTTG ATACCCGCCA ACGCTGCCTT AGTAGAATTT
GTCCAGTATT CTCCTTTTGA CCCCAAGACA GGCAAATTCG GTAAACCACG CTATGGGGTT
TACGTTCTAA GTAGAGAGGG AGAACCCCAA GGCATGGACC TGGGAACAGT TGAAGAATTT
AAAAACATCA TAAAAGTGTT TAGAACTTCT CTACGAGACA AAAGCCAAAC TCCCCTAAAA
CAGCTTAAAG ACACAGCCAG AGAACTTGAC CAAAAACTGA TGCAACCTGT ACGTCAACTA
CTGGGGTCAC AAAAACAAAT TCTGATTTCT CCTGATAGCA ATCTCAATTT AATTCCTTTT
GAAGCATTAG TTGATGAGAA TAACCGCTTT CTCGTAGAAA ACTATAGCTT CACCTACCTA
AGCTCAGGAC GGGACTTACT CACATTTACC TCCACACCAA GAAACACCTC ACCAGCGGTA
TTGCTAGGAT ACCCTACATA CTCGGAAAAA GATCAGGTAA CTATTAAGTC TAAGGGTAGG
TTCGGTGTCT CCGACTTAGG TCTTTCTGGA CTACCAGGAA CCGAAGAGGA GGTCAAAGCC
ATTGGTGAGT TGCTTGGAGT TGAACCTTTA CTACGAGAGG CAGCCACTGA AGAAGCCGTA
AAACAAGTTC AAAGTCCTTT CATATTGCAT ATTGCTACTC ATGGTTTATT TGAAACAATT
GAAGATCCTG AGAAATATCC CACGATAAAT AAAAATTCCC TACTACGGGC AAGTTTAGCT
CTGGCTGGGG TTAAAGAGGA ACAAATAGAT GGAGATAATG GTTTATTGAC CGCTTTGGAA
GCCGCCGGGC TGAATTTGCT AGGCACAGAG CTAGTGGTAT TATCCGCCTG TGACACAGGG
CTGGGGGGAA TAAGTCCCGG AGAGGGGGTT TATGGACTTA GACGAGCATT CGCGATCGCA
GGTTCCCAGA GTCAGGTAAT TAGTTTATGG AAAGTTGATG ACCAAGGGAC GAAAGACTTA
ATGGTGAAGT ATTATCAACG GTTGTTGGGA GGAGACATGG GACGAACAGA AGCTTTAAGG
CAGACTCAAC TGGAGATGCT GGGGGGGGAA GCCGGGGAAA AGTACAGTCA TCCTTATTTT
TGGGCGAGTT TTATTCCCTC TGGAAATTGG CTGGCGATCC CTCCCAGGGT AGAAAGTAAA
AAATCAAAAG TCAACAGTCA AAGGTAA
 
Protein sequence
MFTRIFSITY LAQTCLAAIV IINGLEALVV QPVWAQTYVF PQEPSTTEAT EELIINGTLN 
ENSELLEDGS YFQWHTFTGK AGEAITIELS SSEFDAYLML VDPRSNKIAE NNDGGENNNA
KITITLPTTG TYTVIVNTYE KGQQGSYKLS WRQASAKDQS LASATELNDK AIELYKQGKY
DEAVPLLEQS LKIRQQALGA EHPDVAASLN NLAFLYNAQG RYTEAEPLYI QALEMRKKLL
GAEHPDVASS LNNLAFLYKA QGRYTEAEPL YIQALEMRKK LLGAEHPDVA TSLNNLASLY
ESQGRYTEAE PLYIQALEIF KKLLGAEHPD VATSLNNLAF LYNAQGRYTE AEPLYIQALD
MTKKLLGAEH PSVATSLNNL ALLYEDQGRY TEAEPLYIQA LEMRKKLLGA EHPDVATSLN
NLAGLYNAQG RYTEAEPLYI QALEMRKKLL GAEHPDVATS LNNLASLYES QGRYTEAEPL
YIQALEIFKK LLGAEHPDVA SSLNNLAGLY KDQGRYTEAE PLYIQALEMR KKLLGAEHPD
VASSLNNLAA LYKDQGRYTE AEPLYIQALE MRKKLLGAEH PDVASSLNNL AGLYNAQGRY
TEAEPLYIQA LEMRKKLLGA EHPLVASSLN NLAGLYNAQG RYTEAEPLYI QALEIFKKLL
GAEHPLVATS LNNLAGLYNA QGRYTEAEPL YIQALEMRKK LLGAEHPYVA TSLNNLALLY
YAQGRYTEAE PLYIQALEMR KKLLGAEHPD VASSLNNLAT LYNVQGDIAS AVQYLKRGLE
VQEKNLTYNL AAGAEPQKNK YLQTISGAKA RAISLHLQTA PNDPAATTLA LTTILRRKGR
LLEFFTTSRQ ILRQQLDTQG LQWLDELNNI YSELSTLLYN GPKDLPLETY KENFALLEQQ
AKELEDKISR RSSQFRAATQ PVTLEAIQEL IPANAALVEF VQYSPFDPKT GKFGKPRYGV
YVLSREGEPQ GMDLGTVEEF KNIIKVFRTS LRDKSQTPLK QLKDTARELD QKLMQPVRQL
LGSQKQILIS PDSNLNLIPF EALVDENNRF LVENYSFTYL SSGRDLLTFT STPRNTSPAV
LLGYPTYSEK DQVTIKSKGR FGVSDLGLSG LPGTEEEVKA IGELLGVEPL LREAATEEAV
KQVQSPFILH IATHGLFETI EDPEKYPTIN KNSLLRASLA LAGVKEEQID GDNGLLTALE
AAGLNLLGTE LVVLSACDTG LGGISPGEGV YGLRRAFAIA GSQSQVISLW KVDDQGTKDL
MVKYYQRLLG GDMGRTEALR QTQLEMLGGE AGEKYSHPYF WASFIPSGNW LAIPPRVESK
KSKVNSQR
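
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute GC content, and translate the DNA so the result can be compared with the protein sequence. It is an illustration rather than the database's own method: the helper names are hypothetical, and the codon table is the standard set of amino acid assignments, which translation table 11 shares (table 11 differs in its permitted start codons, which this sketch does not handle beyond the ATG used here).

```python
BASES = "TCAG"
# Amino acid assignments in TCAG codon order (shared by translation tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq_block: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq_block.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence (0.0 for an empty sequence)."""
    if not dna:
        return 0.0
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence as printed in this record.
first_line = "ATGTTTACCA GAATCTTCAG CATCACTTAT TTGGCCCAAA CTTGCCTGGC AGCTATTGTT"
last_line = "AAATCAAAAG TCAACAGTCA AAGGTAA"
print(translate(clean(first_line)))  # MFTRIFSITYLAQTCLAAIV
print(translate(clean(last_line)))   # KSKVNSQR
```

The two printed fragments match the start and end of the protein sequence above. Passing the full gene sequence through `clean` and `gc_percent` gives a value to compare against the GC content field.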