Gene Tery_4464 details

Gene Information

Locus tag: Tery_4464
Symbol:
ID: 4246117
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 6883492
End bp: 6885699
Gene Length: 2208 bp
Protein Length: 735 aa
Translation table: 11
GC content: 39%
IMG OID: 638109347
Product: hypothetical protein
Protein accession: YP_723924
Protein GI: 113477863
COG category: [R] General function prediction only
COG ID: [COG3211] Predicted phosphatase
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
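
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive and that the final codon is an untranslated stop codon; neither assumption is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 6883492
end_bp = 6885699

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 2208
print(protein_length)  # 735
```

Both results match the reported Gene Length (2208 bp) and Protein Length (735 aa).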


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACATTA AACGTAGAGA ATTTCTTATC TTCCTTGGAG CAGGTGCCGG AAAAATTGTC 
CTGAGTTCTT CTCAGCAAGA ACCTTCAGCC AAGCTAGCTA AAAAATATCC AATTAACAAC
AGTTCAACTA ACAATATTAA CTTCCAACCA CCTCAAAGCC CAATACCTCT AGTAACCAGT
AATCAAGAAT TCAAAATAGA TGACTATACT ACATACGAAG TAGTAGATGA TTTAGTCATT
CCAAAAGGCT TTACCTACGA CATTATAGCT TCTTGGGGAG ACAAAATCGG TGATTCTCGT
TTTGGCTACA ATAATGATTA TCTCTCCTAT ATTGAAACAG GAGAAAACGA AGGGTTATTA
ACCATAAATT TTGAATATAT TAGCCCCTTA CCTTGGGTAA CAAGTTATCA AAAAATTATT
GGTAAATCTT TGCCATTTTT AGAGATAGCA TCAGAATTTC AGGAAGCAGT AAAATCCAAT
ATTAACTCTC AAGAATTACT AAATAAAAGC CCTCTAAAAA AACAAATAAC TACAATTGCT
AAGGAAGCAA TGATCGATAT GGGAATAGGC GTTATTTCCA TTCGTCGCCA ACCTGATGGC
CAATGGGTAA GAACAAACTC AAAATTTGAC CGCCGAATAA CAGGTATTTC AGGATTAGAA
GATGGACACT ATTTAAAATC AACAGGCCCA GCAGTAGCAG TATTTAAGAA AAAAGGCAAA
GGTTATGATG ATAAATTAGG CGATCGCATC ATTGGTAGTT TTGCCAACTG TGCAGGTGGT
ACTACACCTT GGGGAACAGT CTTTAGTGCC GAAGAAAATT TTCAAAAGTA TGTAATTGAA
GCAGTTTATT CTGATGGTAC TTCTATTGAC CCTACCCAAG ACAAGTCTCT CTTTATTGGT
GAACAAGGAG TAGTTGGTCT GGGCAGTGCC TTTGGTTTAG CTGGTAATAA ATATGGTTGG
ATGGTAGAAG TAGATCCAGC CTTACCCGAT GATTATGGGA CAAAACATAC TTGGTTGGGC
CGTTATCGTC ATGAAGCAGT CGGTATTAGG GCACAAGCAA ACCAACCTTT AGCTTTTTAT
TCAGGATGCG ATCGCCGTAG CGGTCATCTC TATAAATTTG TCAGTAAAGA CATTGTCAAA
GACCCCACAG ACAAATCCAA CTCCCAACTA TTGACAAATG GAATGTTATA TGGTGCAAAA
TTTAATTCTA ACGGTACAGG TAGTTGGATT CCCTTAAAAG AGGATACCCC CGTTAATCCA
GATTTACCAA GTGTCCATGT GAACAATATG ATTAATCTAC CCCAAAGACC TGATGGTGGT
AGCTTCAAAG TAACAAAAGA TAGAGATATA GAAGCTTTTA AACAAAAATA TCAAACTTTG
GGGGACTTAT ACCAAGGAGA CCCCGAAGCA AAACAAGGAG CAATTCTGAT AGATGCCCAT
TTAGCTGGGA ATGCTGCTGG GGTTACTTGT ACTGCCCGTC CCGAAGATAC TATAGTGGCC
AAAGACGGCA GTCTATTTAT TGCTTTTACT TCTGGATATC CTAGTAGTTC TGATGGTAGT
CCAGATCAAC GTATCTTTAA AGGTCCAGAT GGTGAAGCTT ATGAATATGG CTGGATTATG
CGTTTAGTTG AAGATAGCGA TCGCCCAGAT GCTATGACAT TTCGTTGGCA AATGTTTGCC
ACAGGAGGAG AACCAACATT TGGTGGACTA GGATTTTCTA ATCCTGACAA CCTAGAGTTT
GACGATCAGG GTAATCTCTG GATGGTCACA GATATTAGTA CCAATAAGCA TAATCAAATA
GTGCCTGAGA ATCGAATAGA CAAAGATGGC AAACCAATTA ACCAGGCAAG TCTTTGTGGT
TTATTCGGTA ATAATTCAAT CTGGTATATT CCCACCTCTG GCTCTAATGC TGGTCAAGCA
TACTTATTTG GCATTGGACC AACAGAATCA GAAACTTCTG GGCCTTTCTT TACCAAAGAT
GGAGAAACTT TATTTTTGGC CATTCAACAT CCAGGAGAAT ATAACGGCAT CCGTCAAAAC
ATGGCCTCAG AAACTCGTCG GTTGGCCATG AAAACCATAG ATGGTTCAGA TTTTATGCAA
AATCGTATAG TCCCCATGGG TTCTAACTGG CCTGAAAAAA AAATTAACCA CCCTCCGAAA
CCTTCAGTAG TTGCTATTCG TCGCTTAGAT TCAACAAGGA TTACATGA
 
Protein sequence
MNIKRREFLI FLGAGAGKIV LSSSQQEPSA KLAKKYPINN SSTNNINFQP PQSPIPLVTS 
NQEFKIDDYT TYEVVDDLVI PKGFTYDIIA SWGDKIGDSR FGYNNDYLSY IETGENEGLL
TINFEYISPL PWVTSYQKII GKSLPFLEIA SEFQEAVKSN INSQELLNKS PLKKQITTIA
KEAMIDMGIG VISIRRQPDG QWVRTNSKFD RRITGISGLE DGHYLKSTGP AVAVFKKKGK
GYDDKLGDRI IGSFANCAGG TTPWGTVFSA EENFQKYVIE AVYSDGTSID PTQDKSLFIG
EQGVVGLGSA FGLAGNKYGW MVEVDPALPD DYGTKHTWLG RYRHEAVGIR AQANQPLAFY
SGCDRRSGHL YKFVSKDIVK DPTDKSNSQL LTNGMLYGAK FNSNGTGSWI PLKEDTPVNP
DLPSVHVNNM INLPQRPDGG SFKVTKDRDI EAFKQKYQTL GDLYQGDPEA KQGAILIDAH
LAGNAAGVTC TARPEDTIVA KDGSLFIAFT SGYPSSSDGS PDQRIFKGPD GEAYEYGWIM
RLVEDSDRPD AMTFRWQMFA TGGEPTFGGL GFSNPDNLEF DDQGNLWMVT DISTNKHNQI
VPENRIDKDG KPINQASLCG LFGNNSIWYI PTSGSNAGQA YLFGIGPTES ETSGPFFTKD
GETLFLAIQH PGEYNGIRQN MASETRRLAM KTIDGSDFMQ NRIVPMGSNW PEKKINHPPK
PSVVAIRRLD STRIT
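
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such blocks could be joined, translated, and checked for GC content is shown here. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of any database tool. The codon table is the standard genetic code in NCBI base order (T, C, A, G); translation table 11 uses the same amino-acid assignments and differs only in which codons may act as start codons, so this sketch does not handle an alternative start codon (this gene begins with ATG, so that case does not arise here).

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(block: str) -> str:
    """Remove spaces and line breaks from a printed sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence above.
first_blocks = "ATGAACATTA AACGTAGAGA ATTTCTTATC"
print(translate(first_blocks))  # MNIKRREFLI
```

The translated prefix agrees with the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the reported GC content of 39%.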