Gene Tery_3473 details

Gene Information

Locus tag: Tery_3473
Symbol:
ID: 4244473
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 5348974
End bp: 5350791
Gene Length: 1818 bp
Protein Length: 605 aa
Translation table: 11
GC content: 36%
IMG OID: 638108447
Product: hypothetical protein
Protein accession: YP_723036
Protein GI: 113476975
COG category: [S] Function unknown
COG ID: [COG5305] Predicted membrane protein
TIGRFAM ID:
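
The coordinate and length fields in this record agree with one another, which a short check can confirm. The following is a minimal Python sketch, assuming the start and end coordinates are 1-based and inclusive and that the reported protein length excludes the stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based inclusive coordinates (an assumption here)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(5348974, 5350791)
print(length)                     # 1818
print(protein_length_aa(length))  # 605
```

Both values match the Gene Length and Protein Length fields of the record.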


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGAAA ATAACATTAA ACTACTACTT AAAAACCAAT GGTTTCACCT AATAATATTA 
TTATTCTGGC TAACAGTTGG CATTATGATC CGGATCACAA ACTTAGCCGC AAAACCAGCC
TCATCCATTG AAATTGCCAC ATTAGGCTAT AGCTTAGGGC ATAGCTTTTT TGACCTACCC
TTAGATCAAA TTATCACCCT AAATGAACTA CTCTCACCCT TAAAATTTGA ATCAACCTCT
ACCGCCATCG ACGTAGTTGA TCAGTTACTA AGAGAAGATA CCCATCCCCC AGTCTACTTT
GTATTAAGTC ATTTGTGGCT TAAGTTATTC AGCACTGACG GAGAAATAGT TTCTCTCTGG
GCTGGGCGTT CCCTAAGTGT TATTTTAGGC GTAGCAGCCA TTCCAGCAAT TTTTAGCTTA
GGTAAGCTAG CATTTTCCCC ATTAGTAGGT CATATAGCAG CCGCATTAAT GGCTGTTTCT
CCTTATGGCA TTTACCTCGC CCAAGAATGT CGCCATCATA CCCTAACCAT ATTATGGACT
ATTGCTTCTA TAGCTTGCTT AATTAAAATA GCACCCTATA TCAAAAAACA TAAAGCATTT
CCAATTTGGT TAGGCTCCGC CTGGGTTGCC ATCAATAGCT TAGGAGTTGC CACCCATTAT
ATTTTTATTT TAGTGTTAGC TACAGAAGGT TTAGTTATAG GAGTTTTTTG GCTAAAAGAT
ATAGAAAACA GACTTCAAAA TTATTGGTGG CGAATTTATC TAGTAGCTTT AGGAACATTT
GTTAGTTGTT TAGTTTGGTT ACCCGTAGTC ACCAGCGCTG CTAATAATAA ACTTACTGGG
TGGATATCAA CCAGCTTCGA TCTTGATGAA ATTTGGCACC CTATCCCCCG TTTGTTAGGT
TGGACTCTGA CAATGGTATG GCTTTTGCCC GTTGAAGGAA CAAATTTATT TGTAACTATC
TTATCCGGAG TCACCCTTTT AGTTGCATTA TTGTGGGTTA TTCCCAAACT ATGGCAAGGA
GGGAAAGCAC AGATGAGGGA TCTTCCAAAC CGTTTATTTT TTCAAATATT TGTGAGTTTT
TTAGTAGGAG CGATCGCCTT ATTTTTAGTA ATTATTTATG GTATGGGTAG AGACTTATCT
CTTGCTGCCC GCTATCAAAT TGTTTATTTT CCTGTTGTAA TTATTTTATT AGCCGCAATA
TTAGGGAAAT GTTGGAACAG CTCAGAAAAA GAAACAGAAG TAAAAAAAGT TTTTTCTGAC
AAACAGACGG GTCAAGTCAA AAAGGAAAGA GTAATAAAAA AAGAGTTAAT TATTAGTTCT
GCCACAAGGA TAATTGTTTC CAACTTAAAA CCAATCAATA AAAGAGTTGT AATTGTAGTT
TTGTTAATCA GTTTTTGGGG TGGGTTAACA GTAATTAACA ACTATGGTTA TCAAAGGTCA
AGGCGTGCAG ATATTCTAGT TAAAGAGATG CAAACACAAT CTAAAGCAGC GCCATTAATT
GCGACAACCT ATCAAACCCA TGCAGAAATT CGTGCTTTAA TTGCACTTGG TTTGGAGTTA
AAACGCCAAG AAGATAAAAG CAATACATCG GGAAATTTTC AACCTCAATT TATATTAGCT
AAAAGACAAC AAAATAAAAA ATTAACCCCA GATTCAACTT TAGCTAAATT TCTATCTCAA
AAATCAAAAC CAATTGACTT ATGGGGAATT AACTTAAAAA TAGAAGCCAG GGAATTAGAA
GCTTTTAACT GTCAAAAATA TTCTGGGAAT CAACCAAAAA TTAATGGTTA TAGTTATAGG
TTGTATCATT GTCGTTAA
 
Protein sequence
MKENNIKLLL KNQWFHLIIL LFWLTVGIMI RITNLAAKPA SSIEIATLGY SLGHSFFDLP 
LDQIITLNEL LSPLKFESTS TAIDVVDQLL REDTHPPVYF VLSHLWLKLF STDGEIVSLW
AGRSLSVILG VAAIPAIFSL GKLAFSPLVG HIAAALMAVS PYGIYLAQEC RHHTLTILWT
IASIACLIKI APYIKKHKAF PIWLGSAWVA INSLGVATHY IFILVLATEG LVIGVFWLKD
IENRLQNYWW RIYLVALGTF VSCLVWLPVV TSAANNKLTG WISTSFDLDE IWHPIPRLLG
WTLTMVWLLP VEGTNLFVTI LSGVTLLVAL LWVIPKLWQG GKAQMRDLPN RLFFQIFVSF
LVGAIALFLV IIYGMGRDLS LAARYQIVYF PVVIILLAAI LGKCWNSSEK ETEVKKVFSD
KQTGQVKKER VIKKELIISS ATRIIVSNLK PINKRVVIVV LLISFWGGLT VINNYGYQRS
RRADILVKEM QTQSKAAPLI ATTYQTHAEI RALIALGLEL KRQEDKSNTS GNFQPQFILA
KRQQNKKLTP DSTLAKFLSQ KSKPIDLWGI NLKIEARELE AFNCQKYSGN QPKINGYSYR
LYHCR
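
The gene and protein sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The sketch below, in Python, shows one way to strip the spacing, compute GC content, and translate the coding sequence. It assumes the codon assignments of NCBI translation table 11, which are the same as the standard code (the two tables differ only in which codons may act as initiators). The helper names `clean`, `gc_content`, and `translate` are hypothetical, and only the first and last lines of the gene sequence are used as sample input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ...
# Table 11 uses the same assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(block: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(block.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 0, stopping at the first stop codon.

    No special handling of alternative initiators is done; the sample
    starts with ATG, so none is needed here.
    """
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = clean(
    "ATGAAAGAAA ATAACATTAA ACTACTACTT AAAAACCAAT GGTTTCACCT AATAATATTA"
)
last_line = clean("TTGTATCATT GTCGTTAA")

print(translate(first_line))  # MKENNIKLLLKNQWFHLIIL
print(translate(last_line))   # LYHCR
```

Each 60-base display line holds exactly 20 codons, so individual lines stay in frame and can be translated separately, as done here. Applying `gc_content` to the full cleaned gene sequence would give the value to compare against the GC content field in the record.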