Gene Tery_3355 details


Gene Information

Locus tag: Tery_3355
Symbol:
ID: 4243449
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 5146012
End bp: 5147667
Gene Length: 1656 bp
Protein Length: 551 aa
Translation table: 11
GC content: 33%
IMG OID: 638108339
Product: extracellular solute-binding protein
Protein accession: YP_722930
Protein GI: 113476869
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
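
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 5146012
END_BP = 5147667
GENE_LENGTH_BP = 1656
PROTEIN_LENGTH_AA = 551


def span_length(start: int, end: int) -> int:
    """Length of a feature with inclusive, 1-based coordinates (assumption)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, both comparisons agree with the record: 5147667 - 5146012 + 1 = 1656 bp, and 1656 / 3 - 1 = 551 aa.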


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.188345
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAAAAC ATAAATCATT TAATTCAATC AACCATTCCT GGCAGTCAAT AATTCAATTT 
TTTGGCTTAT TTTGTCTTTG CTGTTTCCTG GTCATTAGTT GTAGCCAACC TCAAAATAAC
CCCGACAATA CTCTAACTAT AGAAACAAAT ACAAATCGCA TTACTATAGG CACAACTTTA
AAACCTCGGA CTATCGACCC GGCAGATGCC TACGAAGTAA TATCTGGTAA CTTACTCCAT
AACTTAGGCG ATCGCCTCTA CGGTTATAAG CTAGGAACAA TGGAACTTGT ACCATCACTA
GCAACAGAAA TGCCAAAAAT TAGTGAAGAT GGCACAACAT ATACTATTCC CATTCGTCAA
GGGGTGACAT TCCATGACGG TACTCCTTTC AACGCTGAAG CAATGGCCTT TTCTTTCAAA
CGTTTTATTA AAAATAGTGG CCCACCTTCT TCTTTGTTAA CTAATACTAT TAAATCAGTA
GAAGCTACGG GAGAATACCA ATTAACAATT AAGTTGAAAA AACCCTTTGC AGCTTTTACT
TCTTTATTAA CATTTTCTGG TCTGTGTGCT GTTTCCCCAC AAGCTTATGA AATAGGTGAA
AGTCAATTTA AACCTGATAC ATTTATTGGT ACTGGTCCCT ATAAATTAGC TGAGTATGGT
ACTGATACTT TACGCTTAGA TGTATTTGAA AATTATTGGG GAGAAAAACC AAAAAATCAA
GGAATTGACA TTCAAATATT TTCTAGTTCA GCTAACCTAT TTAATGCTTT TAAAACAGGG
TCTATTGATG TAGCTTATTT TTCTCTAGAC ACTGACCAAA TTACTAATTT AGAAGCTGAA
GCTATCCGTC AAGGATGGCA AGTAATTTCT ACAGATGGTA AGACAGTTAA TTATATGGTT
TTAAATCTCA ATTTAGAACC ATTAAATAAC AAAGCTGTTA GACAAGCTTT AGCATCTATT
ATTGATAGAA AATTACTAAA TGAACGAGTT TTACAAGGAA AAGCTGAACC AGTTTATAGT
CTAATTCCTA AACAAATTAA TAGCTATAAA CCAGTGTTTA AAGAAAACTA TGGAGATGGA
AATTTTGCTC AAGCTAAGGA GTTATTAAAA GAGGCAGGAT ATTCTCGAAA TAGCCCAGCC
AAAATTGAAA TTTGGTATGC TGCAAATTCT ACTAAGAGGC AATTAACAGC TAGTACATTA
AAAGCATATG TAGATCAAAA TTTAGAAGGT TTAATGGAGT TAGAATTGAA TAGCGTAGAA
GCAGCTACTG CTTTTAATAA TTTAGATAAA GGAGTATATC AAACATTTAT TTTAGACTGG
TATGGAGACT TTCTTGATGC GGATAATTAT ATCCAACCAT TTTTAGAATG TATCAAAGGT
TCAAAAGAAA AAGGTTGTGA AGAAGGGGCT AGTCAATTTC AAGGTTCATT TTATTATAGT
GATCGGATAA ATAAATTAAT CCAACAACAA CGTCAAGAGC AAAACCCAGA AAAGCGGCTA
GCTATTTTTG TAGAAATTCA AGAATTATTA GCAGAAGATG TTCCTTTTAT CCCATTATGG
TTAGATAAAG ATTATGTATT TGCTCAGAAA AACATCAGTG GGGTTAGTTT AGAACCTACT
CAGCAATTTT CTTTCTTGAA AATTAATAAA TCATAA
 
Protein sequence
MEKHKSFNSI NHSWQSIIQF FGLFCLCCFL VISCSQPQNN PDNTLTIETN TNRITIGTTL 
KPRTIDPADA YEVISGNLLH NLGDRLYGYK LGTMELVPSL ATEMPKISED GTTYTIPIRQ
GVTFHDGTPF NAEAMAFSFK RFIKNSGPPS SLLTNTIKSV EATGEYQLTI KLKKPFAAFT
SLLTFSGLCA VSPQAYEIGE SQFKPDTFIG TGPYKLAEYG TDTLRLDVFE NYWGEKPKNQ
GIDIQIFSSS ANLFNAFKTG SIDVAYFSLD TDQITNLEAE AIRQGWQVIS TDGKTVNYMV
LNLNLEPLNN KAVRQALASI IDRKLLNERV LQGKAEPVYS LIPKQINSYK PVFKENYGDG
NFAQAKELLK EAGYSRNSPA KIEIWYAANS TKRQLTASTL KAYVDQNLEG LMELELNSVE
AATAFNNLDK GVYQTFILDW YGDFLDADNY IQPFLECIKG SKEKGCEEGA SQFQGSFYYS
DRINKLIQQQ RQEQNPEKRL AIFVEIQELL AEDVPFIPLW LDKDYVFAQK NISGVSLEPT
QQFSFLKINK S
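
The sequences above are printed in blocks of ten characters, so the whitespace must be removed before any computation. The sketch below shows one way to do that, then compute GC content and translate the DNA. It is an illustration with hypothetical helper names, not the tool that produced this record. It assumes only the bases A, C, G and T occur. It uses the amino acid assignments of translation table 11, which are the same as those of the standard code; the alternative start codons of table 11 are not handled, which does not matter here because the gene starts with ATG.

```python
# Codon table in the conventional TCAG ordering (amino acid assignments
# shared by translation tables 1 and 11). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence above.
    print(translate("ATGGAAAAAC ATAAATCATT TAATTCAATC"))  # MEKHKSFNSI
```

The first thirty bases translate to MEKHKSFNSI, which matches the first block of the protein sequence. Passing the full gene sequence to the same functions would allow the listed GC content and protein sequence to be checked in the same way.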