Gene Tery_2495 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_2495 
Symbol 
ID4245264 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp3843547 
End bp3844557 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content37% 
IMG OID638107576 
Producthypothetical protein 
Protein accessionYP_722175 
Protein GI113476114 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism
[R] General function prediction only 
COG ID[COG0697] Permeases of the drug/metabolite transporter (DMT) superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.939833 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCTAA AACTGACTAA CTCAAAACTA CCCTTAGCTC CACTTTTGCT AATTGCTCCC 
TTCTTCTTTT GGGGAACTGC AATGGTAGCA ATGAAAGGTG TTATCCCTCA AACAACACCA
TTTTTTATGG CAGCAATACG TATATTGCCA GCAGGTATAT TATTGTTACT TATAGGAATG
CTTCAGGGAA GACCTCAACC ACAAAATAAA TTAGCTTGGT TGTGGATATT ATTGTTTGCT
TTAATAGATG GTGCTTTGTT TCAAGGGTTT TTGGCGCAAG GGTTAGTCAA AACAGGTGCT
GGGTTGGGCT CAGTAATGAT TGACTCTCAA CCTCTAGCGG TAGCTATTTT ATCATTGTGG
TTATTTCAAG AGAGAATTAG ATTTTGGGGT TGGCTAGGGT TAGGTATTGG TGTTTTTGGC
ATTAGCTTAA TTGGTTTACC TGATGAATGG ATATCAAGTT TACTGCACCC GGAAACAATA
CAAATATCTC TGGGTATGGA TACTTTTTCT CAAAGTGGAG AATGGTTAAT GTTATTAGCA
TCTCTGTCTA TGGCAGTAGG AACAGTATTA GTGCGTTGGG TTTGCAAATA CAACGACCCA
GTTATGGCAA CTGGTTGGCA TTTAATTTTA GGTGGTATTC CATTGCTTGC TATTTCAGCA
GGGGTTGAGT CTCAGCAATG GGTTAATATT GATCAGTATG GTTGGATAGC TATGGGTTAT
GCCGCTGTTT TTGGGAGCGC GATCGCTTAT GGTTTATTTT TTTACTTTGC TTCTTCAGGA
AATCTTACCA GTTTGAGTGC TTTAACTTTT CTCACACCAA TTTTTGCTTT GTTATTTGGC
AATTTATTTT TAGGAGAAAT ATTGAGTCGG CTACAGTCAA TAGGAGTTGG TTTAACTTTA
GTAAGTATTT ATTTAATTAA TCAGCGGGAT GTATTAGCTG AGAGGTTAAA TTTTAGTAGT
TCAAGACAAG AAGTTTTTTC CAATATTACT ACACGCAATT TATTAAAATA G
 
Protein sequence
MQLKLTNSKL PLAPLLLIAP FFFWGTAMVA MKGVIPQTTP FFMAAIRILP AGILLLLIGM 
LQGRPQPQNK LAWLWILLFA LIDGALFQGF LAQGLVKTGA GLGSVMIDSQ PLAVAILSLW
LFQERIRFWG WLGLGIGVFG ISLIGLPDEW ISSLLHPETI QISLGMDTFS QSGEWLMLLA
SLSMAVGTVL VRWVCKYNDP VMATGWHLIL GGIPLLAISA GVESQQWVNI DQYGWIAMGY
AAVFGSAIAY GLFFYFASSG NLTSLSALTF LTPIFALLFG NLFLGEILSR LQSIGVGLTL
VSIYLINQRD VLAERLNFSS SRQEVFSNIT TRNLLK