Gene Tery_2973 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_2973 
Symbol 
ID4245089 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp4624227 
End bp4625204 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content34% 
IMG OID638108011 
Producthypothetical protein 
Protein accessionYP_722604 
Protein GI113476543 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism
[R] General function prediction only 
COG ID[COG0697] Permeases of the drug/metabolite transporter (DMT) superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.140721 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAATC AAGCTAGATT ATTACCATTA AACCATTTCT CATCAATACC TATAGACTGT 
TTAGCTGCAT TAATTGGCAG TCTAATCTGT ATATCTTTTT CACCAATTTT TATCCGTTTA
AGCGAGTATA ATCTTGGTCC AAATGCTACA ATATTCAATC GTTTTTGGAT TGCAGCTATT
CCCTTTTGGT TACTAAATGG ATTATTAACT TTAAATTACC GAAAACAAAA AAATAAAACT
CAAATTGAAT TAAATAAACC AGAACTAGTT AAAAAAGGAA TGCTACTGAT AGCAGATGGG
ATCCTTTTAT CTATTGGTCT GATACTTTGG GCTTGGTCTT TAACCATGAC AAGCATTGCT
CACTCTACTA TGCTCCATAG TTTAGTGCCT ATATTTACTG TGTTAGGAGC ATGGTTAGCT
TTAAGTCAAA CCTTTGACCA TAAATTTCTA CTTGGTATGT TTGTGGCGAT ATGTGGTTCT
GCCTTGTTAG AAGTGAATGA TATTTTCTCA TTTAGAATTA GTGAACAATT ACTAGGAGAC
TTAGCAGCTT TACTCTCAGC AGTATTTTTT GGTATACACC CACTTATAGT TCAGAAACTT
CGCACTCAAT TCAACCCAGT TACAATTATG ACTTGGTCTT CAACAACGAG TGCCTTATTG
CTCTTACCTG TAGTTCTAAT TACTGAGGAT CAACTTTTTC CAAGTTCATT AACTGGTTGG
CTTGCAGTCA TTATTCAAGC TCTTTGTAGT CAAATGATAG GTATAGGGCT GTGGGCTTAT
TCTCTCAAAA AATTATCTGC TGGGTTTACT AGCTTAGTTG CATTATGTGC TCCTGCTCTT
AGTATGATCG AAGGTTGGAT AATTTTCTCG GAAAATATTA ATCTCTGGAG TTCGGTAAGC
TTCTCAATAA TTTTATGGGG AATGTATTTG GCTATATCAA GTAAATCTGT TAATACAGGA
GTTGAGTCTT CTAATTAA
 
Protein sequence
MNNQARLLPL NHFSSIPIDC LAALIGSLIC ISFSPIFIRL SEYNLGPNAT IFNRFWIAAI 
PFWLLNGLLT LNYRKQKNKT QIELNKPELV KKGMLLIADG ILLSIGLILW AWSLTMTSIA
HSTMLHSLVP IFTVLGAWLA LSQTFDHKFL LGMFVAICGS ALLEVNDIFS FRISEQLLGD
LAALLSAVFF GIHPLIVQKL RTQFNPVTIM TWSSTTSALL LLPVVLITED QLFPSSLTGW
LAVIIQALCS QMIGIGLWAY SLKKLSAGFT SLVALCAPAL SMIEGWIIFS ENINLWSSVS
FSIILWGMYL AISSKSVNTG VESSN