Gene Suden_2077 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSuden_2077 
Symbol 
ID3763088 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfurimonas denitrificans DSM 1251 
KingdomBacteria 
Replicon accessionNC_007575 
Strand
Start bp2165699 
End bp2166979 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content39% 
IMG OID 
Productglucose/galactose transporter 
Protein accessionYP_394586 
Protein GI78778271 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCGCTA AAAGTTCATT CATTCCTATG TTTATCATGG GCACTCTCTT CTTTATCTTT 
GGTTTTGTAA CATGGTTAAA TGGCTCTTTA ATCCCATTTT TAAAAATAGT ATGTGAGCTC
AATGAGTTTG AAGCACTGTT TGTAACCTTT GCTTTTTATA TCTCATATAC TGTTATGGCT
CTTCCTATGG CGTATGTTTT GGAAAAAACT GGCTACAAAG ATGGTATGGC TTTAGGACTT
GGAGTGATGG CTATCGGCGC ACTTCTTTTT ATACCAGCAG CACAAAGTGC TGAGTTTATT
ATCTTCTTGA TTGCACTCTT TACATTAGGA ACAGGACTTA CAATCCTTCA AAGCGCTTCA
AATCCATATA TCGTTTATCT GGGTCCAGTA GAGAGTGCGG CTATGCGAAT AAGCATAATG
GGAATCATAA ACAAAGGTGC TGGAGTTTTG GCACCTATCG TATTTAGCGC TTTGCTCTTT
TTGGATGTTG GTGAACAAGA TGTGATGAGC GAAGCCTCAA GGGAGATACT TGCTCAAAAA
CTTATAGTTC CATATATTGT TATGGCGCTT ATATTGGTAG CGCTTATCGT TCTTATCAAG
TTCTCATCAC TAGAGAGTCT CACAATAAAA GATGATAAAT CAAATAGTGA AAAAAGCTCT
ATATTTGAAT TTCCACGCCT TATCTTAGGC GCAATTGCCC TCTTTTTTTA TGTCGGCATA
GAAGTCATTG CGGGAGATAC AATAGCTCTT TATGCGCAAA GTATTGGAGT TGAGAGTTAT
AGCACTCTTA CCTCTTTTAC CATGTTTTTT ATGGTGCTTG GATATATAGC TGGAATTGTT
TTTATACCTA GATATCTCTC ACAAAAAAAC GCACTTATAG GCTCTGCACT CTTTGGTATT
TTGTTTTTAC TCGGTGTTGT ATTTTCCTTA TCAACCTCTC ATCTCTTATC TGAGATTTTA
TGGGGATGGA GTGGTGTTAG AACTCTTCCA GATACTATAA CATTTGTTGC ACTTTTAGGT
TTTGCAAATG CTCTCGTATG GCCGAGTATC TGGCCATTAG CGCTTAATGG GCTTGGCAAA
CATACCCCAA AAGGGAGCGC ACTGCTTATT ATGTCAATAG CGGGAGGAGC GCTTCTTCCG
CTTCTTTTTG GCAAAATTGC TCAGCTGGTC TCAAGCATGC AAACAGCATA TCTCCTTGGC
ATAGTCTCTT ATGCCTTTAT ACTTTATTAC GCTGTCGCAG GGCACAAAAT TTCATCTTGG
AAAAACAGTG ATAAGAGTTA G
 
Protein sequence
MPAKSSFIPM FIMGTLFFIF GFVTWLNGSL IPFLKIVCEL NEFEALFVTF AFYISYTVMA 
LPMAYVLEKT GYKDGMALGL GVMAIGALLF IPAAQSAEFI IFLIALFTLG TGLTILQSAS
NPYIVYLGPV ESAAMRISIM GIINKGAGVL APIVFSALLF LDVGEQDVMS EASREILAQK
LIVPYIVMAL ILVALIVLIK FSSLESLTIK DDKSNSEKSS IFEFPRLILG AIALFFYVGI
EVIAGDTIAL YAQSIGVESY STLTSFTMFF MVLGYIAGIV FIPRYLSQKN ALIGSALFGI
LFLLGVVFSL STSHLLSEIL WGWSGVRTLP DTITFVALLG FANALVWPSI WPLALNGLGK
HTPKGSALLI MSIAGGALLP LLFGKIAQLV SSMQTAYLLG IVSYAFILYY AVAGHKISSW
KNSDKS