Gene Suden_1993 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSuden_1993 
Symbol 
ID3762843 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfurimonas denitrificans DSM 1251 
KingdomBacteria 
Replicon accessionNC_007575 
Strand
Start bp2074536 
End bp2075753 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content36% 
IMG OID 
ProductSodium:dicarboxylate symporter 
Protein accessionYP_394502 
Protein GI78778187 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTAAAAA AAATAAAAAA ATATTCTACT ACAAATACTC TGGTTCTCTT GGGTATAGTC 
TTAGGAGTTC TTTTTGGAGT ATTTGCACCA CAACTTGCAC TAGGGCAACA AATTATAGGA
GAGATTTTTA TAAGTTTTTT AAAAATGCTC GTTGTTCCTC TCGTTTTTTC TAGTATATAT
GTATCAATTG TTGGGTTAGG CGATTTAAAA AGCCTTAGAG ATATTGGGGC AAGAGTTATT
GCTCTTTATA TTTTAACAAC AGCGCTTGCT GTATTTGCTG CGGTAATGGC TATGAATTTA
GTGCCTCTTG GCGAGGTTGT CGCAGTTGAA GGTTTAGAGT ATGCAAAAGC TTCAGAGCTA
GCATCTTTCT CATTTAAGAG CATGATTTTG AGCTTTATTC CCACAAACAT ATTTCACTCT
CTCTCAGAGG GATCAATGAT GCCTATTATT GTTTTTGCAT CCCTTTTTGG AGTAGCTTCT
TTACATGTAG CTAAACAAAA AGAGCTTGAA ATGGTTAATT TTTTTACTGG CGTAATGGAT
GCAATGCTTA TAATAGCACA GTGGGTTATA AAACTAACTC CCATTGGAGT ATTTAGCCTT
ATCTCTTATG TGGTTGCTAA TCAAGGAGTA GATGTAATAA TTGGTCTTTG GAAGTATCTT
TTAATGGTTT TAGCTGTTCT GTTTTTTCAT GCAGTTGTTA CGCTGCCTGC TCTTCTGTTT
TTCTTTTCTC GTATAAATCC GTTTAAATAT TTAAGTGCGA TACGTGAAGC TCCTATTATG
GCATTTTCTA CAGCTTCTAG TATGGCTACT CTTCCTGTTT CCATGCGTGT TGTTGAAGAA
GTTGGCGGAG TTGATAAAAA AAATGCCTCT TTTGTTCTTC CTCTTGGTGC GACAATTTCT
ATGGATGGCA CAGCTGCATA TTTGACAATA GCAACTCTCT ATATCTCGCA CTTATCTGGA
GTTGATTTAA CGCTTTTTGA ACAGTTACTA TTAGGTGTGA GTGTTGTGGC TCTTAGTGTA
GGAGTTGCAG CCCTGCCTAG TGCCTCTTTG GTAATGCTCA TTGTTATTCT CAAACAGTTT
GGTTTGCCTT TGGAATATAT AGCGTTAATT ATTGCAGTAG ATAGAATTTT AGATATGGCA
AGAACAGCAT TGAATGTCAC TTCTGATTTG GTTGTTGCAA AAATTGTTGA TGAGTGCTTA
AAGCGTAAAA ATATATAA
 
Protein sequence
MVKKIKKYST TNTLVLLGIV LGVLFGVFAP QLALGQQIIG EIFISFLKML VVPLVFSSIY 
VSIVGLGDLK SLRDIGARVI ALYILTTALA VFAAVMAMNL VPLGEVVAVE GLEYAKASEL
ASFSFKSMIL SFIPTNIFHS LSEGSMMPII VFASLFGVAS LHVAKQKELE MVNFFTGVMD
AMLIIAQWVI KLTPIGVFSL ISYVVANQGV DVIIGLWKYL LMVLAVLFFH AVVTLPALLF
FFSRINPFKY LSAIREAPIM AFSTASSMAT LPVSMRVVEE VGGVDKKNAS FVLPLGATIS
MDGTAAYLTI ATLYISHLSG VDLTLFEQLL LGVSVVALSV GVAALPSASL VMLIVILKQF
GLPLEYIALI IAVDRILDMA RTALNVTSDL VVAKIVDECL KRKNI