Gene Suden_1119 details


Gene Information

Locus tag: Suden_1119
Symbol:
ID: 3763668
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sulfurimonas denitrificans DSM 1251
Kingdom: Bacteria
Replicon accession: NC_007575
Strand:
Start bp: 1172301
End bp: 1173500
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 39%
IMG OID:
Product: tryptophan synthase subunit beta
Protein accession: YP_393632
Protein GI: 78777317
COG category:
COG ID:
TIGRFAM ID:
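
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1172301
end_bp = 1173500

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1200 399
```

Both values agree with the Gene Length and Protein Length fields listed in the record.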


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.00584813
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTACATTC CATCAGCTTC AAAATTTGAT CCAAAAAATG GTCACTTCGG CATCTTTGGC 
GGTAGATATG TACCTGAGAC TCTTATGCCT GCACTTTTAA AACTAGAACA AGAGTATGAA
AGTATCCGCT TTGACAAAGA TTTTTGGAGT GAAGTGGACT ACTATCTTGT AGATTATGTA
GGTCGCCCTT CCCCGCTCTA CTATGCAAAA AATATATCTG ATGAACTTGG TGCAAAAATC
TATCTAAAAA GAGAAGATTT AAACCATACA GGGGCACATA AAGTTAATAA CGTTATTGCT
CAAGGTCTTA TGGCAAAACG TCTTGGATAT AAAAAAATCA TAGCTGAAAC TGGAGCTGGT
CAACATGGAG TAGCAACTGC TACTATCTGC GCACTTCTAG ATTTAGAGTG TGAGATATTT
ATGGGTGCAA AAGATGTAGC TCGTCAGGAA CTTAACGTTT TTCGTATGAA ACTTCTTGGT
GCAAAAGTAA ATAGTGTCGA GAGCGGAAGC AAAACTCTAA AAGATGCTAT GAATGATGCA
ATCCGTCACT GGGTAACAAA TGCAAGAGAT ACTTTTTACA TTATCGGAAC AGTTGCAGGT
CCGCATCCAT ATCCTATGAT GGTTAGAGAT TTTCAAGCTA TTATCGGTTA TGAAGCAAGA
GCACAGATAC TTAAAAAAGA GGGTCGTTTA CCAGACCATG TTATAGCATG TATAGGCGGA
GGAAGCAACG CTATTGGTAT GTTTCAACAC TTTTTAGAAG ATAAAGAGGT TGAGTGTATT
GGTATAGAAG CTGGCGGTCA TGGTATAGAG ACACTGGAGC ATGGATGCTC ACTTGAGAAA
GGCAGAGCTG GAGTACTTCA TGGGCAGATG AGCTATCTTC TTCAAGATGA AGATGGGCAG
GTTCAAGAGG CATACTCTAT CTCAGCTGGA CTTGATTATC CTGGAATTGG ACCCGAACAT
GCGTTTCATT TTGAAAATAA AAGCGTAAGT TATAATCATG CAACAGATCA AGAAGCTCTA
GATGCATTTG TTTGGCTCTC ACGCAAGGAG GGAATTATTC CCGCATTTGA GAGCGCACAT
GCAGTAGCTT ACCTTAAAAA AATGCCAAAT ATAAAAAATA AACTTATCAT TGTTAACCTT
TCAGGCAGAG GCGACAAAGA TATGATTCAA GCAAAAAATA TATTAAATTT TGATAACTAA
 
Protein sequence
MYIPSASKFD PKNGHFGIFG GRYVPETLMP ALLKLEQEYE SIRFDKDFWS EVDYYLVDYV 
GRPSPLYYAK NISDELGAKI YLKREDLNHT GAHKVNNVIA QGLMAKRLGY KKIIAETGAG
QHGVATATIC ALLDLECEIF MGAKDVARQE LNVFRMKLLG AKVNSVESGS KTLKDAMNDA
IRHWVTNARD TFYIIGTVAG PHPYPMMVRD FQAIIGYEAR AQILKKEGRL PDHVIACIGG
GSNAIGMFQH FLEDKEVECI GIEAGGHGIE TLEHGCSLEK GRAGVLHGQM SYLLQDEDGQ
VQEAYSISAG LDYPGIGPEH AFHFENKSVS YNHATDQEAL DAFVWLSRKE GIIPAFESAH
AVAYLKKMPN IKNKLIIVNL SGRGDKDMIQ AKNILNFDN
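
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is not the method the database used. It assumes the space-separated 10-base blocks shown above and uses the standard codon assignments, which translation table 11 shares (table 11 differs from the standard code in the start codons it permits, not in the amino acid assignments). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the same ones.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna):
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(dna.split()).upper()


def translate(dna):
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence listed above.
first_blocks = "ATGTACATTC CATCAGCTTC AAAATTTGAT"
print(translate(first_blocks))  # MYIPSASKFD
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.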