Gene Suden_2053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSuden_2053 
Symbol 
ID3763039 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfurimonas denitrificans DSM 1251 
KingdomBacteria 
Replicon accessionNC_007575 
Strand
Start bp2138986 
End bp2140188 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content37% 
IMG OID 
Productaromatic hydrocarbon degradation protein 
Protein accessionYP_394562 
Protein GI78778247 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000452049 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAGAA CAATTAAACT CGCAGTAGTA GCTGCATTAG CTTTGGGTAC AACTTCTGCA 
TTCGCAACAA ATGGTGACAC TATGATAGGT GTTGGTGCAA AAGCTATCGG TATGGGTGGT
GTTGGTATTG GTGTAAGCCA TGGTGCAGAA TCAGCATTAA CAAACCCTGC GATGATTACT
AATGTAGAGG GAACTGAAAT TTCTTTCGGT GGTACTATAT TTATGCCAGA TGTAAAAACT
AATATGGGAG ATGGATCAGG TTTTCATAAT AGTGATGCTG ATCTTTCAGT AATTCCTTCA
GTTGCAATTG CCCAAAAAGT ATCTAATAAT TTTTACTGGG GTATTGGTAT GTACGGGGTA
GCTGGTATGG GAACGGATTA TCGTGATGCT ACTGGCGGTA TGGCTAATAT GAATATGGTA
ACAAATTTAC AGTTAATGCA ATTTGTGGTT CCATTAGCAT ATAAAGCAAA CGGATTTAGT
CTTGGTATAG CTCCAATACT TCAATACGGT TCATTAGACA TTAATTATGA TATGAGTGCA
ATGTATATGG CTGCGCCTGG TACGAATATG TCAACTACAA GAGGCGTTGC ACAAGATTTT
GGTCTTGGTT ATAATGTTGG TGTAGCATAC GAAACAGCAG GCTTAACAGT TGGTGCTTCG
TATAAATCAA AAATTGATAT GGAATACAAA GGACAAATTA GTAGAGCAAT GAAAGATTTC
ACTGGATTTT TAGGTTCTGA TAGTTTAGAA CAACCAGCAG AAATTGGAGT TGGTGCATCA
TATAAGGTAA GCGGTAACAC TTTTGCAATA GATTATAAAC AAATTAAATG GTCTGATGCA
AAAGGGTATA AAGATTTTGC ATGGGATAAT CAAAATGTAA TTATGGTTGG TTATCAATAT
GCTCAAGATA ATTGGGCACT ACGTGCAGGT TATAATCATG CAAAAAGCCC AATAAAAGAT
CAAGGAATGG CAGGTTCATT ATCAAATGTA TTTAACCTTC TTGGTTTCCC AGCAATAGTT
GAGAGTCATT ATACTGTTGG TGCAAGCTAT GGTTTCAGCA AAATGACATC ACTTGATTTA
GCGTATGTTT ACTCACCAGA GGCAAGTGAG AATTATGCAT ATAGCTTAGG TGGTCCAGCA
ACTACTATTG AAACAAAACA TAGTCAATCA GCTGTTACTG CACAGCTTGA CTTTAAATTC
TAA
 
Protein sequence
MKRTIKLAVV AALALGTTSA FATNGDTMIG VGAKAIGMGG VGIGVSHGAE SALTNPAMIT 
NVEGTEISFG GTIFMPDVKT NMGDGSGFHN SDADLSVIPS VAIAQKVSNN FYWGIGMYGV
AGMGTDYRDA TGGMANMNMV TNLQLMQFVV PLAYKANGFS LGIAPILQYG SLDINYDMSA
MYMAAPGTNM STTRGVAQDF GLGYNVGVAY ETAGLTVGAS YKSKIDMEYK GQISRAMKDF
TGFLGSDSLE QPAEIGVGAS YKVSGNTFAI DYKQIKWSDA KGYKDFAWDN QNVIMVGYQY
AQDNWALRAG YNHAKSPIKD QGMAGSLSNV FNLLGFPAIV ESHYTVGASY GFSKMTSLDL
AYVYSPEASE NYAYSLGGPA TTIETKHSQS AVTAQLDFKF