Gene Suden_2059 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSuden_2059 
Symbol 
ID3762665 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfurimonas denitrificans DSM 1251 
KingdomBacteria 
Replicon accessionNC_007575 
Strand
Start bp2149905 
End bp2151053 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content39% 
IMG OID 
Productdiheme cytochrome c SoxD 
Protein accessionYP_394568 
Protein GI78778253 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000173988 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTAAAT TAAATAATAA ATTAATTATT TTAGGGGTTT CTGTGGTTGC AAGCAGTGCT 
CTGTTTTTCA CAGGTTGTTT AGGTTCAAAT GCTTCTGCTG GAAACTCATC AGCAAAAGTA
AGCTCAGCAG CAAATGGTAT GTATAATCCA ACAAAAGACG CTCTTGATGG TGGTGTTACA
TACAAAAGAG AAAATGGTAT GTATGCTGCG TATGCTGTTA ATGACCAAGC TACAACTGGT
GTAAACTTTG GTAGAACACC AACTCCAAAT GAGCTAAAAG CATGGGATAC AGATATTATG
CCAGATGGTA CGGGCTTACC AGTTGGTAGC GGTACTGTTG ATGATGGTGA AGCACTTTAT
GATAAAGATT GTGCTGTTTG TCATGGTGAG TTTGGTGCAG GCGGTAAAGG TTACCCAACT
CTAACGGGTG GTTCTTTAAA ATCATTATCA AACCAAAGAA CTTGCCCTGG CAAAGATGCT
CCAAATAGAA CAATTGGTTC ATATTGGCCA CAAGCTAGTA CGTTGATTTG GTATATTCGT
GATGCAATGC CATATGCAAA CCCAAAAAGT TATACACCAG ATCAGATGTA TGCTATGACG
GCTTACTTGC TAAAAGAAAA TGGTGTTAAA ATAGATGGTG AAGATATTGA AGAGTTAAAT
CAAGATAACT TCAAAAAGAT AGTTATGCCA AATCGTGATG GATTTTATCC AAATATTGAT
GGACCAAATG GTGTAGAAAA TGTTAAAGCA TTCTACAAAG ATCCTAAGAA CTTTGGTGCA
GTTGGAGTAC GTTGTATGAC TAACTGTGGA AAAGAGAGTG TAGCAACAAT AGGAAACGAG
ATAACTGCGG TAGTACCTGC TTACTCTACT CTAAGAGATC TTCCACCAGA AAGCGCAAGT
GGACCAGTGT CAGAGGCTCA AAAGATATAT GAAAAATCAT GTGCAGTTTG CCACAAAACT
GACACTATGG GTGCACCTGC GCTTGGAGAC AAGAATGCTT GGGCAACCGT ATTAGAGCAA
GGTATAAACA TGGTAAATAA CAATGCAATC AATGGTATTG GCGGTATGCC TCCAAAGGGT
GGCGCTATGG ATTTAAGTGA CGACCAAGTC AAAGATGTTG TTAAATTTAT GGTAGAATCT
AGTAAGTAG
 
Protein sequence
MIKLNNKLII LGVSVVASSA LFFTGCLGSN ASAGNSSAKV SSAANGMYNP TKDALDGGVT 
YKRENGMYAA YAVNDQATTG VNFGRTPTPN ELKAWDTDIM PDGTGLPVGS GTVDDGEALY
DKDCAVCHGE FGAGGKGYPT LTGGSLKSLS NQRTCPGKDA PNRTIGSYWP QASTLIWYIR
DAMPYANPKS YTPDQMYAMT AYLLKENGVK IDGEDIEELN QDNFKKIVMP NRDGFYPNID
GPNGVENVKA FYKDPKNFGA VGVRCMTNCG KESVATIGNE ITAVVPAYST LRDLPPESAS
GPVSEAQKIY EKSCAVCHKT DTMGAPALGD KNAWATVLEQ GINMVNNNAI NGIGGMPPKG
GAMDLSDDQV KDVVKFMVES SK