Gene Acid345_2293 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2293 
Symbol 
ID4073287 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2716574 
End bp2717695 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content63% 
IMG OID637984309 
Productpolysulphide reductase, NrfD 
Protein accessionYP_591368 
Protein GI94969320 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3301] Formate-dependent nitrite reductase, membrane component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.882646 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACC ACAAGCCGCT CGTCAGCATC GACAGCATTA ACCGTCGCGA GCGCCGTCTC 
GAAGAAATTC GCGCCGAAGC CGAACGGCAC CGGCAGGTCC AAGGATTCGG CGTACGGCCC
GAGGGCGCGC CGTTCCCCAT CGCGTCGCCC GAGGCCGGCT ACTACGGCAT CCCACTGCTC
AAGGAACCGC AGTGGACGTG GGAGATCCCG TTGTATTTCT TCGTCGGCGG GATCGCGGGA
GCCTCAGCCA TCATCGGAGC CGCAGCGCAC TGGAGCGGCA AAGACCTGCG CATTTCGCGT
GACTGCCGAT ATCTCGCAGC GGGTGGAGCG ATGCTCTCGA GCGCATTGCT CATCTCGGAT
CTCGGTCGTC CGGAGCGCTT CCTCAATATG CTGCGAGTGT TCAAGCCGCA GAGCCCGATG
TCCGTGGGCG CGTGGGTGCT GGCTGCGTTT GGTTCGTTCG CTGGAGCTTC GGCCTTTGCA
CAGTGGCTCG CCGACTTCAC CGAGATCCGC GGCATTCAGG TCGTCGGAGA TGCTGCCGAG
GGATTTGCCT GCCTCTTCGG CCTACCGCTT GCCACCTATA CGGGAGTGCT GCTCGGAGCC
ACCGCGATTC CAGTGTGGAA CGAGCATGTC ACCACGCTGC CAATCCACTT TGGTATGAGC
GGGCTGAACT CCGCTGTCGG AGCGTTGGAA CTGCTCGGTC ATGATCGCAG CCCCGCGCTC
CAAGCTTTAG GACTACTCGC TGCGACCGTC GAATCTGCGG AAGGCATTCG TTTGGAGACC
AATACCGATC GTGTGGCCGA GCCGGTGAAA CACGGTTCCA GCGGTTGGAT CATCCGCGCA
GGCGGCATGC TCTCCGGACC GATCCCGCTG GGACTGCGCC TCGCCTCGTT GTTCGTCAGC
CGCGAAAAAC GCCGCCGCAT GAGACGTATG GCCGCAGCGT CAAGCCTCGC CGGTTCGCTG
ATCACACGCT ACGCATGGGT GCATGCGGGA CATATTTCCG CACGCGATTG GCGGTTGCCT
CTGGAAATTG ACGCGCCCTT GGAGCAACCA CAATTGTCGC GAAGCGATGT GCCGCAATCC
AAGACCGTCG CACAGCCTCC GAAGAAAGCG GCCGGAGATT AG
 
Protein sequence
MTDHKPLVSI DSINRRERRL EEIRAEAERH RQVQGFGVRP EGAPFPIASP EAGYYGIPLL 
KEPQWTWEIP LYFFVGGIAG ASAIIGAAAH WSGKDLRISR DCRYLAAGGA MLSSALLISD
LGRPERFLNM LRVFKPQSPM SVGAWVLAAF GSFAGASAFA QWLADFTEIR GIQVVGDAAE
GFACLFGLPL ATYTGVLLGA TAIPVWNEHV TTLPIHFGMS GLNSAVGALE LLGHDRSPAL
QALGLLAATV ESAEGIRLET NTDRVAEPVK HGSSGWIIRA GGMLSGPIPL GLRLASLFVS
REKRRRMRRM AAASSLAGSL ITRYAWVHAG HISARDWRLP LEIDAPLEQP QLSRSDVPQS
KTVAQPPKKA AGD