Gene Acid345_4187 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4187 
Symbol 
ID4072146 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4953425 
End bp4954588 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content61% 
IMG OID637986218 
ProductNAD(P)(+) transhydrogenase (AB-specific) 
Protein accessionYP_593261 
Protein GI94971213 
COG category[C] Energy production and conversion 
COG ID[COG3288] NAD/NADP transhydrogenase alpha subunit 
TIGRFAM ID[TIGR00561] NAD(P) transhydrogenase, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.37436 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.248235 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATTGTTG GTGTTCCCCG AGAGTCCTAT CCGGGGGAGC GGCGAGTTGC GCTGACCCCG 
GCAGTTGTTC CGAACCTGGC GAAAGCGGGC TTAGAAGTCG TTATCCAAGC GGGTGCCGGC
GCTGATGCCG GTTATCCCGA TGCCCTGTAT GTCGAAAAGG GTGCGAAGGT CCTGCCTGAC
CGCGCTTCCG TCTTTTCTTC CGCCGACACA ATTCTTCAAG TTCTTGGCGA CGGCGCCAAT
GACATTACCG GCGCGCAGGA CGTGCCGTTC TACCGCCGAG ACCAGGTGCT GGTCGGATTT
CTGCGTCCAC TTGGCACGAA AGAGGTCGTC CAGCAGATTG CCGACGCCGG CGTGACTTCA
TTCGCAGTCG AGTTGATGCC ACGCATTACC CGCGCGCAGA GCATGGACGC ACTCTCATCC
ATGGCAAACA TCGCCGGATA CAAGGCGGTG CTCATGGCCG CCGACCTCGT TCCGCGTGTT
TTCCCAATGA TGACCACTGC AGCAGGAACG ATTACGCCAG CTCGTGTGTT GATTATCGGC
GTAGGCGTTG CGGGGTTACA GGCGATTGCA ACCGCCCGTC GTCTTGGTGC AGTGGTGAGC
GCTTACGATG TCCGGCCCGC AGTGAAGGAG CAGGTGCAGT CGCTCGGCGC AAAGTTCGTC
GAACTGCCTC TTGAAACTAG CGACGCGCAG GATGAGCGCG GCTACGCGAA GGCGCAGGAC
GAGGCCTTCT ATCAAAAGCA GCGCGAACTG CTAGGTAAAG TTGTCGCCGA GAGCGACGTG
GTGATTACCA CCGCCGTGGT GCCGGGCAAG AAAGCGCCTT TGCTCGTTCC TGCCGGGATG
GTTCGTGGCA TGCAGCCGGG ATCCGTCGTC GTGGATCTCG CGGCAGAACG CGGTGGCAAC
TGCGAACTGA CGAAGCCTGC ACAAAATGTG GTCGAGAATG GCGTGACCAT CGTCGGGCAA
GTCAATGTCG CCAGCGGCGT GCCGTTCCAC GCGAGCCAGA TGTACGCCAA GAACCTGCTG
ACCTTCCTGC AATCGCAGAC CAAGGAAGGG AAGTTCCTCT ACGACATGAG CGACCAGGTC
ACCACTGACA CGCTGCTCAC GCGCGGAGGA CAGATCGTGA ACAAGCGCGT GCGCGAACAC
TTCGGTCTGC CGCCGCTCGC ATGA
 
Protein sequence
MIVGVPRESY PGERRVALTP AVVPNLAKAG LEVVIQAGAG ADAGYPDALY VEKGAKVLPD 
RASVFSSADT ILQVLGDGAN DITGAQDVPF YRRDQVLVGF LRPLGTKEVV QQIADAGVTS
FAVELMPRIT RAQSMDALSS MANIAGYKAV LMAADLVPRV FPMMTTAAGT ITPARVLIIG
VGVAGLQAIA TARRLGAVVS AYDVRPAVKE QVQSLGAKFV ELPLETSDAQ DERGYAKAQD
EAFYQKQREL LGKVVAESDV VITTAVVPGK KAPLLVPAGM VRGMQPGSVV VDLAAERGGN
CELTKPAQNV VENGVTIVGQ VNVASGVPFH ASQMYAKNLL TFLQSQTKEG KFLYDMSDQV
TTDTLLTRGG QIVNKRVREH FGLPPLA