Gene Acid345_1289 details


Gene Information

Locus tag: Acid345_1289
Symbol:
ID: 4071361
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1566613
End bp: 1567758
Gene Length: 1146 bp
Protein Length: 381 aa
Translation table: 11
GC content: 61%
IMG OID: 637983298
Product: NADH dehydrogenase I, D subunit
Protein accession: YP_590365
Protein GI: 94968317
COG category: [C] Energy production and conversion
COG ID: [COG0649] NADH:ubiquinone oxidoreductase 49 kD subunit 7
TIGRFAM ID: [TIGR01962] NADH dehydrogenase I, D subunit
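
The coordinates and lengths in this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the database record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1566613
end_bp = 1567758

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and ends with one stop codon.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1  # drop the terminal stop codon

print(gene_length)     # 1146
print(remainder)       # 0
print(protein_length)  # 381
```

Under these assumptions the computed values agree with the listed Gene Length (1146 bp) and Protein Length (381 aa).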


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.950756
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTACAG CAATCGAAGT AACGCCCAGC GATCTCGAGA ACAAAACGTT CCTGGATTCG 
AACGAGCTGG TCCTCAATAT GGGGCCGCAG CATCCGTCGA CGCACGGCGT GTTACGCGTG
ATTTTGAAGC TCGATGGTGA GCGGGTGCTG GGTACGGAGT GCGTGATCGG CTACCTGCAT
CGCGGGGTAG AGAAAATCGC TGAGAACCGC ACGTATGTGC AGTTCAACCC GTACGTGGAC
CGCATGGACT ACGTAGCGGC GGTAAGCAAC GGCCTGGGCT ATTGCGAGGC GGTCGAGAAG
CTGATCGATG TGCAGGCGCC GCCGCGGGCA CAGTTCTTGC GCGTGATCCT GACGGAACTG
AACCGGCTGG CGAGCCACAT GGTTTGGCTG GGCACGCACG CGCTCGACAT AGGTGCGCTG
ACGCCGCTGT TCTACACCTT CCGCGATCGC GAAGAAGTGC TGAAGATTTT CGAGAAGTAC
TGCGGCGCGC GGCTTACAAC GCACGCGTTC CGCATCGGCG GCTGCATTTA CGAGGCGTAC
GAAGGGTTCG AGAAGGACGT TCTCGATTAC TGCGACAAGC TTGGGCCGAG GATTGACGAG
TACGAAGGGC TGCTGACGGG CAATCGTATC TGGCTGCAAC GGACGAAGGG TATCGGGATT
TTGAATGCCG CAGATTGCAA GGCCTTGGGC GTGACTGGCC CGGTACTGCG CGCGGCTGGC
GTGAAGTGGG ATTTGCGGAA AGCACAGCCC TACGCGGCTT ACGACCAGGT GGACTTCGAC
GTGCCGACCG GCGAGAACGG CGACACCTTC GACCGGTACA TGGTTCGCAT GCAGGAGATG
CGACAGTCGA TCCGGATCCT GCGACAGGCG GTGGAGAAGC TGCCCGCGGG GCCGACGATG
GCGAAGGTTC CGAAAGTGCT CAAGCCGCCC GTCGGAAGGA TTTATCACTC GATCGAAGCG
CCCAAGGGCG AGCTTGGATA TTTCATTGTG AGCGACGGCT CGTTGCAGCC TTACCGCGTG
CGGGTGCGGC CTCCGAGCTT TGTGAACCTG CAGGCGCTGG ACAAGATGGT GCGCGGCGCG
CTGATCGCCG ATGTGGTGGC CGTGATCGGC ACACTCGACA TCGTGCTGGG TGAGGTGGAC
CGGTGA
 
Protein sequence
MSTAIEVTPS DLENKTFLDS NELVLNMGPQ HPSTHGVLRV ILKLDGERVL GTECVIGYLH 
RGVEKIAENR TYVQFNPYVD RMDYVAAVSN GLGYCEAVEK LIDVQAPPRA QFLRVILTEL
NRLASHMVWL GTHALDIGAL TPLFYTFRDR EEVLKIFEKY CGARLTTHAF RIGGCIYEAY
EGFEKDVLDY CDKLGPRIDE YEGLLTGNRI WLQRTKGIGI LNAADCKALG VTGPVLRAAG
VKWDLRKAQP YAAYDQVDFD VPTGENGDTF DRYMVRMQEM RQSIRILRQA VEKLPAGPTM
AKVPKVLKPP VGRIYHSIEA PKGELGYFIV SDGSLQPYRV RVRPPSFVNL QALDKMVRGA
LIADVVAVIG TLDIVLGEVD R
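
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on two short fragments copied from the record, assuming the codon assignments of the standard genetic code. NCBI translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs from it in the permitted start codons, which does not matter here because the gene begins with ATG. The `translate` helper and the constant names are hypothetical, written only for this illustration.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGAGTACAG CAATCGAAGT AACGCCCAGC"))  # MSTAIEVTPS

# Last 18 bases of the gene sequence, including the TGA stop codon.
print(translate("GGTGAGGTGGACCGGTGA"))  # GEVDR
```

Both fragments reproduce the corresponding ends of the listed protein sequence (MSTAIEVTPS at the start, GEVD R at the end).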