Gene Acid345_2526 details


Gene Information

Locus tag: Acid345_2526
Symbol:
ID: 4069895
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 2982844
End bp: 2984178
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 60%
IMG OID: 637984543
Product: NADH dehydrogenase
Protein accession: YP_591601
Protein GI: 94969553
COG category: [C] Energy production and conversion
COG ID: [COG1252] NADH dehydrogenase, FAD-containing subunit
TIGRFAM ID:
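
The lengths listed above follow from the coordinates. The short sketch below checks that arithmetic, assuming the start and end positions are inclusive and that the final codon is an untranslated stop codon. The variable names are illustrative and not part of the record.

```python
# Coordinates copied from the gene record.
start_bp = 2982844
end_bp = 2984178

# Assumption: both coordinates are inclusive, so the length is end - start + 1.
gene_length = end_bp - start_bp + 1

# A coding sequence is read in codons of three bases.
codon_count, leftover_bases = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and contributes no amino acid.
protein_length = codon_count - 1

print(gene_length, codon_count, leftover_bases, protein_length)
```

Under these assumptions the computed values agree with the record: 1335 bp and 444 aa.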


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.383895
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGATA CGAGAAGCGC TCGCGTGGTA GTCATCGGCG GCGGCTTCGG TGGTCTCGAA 
ACCGTTTCCC ATCTCAAACA CCAACCGGTG CAGATCCTCC TGCTCGATCG CAAAAACCAT
CACACTTTCC AGCCACTGCT CTACCAGGTC GCGACCGCAG GGCTGTCGCC GGCGGAGATC
GCTGCACCGC TGCGTTCGAT CGTGTCGAAA CAGAAGAACA CTGAAGTCTT GCTCGCGGAA
GTCACCGGCT TCGATCTCGC GCGCAAGATC GTGCACACTG TGGACGCAGA AATCGAATAC
GACTATCTCG TCGTCGCTGC CGGAGCGCGG CATTCGTATT TTGCCCATGA CGAGTGGGAA
CCTCTCGCGC CCGGGCTCAA GACCGTCGAA GACGCTCTTG AGATTCGTCG TCGCGTGCTG
CTTGCGTTTG AAGAAGCAGA GCGCCAGGCC GCGCTGCACG GCCATCATCA GGACATCACG
TTCGCGATCG TGGGTGGCGG ACCCACCGGG GTCGAACTCG CCGGAACAAT TTCCGAAGTC
GCCCGCACCT CGCTCCCCCG TGACTTCCGC AACATCGATC CTACGCACAC GCGGGTGATT
CTCATCGAAG CCGGACCGCG AGTTCTCGCT GCCTTTCCGG AAGACCTCTC GGCAAGTGCG
GAACAACAGT TGCACAAACT CGGTGTGGAA GTCCGCACCA GCACAATGGT CACGCAGATC
CTTCCCCACG AAATCCATAT GGGCGATGTG GTGTTGCCGA CGTCGGTTAC GCTGTGGGCT
GCGGGTGTCT TTGCCTCTCC GCTCGGCCGC GCTTTAAGTG AACAAGTAGA CCGCGCCGGT
CGCGTTCCCG TGCAGCCCGA CCTCACGCTT CCCAATCATC CCGAAGTCTT CGTGATCGGC
GACCTCGCTA CCATCAAAAA CCCTGACGGC AAGCCGGTTC CCGGCGTCGC GCCCGCCGCC
ATGCAAATGG GACGCTTCGT CGCCAAAACC ATCAGCGATG ATCTCGCCCA TCGTCCGCGC
ACCAACTTCG TTTACAACGA CAAAGGTAAT CTCGCCACCA TCGGCCGCAA CGCCGCCGTC
GCCCAGTTCC CTGGGTTCAA ACTGACCGGT TACTTCGCGT GGCTGTCCTG GCTCTTCATC
CACATCTTGT TCCTCATCGG ATTCCGCAAT CGCCTGCTCG TAATGATCGA ATGGGCATGG
TCGTACCTCA CTTACAAGCG CAGCGCACGG CTCATCACGG ATGAAGTCGG CCGCCTCCGC
GAGCAACGAT CTTCGCCCGA CTCCCACAAT GCCGAAACGC CTCCTGTGGA AGTGATTGAA
AGGCGAAGCG CTTAG
 
Protein sequence
MTDTRSARVV VIGGGFGGLE TVSHLKHQPV QILLLDRKNH HTFQPLLYQV ATAGLSPAEI 
AAPLRSIVSK QKNTEVLLAE VTGFDLARKI VHTVDAEIEY DYLVVAAGAR HSYFAHDEWE
PLAPGLKTVE DALEIRRRVL LAFEEAERQA ALHGHHQDIT FAIVGGGPTG VELAGTISEV
ARTSLPRDFR NIDPTHTRVI LIEAGPRVLA AFPEDLSASA EQQLHKLGVE VRTSTMVTQI
LPHEIHMGDV VLPTSVTLWA AGVFASPLGR ALSEQVDRAG RVPVQPDLTL PNHPEVFVIG
DLATIKNPDG KPVPGVAPAA MQMGRFVAKT ISDDLAHRPR TNFVYNDKGN LATIGRNAAV
AQFPGFKLTG YFAWLSWLFI HILFLIGFRN RLLVMIEWAW SYLTYKRSAR LITDEVGRLR
EQRSSPDSHN AETPPVEVIE RRSA