Gene Caci_5629 details

Gene Information

Locus tag: Caci_5629
Symbol:
ID: 8336989
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 6489767
End bp: 6491047
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 69%
IMG OID: 644958733
Product: NADH dehydrogenase subunit D
Protein accession: YP_003116329
Protein GI: 256394765
COG category: [C] Energy production and conversion
COG ID: [COG0649] NADH:ubiquinone oxidoreductase 49 kD subunit 7
TIGRFAM ID: [TIGR01962] NADH dehydrogenase I, D subunit
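
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 6489767
end_bp = 6491047

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the final codon is the stop
# codon, which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # compare with "Gene Length"
print(protein_length, "aa")   # compare with "Protein Length"
```

Under these assumptions the computed values agree with the listed gene length of 1281 bp and protein length of 426 aa.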


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.0649561
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTACGACT GGGACGAAGT CCTGCGCGAT GCCGAACGCA CCCCCGGAGC CGGGGGCGAC 
GGCCAAGAGC GCCTCGTCGT CAACATGGGA CCGCAGCACC CCTCGACCCA CGGCGTGCTG
CGCCTGATCC TGGAGATCGA GGGCGAGAGC GTCATCGAGG CCCGCTGCGG CATCGGCTAC
CTGCACACCG GCATCGAGAA GAACCTGGAG TACCGGAACT GGACCCAGGC GGTGACCTTC
CTGACCCGCG CCGACTACCT GATGCCGCTG TTCAACGAGA CCGTCTACTG CCGCGCGGTC
GAGGCGCTGC TCGGCGTCGA GGACGACGTG CCGCCGCGCG CGAACGTCAT CAGGGTCCTG
CTGATGGAGC TGAACCGCAT CTCCTCGCAC CTGGTGGCCC TGGCCACCGG CGGCATGGAA
CTCGGCGCGA TGACGGTGAT GACCAACGGG TTCCGCGACC GCGAGCCGAT CCTGGACGTC
CTGGAGGCGG TCACGGGGAA CCGGATGAAC CACGCCTACG TCCGTCCCGG CGGGCTCGCG
CAGGACCTCC CCGACGGCGT CGTCGAGCAG ATCAGAGCCC TGATTGTCGA GTTCCCCAAA
CGGATCCTGG ACTACGAACG CCTGCTGAGC GCCAACCCGG TGTTCGTCAG ACGCACCAAG
GGCGTCGGGT ATCTGGACCT GCCGGGCTGT ATGGCGCTCG GCGTCACCGG CCCGGTGCTG
CGCGCCGCCG GACTCGCGCA CGACCTGCGC AAGTCGGACC CGTATCTGGG CTATGAGACC
TACGACTTCG AGGTACCGAC CGACACCGGC TGCGACGCCT ACGGCCGCTA TCTGGTGCGC
CTGCACGAGA TGACCGAATC GCTGCGCATC ATCGAGCAGG CCCTGGACCG CCTGGAACCC
GGACCGGTCA TGCTCGCCGA CCCGAAGATC GCCTGGCCGG CGCGCCTGTC GCTCGGCGGC
GACGGGCTGG GCAACTCCGA GGAGTGGATC CGCCACATCA TGGCGCAGTC GATGGAGGCG
CTGATCCACC ACTTCAAGCT GGTCACCGAG GGTTTCGTGG TGCCGGCCGG GCAGGTGTAC
TCCTTCGTGG AGTCCCCGCG CGGCGAACTC GGCGCGCACG TCGTCAGCGA CGGCGGCACA
CGGCCGTTCC GCGTGCACCT GCGCGACCCC TCGTTCACGC ACCTGCAGGC GGTGTCGGCG
ATGGCCGAGG GCGGCATGCT CGCGGACGTC GTGGCGGTGG TGGCCTCGGT GGATCCGGTG
CTGGGCGGCG CGGACCGCTG A
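
The GC content reported for this gene is the fraction of G and C bases in the sequence. A minimal sketch of that calculation follows, assuming the sequence is supplied as plain text with the space-separated 10-base blocks used above; `gc_fraction` is a hypothetical helper name, and only the first three blocks of the gene are used as input here.

```python
def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence, copied from the listing above.
fragment = "GTGTACGACT GGGACGAAGT CCTGCGCGAT"
print(f"{gc_fraction(fragment):.0%}")
```

Passing the complete gene sequence instead of the fragment gives the value to compare with the "GC content" field.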
 
Protein sequence
MYDWDEVLRD AERTPGAGGD GQERLVVNMG PQHPSTHGVL RLILEIEGES VIEARCGIGY 
LHTGIEKNLE YRNWTQAVTF LTRADYLMPL FNETVYCRAV EALLGVEDDV PPRANVIRVL
LMELNRISSH LVALATGGME LGAMTVMTNG FRDREPILDV LEAVTGNRMN HAYVRPGGLA
QDLPDGVVEQ IRALIVEFPK RILDYERLLS ANPVFVRRTK GVGYLDLPGC MALGVTGPVL
RAAGLAHDLR KSDPYLGYET YDFEVPTDTG CDAYGRYLVR LHEMTESLRI IEQALDRLEP
GPVMLADPKI AWPARLSLGG DGLGNSEEWI RHIMAQSMEA LIHHFKLVTE GFVVPAGQVY
SFVESPRGEL GAHVVSDGGT RPFRVHLRDP SFTHLQAVSA MAEGGMLADV VAVVASVDPV
LGGADR
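
The relationship between the two sequences can be illustrated with a short translation sketch. It assumes the codon-to-amino-acid assignments of translation table 11 match the standard genetic code, and that an alternative start codon such as GTG is read as methionine when it initiates the protein. `START_CODONS` below is a deliberately small, illustrative subset, and `translate` is a hypothetical helper rather than the tool used to produce this record.

```python
from itertools import product

# Standard codon assignments in TCAG order (assumed to apply to table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Illustrative subset of bacterial start codons (an assumption, not a full list).
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str, initiator: bool = True) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    If `initiator` is true and the first codon is a start codon, it is
    emitted as M regardless of its usual amino acid.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and initiator and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene: GTG is the start codon and is read as M.
print(translate("GTGTACGACT GGGACGAAGT CCTGCGCGAT"))
# Last 21 bases of the gene, translated as internal codons up to the stop.
print(translate("CTGGGCGGCG CGGACCGCTG A", initiator=False))
```

The first call yields the opening residues of the protein sequence (MYDWDEVLRD) and the second yields its final residues (LGGADR), with the terminal TGA acting as the stop codon.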