Gene Caci_3784 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_3784 
Symbol 
ID8335137 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp4278930 
End bp4279958 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content70% 
IMG OID644956924 
Productaldo/keto reductase 
Protein accessionYP_003114527 
Protein GI256392963 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00710905 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCATCC CCCTGCGTCC GTTCGGACGT ACCGGCGTCA AGGTCAGCGC GCTCGCGCTG 
GGCACGATGA TGTTCGGACC CCGCGGCAAC CCGGATCACG ACGACAGCAT CCGGATCGTC
CACCGGGCCC TGGACGCCGG GATCAACCTG GTCGACACCG CGGACGTGTA CAGCCAGGGC
GAGTCCGAGA CGATCGTCGG CAAGGCCCTG GCCGGCCGCC GCGACAGCGT CTTCCTGGCC
ACGAAGTTCC ACGGCCGGAT GGGGGAGGAC GCCAACCGCT TCGGCAACTC CCGCCGGTGG
ATCGTCAAGG CGGTCGAGGA ATCGCTCGCG CGTCTCCAGA CCGACCACAT CGACCTGTAC
CAGGTGCACC GTCCCGAGGA CGACACCGAC ATCGACGAAA CCCTCGGCGC GCTGTCCGAT
CTCGTCCACC AGGGCAAGAT CCGCTACATC GGCACCTCGA CCTTCGAGCC GTCCGGCATC
GTGGAGGCCC AGTGGGTGGC CGAGAAGCGC GGCCGGGAGC GGGTGGTGGC CGAACAACCC
CCGTACTCGG TCCTCGCCCG CGGCATCGAG CGCGAGGTCC TGCCGGTCGC GCAGAAATAC
GGACTCGCGG TGATCCCCTG GAGCCCCCTG GCCGGAGGCT GGTTGACCGG CAAGTTCCGC
GTCGGCGCAG CGCAGCCCGA GACCAGCCGC GGCGCCCAGC AGGGCCGCTT CGAGATCGGC
GACCCGGCCA ACGCCGACAA ACTCCAGGCG GTCGAAGCCC TGGCCCTCCT CGCCGAGGAA
GCAGGCATCT CACTGGTCCA CCTGGCGCTG GCGTTCGTGC TGGAGCACCC GGCCGTCACG
GCGCCGATCA TCGGCCCCCG CACGTTCGAG CAGCTCGAAG GACAGCTCGA CGCGGCATCC
CTGCGGTTGG CACCGGACGT CCTCGACGAG ATCGACAAGA TCGTGCCCCC GGGCGTCACC
CTGTCCGCAC GCGATGCCGG ATATAACCCG CCGTCGGTCA CCGACTCCGC CCGTCGGCGC
CGTGCCTGA
 
Protein sequence
MSIPLRPFGR TGVKVSALAL GTMMFGPRGN PDHDDSIRIV HRALDAGINL VDTADVYSQG 
ESETIVGKAL AGRRDSVFLA TKFHGRMGED ANRFGNSRRW IVKAVEESLA RLQTDHIDLY
QVHRPEDDTD IDETLGALSD LVHQGKIRYI GTSTFEPSGI VEAQWVAEKR GRERVVAEQP
PYSVLARGIE REVLPVAQKY GLAVIPWSPL AGGWLTGKFR VGAAQPETSR GAQQGRFEIG
DPANADKLQA VEALALLAEE AGISLVHLAL AFVLEHPAVT APIIGPRTFE QLEGQLDAAS
LRLAPDVLDE IDKIVPPGVT LSARDAGYNP PSVTDSARRR RA