Gene Caci_3691 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_3691 
Symbol 
ID8335044 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp4135359 
End bp4137014 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content71% 
IMG OID644956831 
Producthypothetical protein 
Protein accessionYP_003114434 
Protein GI256392870 
COG category 
COG ID 
TIGRFAM ID[TIGR03605] SagB-type dehydrogenase domain 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.392558 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGATTCG CCCACGAGTA CGCCACCGCC GTGGCCTGGC GCGGCAGGGT CCTGATGGAA 
CCCGCCGACT TCGTCCCGAA CTGGGCGGAC AAGCCACGGC GCGCCAAGTA CTACCCCGGC
GCGCTCGGTT TTCCGCTGCC GGACACCGAG GACGAGGCCG CGGCCAGCGT CCAGAAGGGA
CTGTTCGATC CGGCCGGTTC GCAGCCCTTC ACCCTGAGCC TGCTCGGCGG CATGCTGCGC
GACTCCTACG GACTGATCGG CCGCCGGCTC GGCGTGCAGG CGAACACCGA CCTGGCCGCG
CTGCCGTCGT ACAAGGACGC GAACTGGTCG CGGGGAACCG CCTCGGGCGG CGGCCTGTAC
CCGATCGGCG TCTACTGGTT GTCCGGTCCC TCCGGGCCGC TTCTCCCCGG CGCCTACCAC
TACTCGCCCG GCCATCACGC GATGCAGCGG CTCGTCGTGG GCGACCCGAC CGGCGAGGTG
CGGGCCGCGG TTGGCGACGA GGCGCTCACC GCGGACACCG ACCAGTTCCT CGTCCTGGGC
ATCAAGTTCT GGCAGAACGC CTTCAAATAC AACAGCTTCT CCTACCACGC GGTGACGATG
GACGTCGGCA CGGTGCTGCA GACGTGGCGG ATGTGGGCGG GAGCCAGAGG CCTGCGGATC
GACCCGCTCC TGTGGTTCGA CGAGCAGCGC CTGAGCCGCC TGCTCGGGGT GTCGACCGAG
GACGAAGGGC TGTTCGCGGT GGTTCCCGTG CGGTGGGACG CGCCGTCGGC GCCCACCGCC
GAGCCGGCGA CCGAGCGGCT GACTGAGCCG CCGAACGAGC GGCCGACCGA GCCGCCGATC
CAGGTGCGGC GCACCGACCA GGAGCGTTCT CGCACCGTCC TGACCTTCGA CACCATCCGC
CGGGTTCACG CCGCCACCAT CGAGCACGCG ACGCAGCGCC CCGACCGCCT GGCCCTGGAA
GCGGCCAGGG CTCACGCGCC CGATGAGCGG CGCGAGGCTG CGACGCTGCC CGAGCCGCGT
CCGCTCCAAG CCACCGTCCG CGCGGCTCTG CACGCCAGGC GCAGCAGCTT CGGACGGTTC
TCCGCGCAGC GGACGATCGC TGCGGACCAG CTCTCCGCCG TGCTCGCAGC CGCTGCCGCC
GGTGCCGCGC TGGAATGCGA CGTGACGAAG CCAGGAGGCG CCGAGCTGGT CAAGCTCTAC
GCGTTCGTCT CCCACGTCGA CCAGATCGCC CCGGCGAGTT ACGAGTACGA CCCGCAGGAA
GGTGCGCTGC GGATGGTCAA GCCGGGCGCG CCCGGCTCGT TCCTCCAGCG CAACTACTTC
CTCGCCAACT ACAACCTGGA ACAGGCCGCG GCCGTCCTGG TCCCCTCGGT GCGCACGCAC
GCCGTGCTCG ACGCGGTCGG CGACCGCGGC ATCCGGCTGG TGAACGCGCT GGTCGGGGCG
GTGGCGCAGG CGGTGTACAC CGCGAGCGCG GCGGCCGGCA TCGCCTGCGG CGTCGCCCTC
GGCTTCGACA CCATCTCCTA CATCGAAGAA CTCGATCTCC ACCAGGCCGG CGAGATCCCC
TTGCTGACCA TGATGATCGG CGCCGAGCGG CCGCGGCCGG CGGACTTCCG CCACGATTTC
GGCCCGCTCG GCCCTGTCCC GGGGAGCGTG CGGTGA
 
Protein sequence
MGFAHEYATA VAWRGRVLME PADFVPNWAD KPRRAKYYPG ALGFPLPDTE DEAAASVQKG 
LFDPAGSQPF TLSLLGGMLR DSYGLIGRRL GVQANTDLAA LPSYKDANWS RGTASGGGLY
PIGVYWLSGP SGPLLPGAYH YSPGHHAMQR LVVGDPTGEV RAAVGDEALT ADTDQFLVLG
IKFWQNAFKY NSFSYHAVTM DVGTVLQTWR MWAGARGLRI DPLLWFDEQR LSRLLGVSTE
DEGLFAVVPV RWDAPSAPTA EPATERLTEP PNERPTEPPI QVRRTDQERS RTVLTFDTIR
RVHAATIEHA TQRPDRLALE AARAHAPDER REAATLPEPR PLQATVRAAL HARRSSFGRF
SAQRTIAADQ LSAVLAAAAA GAALECDVTK PGGAELVKLY AFVSHVDQIA PASYEYDPQE
GALRMVKPGA PGSFLQRNYF LANYNLEQAA AVLVPSVRTH AVLDAVGDRG IRLVNALVGA
VAQAVYTASA AAGIACGVAL GFDTISYIEE LDLHQAGEIP LLTMMIGAER PRPADFRHDF
GPLGPVPGSV R