Gene Caci_8501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_8501 
Symbol 
ID8339881 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp9860484 
End bp9861560 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content72% 
IMG OID644961588 
Productoxidoreductase domain protein 
Protein accessionYP_003119165 
Protein GI256397601 
COG category[R] General function prediction only 
COG ID[COG0673] Predicted dehydrogenases and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.17095 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGGGG GATCGCCGGC TGCCGAGCAG CCGGGAATCG CTGTGGTCGG TGCCGGTTAT 
TGGGGACCGA ACCTCGTGCG GAACCTGATG GGCTCGCCGG ACTGGGACCT GCGCTGGCTG
GTCGATCTGG ACACCGAGCG GGCCCGGAAG GTCGCCGCGC CGTACGCCAG TGTCCAGGTC
ACCAGCGCGC TGGATGAGGC GCTGGCCGAC CCGCGGGTCA AGGCGGTTGC GATCGCGACG
CCGGCGCGCA CGCACCGGGA CGTCGCCATG GCCGCGCTGC GCGCGGGGCG GCACGTCCTG
GTCGAAAAGC CGCTGGCCGC CACGGAAGCC GAGGGCGCCG AACTGGTCGC CGAGGCTGCC
AAGCGCGGCC TGATCCTGAT GTGCGACCAC ACCTACTGCT ACACCCCGGC CGCCCTGGCG
ATCAGGGAGC TGATCCACTC CGGCGAGCTC GGCGAGGTCC ACTTCGTCGA CTCGGTCCGG
ATCAACCTGG GACTGATCCA GCCGGACGTG GACGTGTTGT GGGACCTCGC CCCGCACGAC
CTGTCGATCC TGGACTTCAT CCTCCCGGAC ACCGTGAAGC CGGTCGCGGT GGCCGCGACC
GGCGCCGACC CGCTCGGCGC CGGACGGACC TGCGTCGCCT ACCTGACGCT GGCGCTGTCC
TCCGGCGCCA TCGCGCACGG CCACGTGAAC TGGCTGTCCC CGACCAAGGT GCGCACCATC
ACCGTCGGCG GCTCCAAGCG CACGCTGGTC TGGGACGACG TGAACCCCGC GCAGCGGGTC
AGCGTCTTCG ACCGCGGCGT GGACCTGGCC CGGCCGGAGG AACTCGGCGC GGACCAGCGG
CGCGCGGCGC TGGTGTCCTA CCGGACCGGC GACATGGTCG CTCCGGCGCT GAACGAGCGC
GAGGCATTGG CGGCTGCGGT CGAGGAGTTC GCCCGCGCGG TACGCACCGG AACGCCGGCC
GCGACCGACG GCCGCGCCGG TCTCCGAGTC CTGCGAATCC TCGAGGCCGC CTCGCGCAGC
CTCGCCGAGA ACGGAGCCCT CGTGGCTGTG AACGACGACG CCTTGGAGGG CGAATGA
 
Protein sequence
MSGGSPAAEQ PGIAVVGAGY WGPNLVRNLM GSPDWDLRWL VDLDTERARK VAAPYASVQV 
TSALDEALAD PRVKAVAIAT PARTHRDVAM AALRAGRHVL VEKPLAATEA EGAELVAEAA
KRGLILMCDH TYCYTPAALA IRELIHSGEL GEVHFVDSVR INLGLIQPDV DVLWDLAPHD
LSILDFILPD TVKPVAVAAT GADPLGAGRT CVAYLTLALS SGAIAHGHVN WLSPTKVRTI
TVGGSKRTLV WDDVNPAQRV SVFDRGVDLA RPEELGADQR RAALVSYRTG DMVAPALNER
EALAAAVEEF ARAVRTGTPA ATDGRAGLRV LRILEAASRS LAENGALVAV NDDALEGE