Gene Caci_6804 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_6804 
Symbol 
ID8338168 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp7856677 
End bp7857846 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content73% 
IMG OID644959893 
ProductPeptidoglycan-binding domain 1 protein 
Protein accessionYP_003117486 
Protein GI256395922 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.826004 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGCGGC CGTTCGTGAA GGCGCCGTCC GGCGGTTCGG GGGACGGGTC GGGCGACAAT 
CGGCGCGGAC GTCAGGCTTC TGGTGGCGGC CAGGGTCGCC AGCAGCCCGC CGGCCGCGGA
CGGTCGAACG GTTCCGGGTC CGGTTCCGGT CCCGGCGGCC AGGGCGGTCC GGGCGTCCCC
GGCCCGCGAC CGCCGCGCGA CAACGACCCC AGACAGGTCC CGGGCCTGGT CTCGAACCCG
ATGCCGCCGC CGCGCGCGGT GGCCCGGCAG TCCCCGCCGC GCCAGCCGGC TCCGGATATG
AGCGACCTGA CCACGACGAT GCCGATCCTG GCCGTGCCGG CTGACGACGG CTATGACGGC
TATGAAGGGC ACGACGGGCA TGCCGGGTAT GCCGCCGAAG ACGGGTATGA CGGATACGAC
GACCGCGACG GCTACCAGGA CTACGAGTAC GAGAACGCCC CGCCCGGCGA GCACGGGCAC
TACCACGACG ACGCCGACCT CGGCCGGCAC CGCGGCAACC GCCCGCCGCG CGCGCTGAAG
ATCGGCGCGA TCCTGGCCGG CGTCGCGGTG GTCAGCGTCG CGGCGTACAG CGTGCTCGGC
GGCGGGTCCA AGGCGCCCTC GGCGGGCCCG GTGGCGGCCG GGGCGAGCAC CGCCAGCGGC
GCGGCCGACG CCCCCGGCAC GCCCTCGGAC AGCAGCACGG CCGGCGCGCC GGCGCCGACC
GGCACGCCGT CGAGCTCCAA GTCGGCGTCG TCCACCTCCT CGACCAAGCC CTCGCCGACG
CACACCACGA AGTCCTCGCC GTCGAAGCCC ACGACGTCCT CGCACTCCTC CGCGCCGAGC
AGCAGCGCGC CGATCGCACC GCCGAGCAGC GTGCCGTCGA CGACCGCGAC CTCGGTGGCG
GCACCGCCGC CGACTTCCGC GTCGCCGACG TTCACCTCGC TCAAACTCGG TTCCAGCGGC
GCAGCAGTCA CGCAACTTCA ACAGAACCTC AGAAGATGGC AAAGATCTTC CTTCGGATGG
TCAACCATCC AAGTGAGCGG CAACTACGAC TCCGCGACTC AAGACGCAGT TCAGAGCTTC
CAGGACAACA ACCCTGGTAC GAGTCCGCCC GACCCCGCTG GCGTCTACGG CCCGGCGACC
GACCAGGCGT TGCGCAAGGC CGTGGGCTGA
 
Protein sequence
MVRPFVKAPS GGSGDGSGDN RRGRQASGGG QGRQQPAGRG RSNGSGSGSG PGGQGGPGVP 
GPRPPRDNDP RQVPGLVSNP MPPPRAVARQ SPPRQPAPDM SDLTTTMPIL AVPADDGYDG
YEGHDGHAGY AAEDGYDGYD DRDGYQDYEY ENAPPGEHGH YHDDADLGRH RGNRPPRALK
IGAILAGVAV VSVAAYSVLG GGSKAPSAGP VAAGASTASG AADAPGTPSD SSTAGAPAPT
GTPSSSKSAS STSSTKPSPT HTTKSSPSKP TTSSHSSAPS SSAPIAPPSS VPSTTATSVA
APPPTSASPT FTSLKLGSSG AAVTQLQQNL RRWQRSSFGW STIQVSGNYD SATQDAVQSF
QDNNPGTSPP DPAGVYGPAT DQALRKAVG