Gene Caci_4083 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4083 
Symbol 
ID8335436 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp4610898 
End bp4612955 
Gene Length2058 bp 
Protein Length685 aa 
Translation table11 
GC content75% 
IMG OID644957186 
Producthypothetical protein 
Protein accessionYP_003114789 
Protein GI256393225 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0392788 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.0298659 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACCG AGCCGAAGTC CGTTCCCCCG CGCGCCGGGG GCGTGGGCGG CCGGGCTTCG 
GCCCGGCCCG CCGCCCGCGC GTTCGCCGCG CTCCTCGCCG CGATCGCGGC ACTGATGATC
GCGATCCCGG CGGCCTCTGC CGCGACCGCG CAGAAGGCGG CGGCACCGGT CGCCGCCGCC
CCGATCCGCG CCGATCCGGC ACCACAGCCG AGCCCGTCTC CGTCGGGTCC CGGCATCGTG
CTGGGGCCAG GCGCGCCCAC CCCGACCGAC ACCGTCCCGG CTCCCGGCGC ATCCGGCGGA
ACCAGCTCGA GCGGAGGCAA CAGCGACCCG AGTTTCTGGG ACATCCCCGG CCAGATCGAG
AAGGCGATCA ACGACTGGTT CGCCTCTCTG GTGACCGACG CCCTCAACCC GGTGCTGAAC
CTGCTGGGCT CCACCGTGCT GAGCACCCCG GACGTCACCA CGATGCCGCG GGTGGGCCAG
ATCTGGACCA CGATGGCGCT GCTGGCGAAC GGGTTCTACG TCCTGTTCGT CCTGGCCGGC
GGGGTCATCG TGATGACCCA CGAGACGCTG CAGACCCGGT ACGCGATGAA GGACATCGCC
CCGCGCCTGT TCGCCGGATT CCTGGTCGGC AACGCCAGCC TCGCGGTCCT CGGCCTGATC
ATCCATCTCG CCGACTCCCT CGCGGCCGGG ATCATGGGCC AGGGCCTGGA TCCCAAGGCG
GCCGGAACCG CCCTGGCCCA CATGATCACC GGGTCGATCC TGAACTCAGG CGGGATCTTC
CTGGCGATCC TCGGCCTGGT CGCCGCCGTC ATGGCCGTGG TGCTGCTCGT GACCTACATG
GTGCGGGTCG CGCTGATGGT GCTTCTCGCG GTGGCCGCGC CGGTCGCGCT GGCCTGCCAC
GCGCTGCCGC AGACCGACGG GATGGCCCGG CTGTGGTGGC GGGCGGTGAT CGGCACCCTG
GCGATCCAGC TGGGCCAGTC GCTCACGCTC GTCACCGCGC TGCGGATCTT CCTTGACCCC
GGCGGGCAGA CCGCGCTGGG GTTCCCGACC GGCGGCGGCC TGACCGACCT GCTCGTGTTC
ATCACCCTGT TCTTCATCCT GATCAAGATT CCGTTCTGGG TCGGCCGCAG CATCTTCGGC
CGCTCCCAGC TGCTGGCGAT GGCGAAAGGC ATCGCGACCT ACAAGCTCAT GGGCGCGGCC
GGTCTTCGAG GCCGGCGCCA GCGCTCCGGC GGACGGCCGT CCGCCGGTCG CGGTCCGGGC
CCGACGGGTC CCGGCGGGCG AGGCGGAGGC GGCAGGGGTC CGGGCGGGCC GCTGGTGCCC
GCAGGTCCGC GTCCCCGGCC GGGCGGCCGG GCGACGGTCA CCGTCACCCG CGTCCAGCCG
AACTCCCCAC GCGTCATCGC CGGAGAGGTC GTCGGCACCC GCGCACTCCC GGCGGTGGCC
GCCTCGCCCG GCCCGGTCGC AGCGCTGCCG ATGGGCACGC AGGACCGCCT TGCCAAGCGG
CAGGCCGAGA CCGCGAGCAA GCCCAAGCCG CGCGTGGTGC AGCCGGGGCT GTTCCCTCCG
GTGGCCAGGA CACCTCGCCC GGGTCCGCCC TCGACCCTGA CGGAGTTCGT CCCGCCCCCG
CCGCCGCGAT CCGCGCAGCG CCAGCCCGCG CTGTTCGACC GCAGCGGCCA GATCACCCCG
GCCGCATCTC CGGCCGCGCC GACCCCGCCC CCGAGCAAGC CCTCGACGAC ATCGCCAGCC
CGACGGCCCA GGCCCTCGTC CGGCCCGTCG ACCGGGGCGA CGGCCGCTGC TGCCGCAGCC
GCCGCCGCAG CGAGCAGCTC GCGCCGCACC GCACCGACAC GCAAGGCAGC CCCGGCCCGG
ACGCCGGCCG CAGCCTCCGC GAGCGGAGCG GCTGCCACGA GCAGCGCGCG GCCCAGACCA
GCGGCCAAGA CCCCCGCCGC CGCGACCCCA GCGCGCCCGG CTCCGGTCAA GGCCCCGACC
ACGATGGCCG CGCCGAAGAA CGCCTCCGGC CGAGGCGCCG CCGCACGCAA ACCCACGACA
CCAGGAGGTT CCCGTTGA
 
Protein sequence
MTTEPKSVPP RAGGVGGRAS ARPAARAFAA LLAAIAALMI AIPAASAATA QKAAAPVAAA 
PIRADPAPQP SPSPSGPGIV LGPGAPTPTD TVPAPGASGG TSSSGGNSDP SFWDIPGQIE
KAINDWFASL VTDALNPVLN LLGSTVLSTP DVTTMPRVGQ IWTTMALLAN GFYVLFVLAG
GVIVMTHETL QTRYAMKDIA PRLFAGFLVG NASLAVLGLI IHLADSLAAG IMGQGLDPKA
AGTALAHMIT GSILNSGGIF LAILGLVAAV MAVVLLVTYM VRVALMVLLA VAAPVALACH
ALPQTDGMAR LWWRAVIGTL AIQLGQSLTL VTALRIFLDP GGQTALGFPT GGGLTDLLVF
ITLFFILIKI PFWVGRSIFG RSQLLAMAKG IATYKLMGAA GLRGRRQRSG GRPSAGRGPG
PTGPGGRGGG GRGPGGPLVP AGPRPRPGGR ATVTVTRVQP NSPRVIAGEV VGTRALPAVA
ASPGPVAALP MGTQDRLAKR QAETASKPKP RVVQPGLFPP VARTPRPGPP STLTEFVPPP
PPRSAQRQPA LFDRSGQITP AASPAAPTPP PSKPSTTSPA RRPRPSSGPS TGATAAAAAA
AAAASSSRRT APTRKAAPAR TPAAASASGA AATSSARPRP AAKTPAAATP ARPAPVKAPT
TMAAPKNASG RGAAARKPTT PGGSR