Gene Caci_3858 details

Gene Information

Locus tag: Caci_3858
Symbol:
ID: 8335211
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 4367897
End bp: 4369939
Gene Length: 2043 bp
Protein Length: 680 aa
Translation table: 11
GC content: 62%
IMG OID: 644956990
Product: hypothetical protein
Protein accession: YP_003114593
Protein GI: 256393029
COG category:
COG ID:
TIGRFAM ID:
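
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (the record does not state this), and the function names are hypothetical, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4367897
end_bp = 4369939
gene_length_bp = 2043
protein_length_aa = 680


def span_length(start, end):
    """Length of a feature, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks compare derived values against the values listed in the record.
coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)
```

Under that assumption the listed gene length matches the coordinates, and the protein length matches the gene length once the stop codon is excluded.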


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0611595
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.0467338
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCTGA CCACGTCGCT GGCTGGCCGA GTCCGGAACA CGAGCCTGCC GAAGAGCCAT 
GCGCTCTTGC CGCTCCTTGA GGCTGTCGTT AACGGCATTC AGGCGATCGA CGCCCGATTC
GGCGACGACG TCGAGCGTGG TCGTCTGAGC GTCAGGATCC AGCGCAGCCA GCAGGAGGAA
CTCGACTTTG GTCCCGCTGG CCCTGGGCGT GTGGCGCTGA AGCCAATAGT CGCCTTCAGC
GTCGAGGACA ACGGGGTGGG CTTCACCTCG GCGAACATGA CGTCATTCGA GACGTTGGAC
AGCGACCACA AGGCCGCCAT CGGCTGCCGT GGTGTGGGAC GCCTGCTTTG GCTCAAGGCC
TTCGACAGGG TCTCGGTCTG CAGCGCCTAT GAGGACGAAG CCGGTGGCCT CCACGGACGG
CAGTTCCGGT TCTCCGTAGA GGGGGAGGTG GAGCTGGACG GGGAAGCGGA CGGCCTAAGC
AATGTCGGCA CGATCGTGAA CCTCGAAGGG TTCAAGAAGC CGTTCCAGCA GAACGCGGTG
AAGTCCGTCG ACGCCATCGC TCGCGAGACG TTCGAGCACT GTATCTGGTA CTTCCTCCGC
CCAGGCGGCG CGCCTGACAT TACGGTGACC GACGACGACG AGACGGTCTC GCTCAAGGAT
CTGATGGACG AATTTGTGTT CTCTACGATG TCGATAACTT CTATCGACGT CAAAGGTGAG
AAGTTCGATA TGATTAACCT TCGCCTCAAG TCCTCGACAC GCAGTCTCAC GCCACGGCTG
TACTGGTGCG CCGCGAGCCG CGTGGTGATG GAGGAGAACC TCACGAGCAA GGTGCCAGGG
CTTTACGGCC GACTCAAGGA CGAAGCATCC TCCCCATTCA CTTACGTCTG CTACCTATCT
TCCGGCTCTC TCGACAACCA CGTCCGGGCC GACCGCACAG GCTTCGACAT TGCCGAGCGT
GTGCCGGGCG CGACGCTGTT CGAAGACGTG TCGCTGGAGG ACATCCGCGA GGGCGTGCTC
AGGGAAGTCG AGAGTATCCT CGCCACCCCG CTCAGTGCAG CGCGCGAGGA AGGCAAGGTC
CGCGTCAACG AGTTCGTGAG CAACCGTGCG CCGAGGTACC GGCCAGTTCT GTCGCGGATC
GAGTCGCTCG GCGTGTCCGT GGATCCGTCC ATCAAGGACC ACGACCTTGA GTTGTTGCTG
CACAGCAGCC TACAGAAGCT CGAAGCCGCC GCGATTGCCG AGGGTCAGGC CGTCTTCGAT
GAAGCCGGCT CCTCTCGGTC GGATGACTAC GCCGAACGCC TCGCTCGGTA TCTGGACACG
GTGAAGGACA TCAACCAGTC CGACCTGGCC GCGTACGTCT CGCGCCGACG AGTGATCCTC
GACGTGCTCG CCAGACTGAT CAGGTCCGAC GACCACGGCA GGTACAGCAG GGAAGACGCC
ATCCACTCGC TGCTCATCCC GATGCGGGCC GACTCGAACG GGATTGGCAC CGACGCCTCG
AACCTGTGGA TCATCGACGA GGGGCTTGCG TTCCACGACT ACCTTGCCTC CGACAAGACG
CTCAAGAGCA TGCCGATTAC AGGATCCGAG TCCACGATGG AGCCCGATGT GCTCGCAACT
CGGCTCGTCG GCTCCCCGGT GCTGGCCTCA GAGGGCGAGT CGCTCCCGCT GCCGTCCATC
GTCGTAATCG AGATCAAGCG GCCGATGCGT AACGACGCGT CGGAGGACAA AGACCCGATC
CAGCAATGTC TGGAATATGT GAACCGTGTG CGCGCTGGTG GCGTGAAGAC CGCATCAGGG
CGGCAGATCC CCGAGACGCA TGAGGCGCCC GCTTTCTGCT ACGTCATCGC CGATCTCACG
CCGACGATGG TGCAGCGGTG TAAATATGCG AGCCTGCGTC CCACCCACGA CGGACTCGGT
TACTTCGGTT ACAACGAGCC GTACAAGGCA TACATCGAAG TGGTGAGCTT CGACCGTCTT
GTCAACGCGG CCACTGAGCG GAACCGAGCG TTCTTCGACA AATTGGGACT TCCGTCCAGT
TGA
 
Protein sequence
MALTTSLAGR VRNTSLPKSH ALLPLLEAVV NGIQAIDARF GDDVERGRLS VRIQRSQQEE 
LDFGPAGPGR VALKPIVAFS VEDNGVGFTS ANMTSFETLD SDHKAAIGCR GVGRLLWLKA
FDRVSVCSAY EDEAGGLHGR QFRFSVEGEV ELDGEADGLS NVGTIVNLEG FKKPFQQNAV
KSVDAIARET FEHCIWYFLR PGGAPDITVT DDDETVSLKD LMDEFVFSTM SITSIDVKGE
KFDMINLRLK SSTRSLTPRL YWCAASRVVM EENLTSKVPG LYGRLKDEAS SPFTYVCYLS
SGSLDNHVRA DRTGFDIAER VPGATLFEDV SLEDIREGVL REVESILATP LSAAREEGKV
RVNEFVSNRA PRYRPVLSRI ESLGVSVDPS IKDHDLELLL HSSLQKLEAA AIAEGQAVFD
EAGSSRSDDY AERLARYLDT VKDINQSDLA AYVSRRRVIL DVLARLIRSD DHGRYSREDA
IHSLLIPMRA DSNGIGTDAS NLWIIDEGLA FHDYLASDKT LKSMPITGSE STMEPDVLAT
RLVGSPVLAS EGESLPLPSI VVIEIKRPMR NDASEDKDPI QQCLEYVNRV RAGGVKTASG
RQIPETHEAP AFCYVIADLT PTMVQRCKYA SLRPTHDGLG YFGYNEPYKA YIEVVSFDRL
VNAATERNRA FFDKLGLPSS
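
The gene and protein listings above are printed in blocks of ten characters, so they need whitespace removed before use. The following is a minimal sketch, not the database's own tooling: it joins the blocks, computes GC content, and translates the DNA. The helper names are hypothetical. It relies on the standard codon assignments, which to my knowledge are the same ones used by translation table 11 (that table differs from the standard code in its permitted start codons, not in how codons map to amino acids).

```python
from itertools import product

# Standard codon assignments in TCAG order (assumed to apply to table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used to block the listing."""
    return "".join(text.split()).upper()


def gc_content(dna):
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate complete codons from position 0, stopping at a stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence and of the protein sequence from this record.
gene_line = "ATGGCGCTGA CCACGTCGCT GGCTGGCCGA GTCCGGAACA CGAGCCTGCC GAAGAGCCAT "
protein_line = "MALTTSLAGR VRNTSLPKSH"

print(translate(clean_sequence(gene_line)) == clean_sequence(protein_line))
```

Passing the full gene sequence through `clean_sequence` and then `gc_content` or `translate` gives values that can be compared against the GC content and protein sequence listed in the record.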