Gene Caci_3741 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_3741 
Symbol 
ID8335094 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp4220144 
End bp4223467 
Gene Length3324 bp 
Protein Length1107 aa 
Translation table11 
GC content67% 
IMG OID644956881 
Productalpha-L-rhamnosidase 
Protein accessionYP_003114484 
Protein GI256392920 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00158847 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGATTC GGCGCCGCTT TGCCACCGCG CTCGCGCTGA TCCTCGCCTT TGCCGCCGTG 
ACGCCGGTCG CCTCCGCGGC GCCGTCGCCG TCGCTGTCGC TGTCGGCGTC CTCGGCCGGA
ACCGAAGTGT CGGTCAGTGC TCTGCAGACC GATGCCACGA CGAACCCTCT TGGGATCGAC
GACCCGCACC CGAGTCTGAG CTGGAAGCTG GCGTCACAGA TCAACGGCGA GTATCAGAGT
GCTTATCGGA TCGTGGTCGC CGGCAGCCAG AGCGATGCGA GCGCCGGGGT GGGGAGCGTA
TGGGACAGCG GTGAGGTGGC TTCGGGCAAC TCAGTGGGCA TCGCGTATGG CGGTCCGGCG
CTGGCGAGTG AGAAGACCTA CTACTGGGCT GTGCAGATCT GGGACGAGCA CGGAACGGCT
TCGGGCTGGA GCGCTCCGGC TCAGTGGGAG ATGGGTCTGC TCAATCCCGG CGATTGGCAG
GGCGCGCAGT GGATCAGCCC GACGTCCGCG AGCGCCGCGT CGCCGTTGCT GCGTAAGGAC
TTCACGCTGG CCAAGCCGGT GGCGAGGGCT CGGGCGTATG TCTTCGGGCT CGGCTTCTAC
GAGCTGCATC TCAACGGAGG CAAGGTCGGC GATCGGGTTC TGGAGCCGGC GAGTACCCCT
TACGCGCAGC GTGATCTGTA CGCCGCGTAT GACGTCACCG GCACCGTCAA GCAGGGCGGC
AATGCGGTCG GGCTGTGGCT CGGCAACGGT TACGACGCGA ACTTCAGCCA GTACGGCTTC
CGGTGGCTGG GACCGAAGCA GGCGATCGTG CTCATCGACG TCACCTTCAG CGATGGCACA
CATCAGGCCG TCACCAGTGA CGGCAGTTGG ACGTGGTCGC CGAGTCCGAT CACCGCTGAC
GGCATCTACA ACGGCGAGTC CTACGACGCC CGGATGGTGC AGCCTGGTTG GGACGCGGCC
GGGTTCAGCG CGTCGTCGTT GCAGCCGGTG CGGACCGTCG CCGCGCCGGG CGGGTCGCTG
GTGGCCGACG CGATGCCGCC GATCCGGGTG ACGCAGACGC TGAACGCGGT GAAGCTGACG
TCGCCGTCGC CGGGTGTGTA CGTGTACGAC TTCGGTCAGA ACGTCTCGGG CTGGGAAGTG
CTGCGTACGC AAGGCGCGGC GGGCTCGACG GTGAAGATGC AGACCGCTGA GGACCTGCTC
GCGAACGGAA CGATCGATAC CACGACCAAC CGAAACGCGC TGTCCACGGA CTCTTATACG
CTCGCGGGTA CGGGATCAAC GGAGACGTAC GAACCGCGCT TTACGTATCA CGGCTTCCGG
TATGTGGAGG TCACCGGGGA TCCGCAGACT CCGGTCGTGA GCAGCCTCCA GGCGCGTGTT
GTTCATGCGG ACCTGGCGTC GACCAGTACG TTCAGCTCAT CTGACGCGAC GCTGAATCAG
ATATGGCAGA ACAACCAGCG GACCATGCTG AACAACCAGA TGAGCACGCC TACGGACAAT
CCGGTGCGGG ATGAACGTAC GCCTCCGGGG ATGGACGTGC AGGCGTACCA CGATGCGTCG
ACTGCCGAGT TCGGGATGGA CACCTACTAC GCCAACTATC TGCGGGACAT GCCGCCGGGG
ACCGCGCTGC CCAGCGACGG CGGGAACGCT CAGCAGCCTG ATATGGGTGG CGACCAGGTG
ACGCTGGCCT GGACGCTGTA TCAGCAGTAC GGCGATATCG CCACGCTCGC TGCGATGTAT
CCGCTGATGA AGAAGTTCGT CGATACGAAC GCCACCAATG TGCCTGGCCA CATCTGGTCC
ACCGGCTTTG GTGACTGGTG TCCGCCGGAC CACAGCTCGA ATGCCAACGG CGGTATGGGG
AATCCGAGCG CCGGGGCGTG CACCAGCGAG GTGCCGATCG TCAATACGGC GCTGTCGTAT
CTGCAGGCTT CGGACGTCGC CAAGGCTGCG ACGGCGCTGG GGCAGGCTGG GGACGCGGCG
CACTACACGC AGCTGGCCGG CGATATCGCG AACGCTTTCA ACGCGGCGTT TCTGAACTCC
GACCACGCGA GCTATGGGGA TGGACGCGAA ACGACGAGCA TTTTGCCGCT CGCGTTCGGG
ATGGTGCCGG CCGCCGATGT GCCCGCGGTC GGGGCGCGGC TCGTGGACAC GGTCGTGCGT
GGCAACGGCG GACACCTCGA TACCGGGATC TTCGGGACGC GGTATCTGAT GGACGCGCTG
GCGGGCATCG GGCGTACCGA CCTTGCGATG ACGGTGCTGG ATCAGGGCAC GTATCCGGGC
TACGGCTTCG AGATCGGCAA GGGGGCGACG ACGGACTGGG AGGAGTGGAC GTATGCCTCC
TCGATGGAGT CCCATGATCA CGCGATGTTC GCCGGGATCA ATGCCTCCTT CGCCAGCAGG
CTTGCGGGTA TCGAGCCGAC GGGCCCGGGT TACAGCACGG TCAGCATCGC GCCGCAGATT
CCCGCCGGTC TTCAGAGCGT CGCGGCTTCT GTCAGCACGG TTCGGGGGAC GGTCGCCTCG
TCGTGGTCGG TGAGCGGCGG CAAGGTCACG ATGGACGTCA CGGTTCCGGT CGGGGCGACG
GCGAGCGTGC GGGTGCCGAG CTTCGGGCAG GGGAGCGTGA CGACGGTCAG CCCGGCGGGC
GCCGGTGCGG TGCAGGTGTC CGCGGGCGGG ACTTCGAGCA CGTATACGGT CGGCTCCGGC
AGCTGGGAGT TCACCGGCTC GCTCGTTCCG GCGTCGAGCA CCGTGCTGCC GGGGACGTGG
ACTCAGTGTG CGGTCGAGAC CGGCAACTGC TCGGTCGCGG GCACCGAGAC GGTGGCGTTC
GGCGCGCAGG GTAAGTACGA CTACGCGACC GTGAGTGGCA CCGTTGCCTG CTCCAACACG
GTATTCGGCG ACCCGGACTC GGGCGTCTCC AAGGCCTGCT ATGCCGAGCC GGCGCCCGCG
ACGACGCCCG GCGCGTGGCA ACAATGCGCG GCGGAGACGG CGACGTGCTC CTTCGCGGGC
ACCGAGACCG TGGCATTCGG CGCCCAAGGC AAGTACACCT ACGCGACCTT GACCGGCGGC
ACCCCTTGCA CCACCACGAT CTTCGGCGAC CCGGCGTACG GCATCCCCAA GTCCTGCTTC
ATCGAAGCCC CACCCCCCGC CGCCACAACC TGGATCCCCT GCGCCGCCGA AACGGCCACC
TGCACCCTCA CCGCCAGCCG CGACGTGGCC TTCGGCGCCC ACGGCGACTA TGCCTACCGC
ACAGCGAACG CCCCGACAAC CTGCACCAAC GCAGTCTTCG GCGATCCGGC GCCGGGCGCG
GTCAAGGCTT GCTACCTGCA GTGA
 
Protein sequence
MLIRRRFATA LALILAFAAV TPVASAAPSP SLSLSASSAG TEVSVSALQT DATTNPLGID 
DPHPSLSWKL ASQINGEYQS AYRIVVAGSQ SDASAGVGSV WDSGEVASGN SVGIAYGGPA
LASEKTYYWA VQIWDEHGTA SGWSAPAQWE MGLLNPGDWQ GAQWISPTSA SAASPLLRKD
FTLAKPVARA RAYVFGLGFY ELHLNGGKVG DRVLEPASTP YAQRDLYAAY DVTGTVKQGG
NAVGLWLGNG YDANFSQYGF RWLGPKQAIV LIDVTFSDGT HQAVTSDGSW TWSPSPITAD
GIYNGESYDA RMVQPGWDAA GFSASSLQPV RTVAAPGGSL VADAMPPIRV TQTLNAVKLT
SPSPGVYVYD FGQNVSGWEV LRTQGAAGST VKMQTAEDLL ANGTIDTTTN RNALSTDSYT
LAGTGSTETY EPRFTYHGFR YVEVTGDPQT PVVSSLQARV VHADLASTST FSSSDATLNQ
IWQNNQRTML NNQMSTPTDN PVRDERTPPG MDVQAYHDAS TAEFGMDTYY ANYLRDMPPG
TALPSDGGNA QQPDMGGDQV TLAWTLYQQY GDIATLAAMY PLMKKFVDTN ATNVPGHIWS
TGFGDWCPPD HSSNANGGMG NPSAGACTSE VPIVNTALSY LQASDVAKAA TALGQAGDAA
HYTQLAGDIA NAFNAAFLNS DHASYGDGRE TTSILPLAFG MVPAADVPAV GARLVDTVVR
GNGGHLDTGI FGTRYLMDAL AGIGRTDLAM TVLDQGTYPG YGFEIGKGAT TDWEEWTYAS
SMESHDHAMF AGINASFASR LAGIEPTGPG YSTVSIAPQI PAGLQSVAAS VSTVRGTVAS
SWSVSGGKVT MDVTVPVGAT ASVRVPSFGQ GSVTTVSPAG AGAVQVSAGG TSSTYTVGSG
SWEFTGSLVP ASSTVLPGTW TQCAVETGNC SVAGTETVAF GAQGKYDYAT VSGTVACSNT
VFGDPDSGVS KACYAEPAPA TTPGAWQQCA AETATCSFAG TETVAFGAQG KYTYATLTGG
TPCTTTIFGD PAYGIPKSCF IEAPPPAATT WIPCAAETAT CTLTASRDVA FGAHGDYAYR
TANAPTTCTN AVFGDPAPGA VKACYLQ