Gene Information |
Locus tag | Caci_3741 |
Symbol | |
ID | 8335094 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 4220144 |
End bp | 4223467 |
Gene Length | 3324 bp |
Protein Length | 1107 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644956881 |
Product | alpha-L-rhamnosidase |
Protein accession | YP_003114484 |
Protein GI | 256392920 |
COG category | |
COG ID | |
TIGRFAM ID | |
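The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 4220144
END_BP = 4223467
GENE_LENGTH_BP = 3324
PROTEIN_LENGTH_AA = 1107


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks compare a derived value against the reported field.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the reported gene length (3324 bp) matches the coordinate span, and 3324 / 3 - 1 gives the reported protein length of 1107 aa.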
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00158847 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCTGATTC GGCGCCGCTT TGCCACCGCG CTCGCGCTGA TCCTCGCCTT TGCCGCCGTG ACGCCGGTCG CCTCCGCGGC GCCGTCGCCG TCGCTGTCGC TGTCGGCGTC CTCGGCCGGA ACCGAAGTGT CGGTCAGTGC TCTGCAGACC GATGCCACGA CGAACCCTCT TGGGATCGAC GACCCGCACC CGAGTCTGAG CTGGAAGCTG GCGTCACAGA TCAACGGCGA GTATCAGAGT GCTTATCGGA TCGTGGTCGC CGGCAGCCAG AGCGATGCGA GCGCCGGGGT GGGGAGCGTA TGGGACAGCG GTGAGGTGGC TTCGGGCAAC TCAGTGGGCA TCGCGTATGG CGGTCCGGCG CTGGCGAGTG AGAAGACCTA CTACTGGGCT GTGCAGATCT GGGACGAGCA CGGAACGGCT TCGGGCTGGA GCGCTCCGGC TCAGTGGGAG ATGGGTCTGC TCAATCCCGG CGATTGGCAG GGCGCGCAGT GGATCAGCCC GACGTCCGCG AGCGCCGCGT CGCCGTTGCT GCGTAAGGAC TTCACGCTGG CCAAGCCGGT GGCGAGGGCT CGGGCGTATG TCTTCGGGCT CGGCTTCTAC GAGCTGCATC TCAACGGAGG CAAGGTCGGC GATCGGGTTC TGGAGCCGGC GAGTACCCCT TACGCGCAGC GTGATCTGTA CGCCGCGTAT GACGTCACCG GCACCGTCAA GCAGGGCGGC AATGCGGTCG GGCTGTGGCT CGGCAACGGT TACGACGCGA ACTTCAGCCA GTACGGCTTC CGGTGGCTGG GACCGAAGCA GGCGATCGTG CTCATCGACG TCACCTTCAG CGATGGCACA CATCAGGCCG TCACCAGTGA CGGCAGTTGG ACGTGGTCGC CGAGTCCGAT CACCGCTGAC GGCATCTACA ACGGCGAGTC CTACGACGCC CGGATGGTGC AGCCTGGTTG GGACGCGGCC GGGTTCAGCG CGTCGTCGTT GCAGCCGGTG CGGACCGTCG CCGCGCCGGG CGGGTCGCTG GTGGCCGACG CGATGCCGCC GATCCGGGTG ACGCAGACGC TGAACGCGGT GAAGCTGACG TCGCCGTCGC CGGGTGTGTA CGTGTACGAC TTCGGTCAGA ACGTCTCGGG CTGGGAAGTG CTGCGTACGC AAGGCGCGGC GGGCTCGACG GTGAAGATGC AGACCGCTGA GGACCTGCTC GCGAACGGAA CGATCGATAC CACGACCAAC CGAAACGCGC TGTCCACGGA CTCTTATACG CTCGCGGGTA CGGGATCAAC GGAGACGTAC GAACCGCGCT TTACGTATCA CGGCTTCCGG TATGTGGAGG TCACCGGGGA TCCGCAGACT CCGGTCGTGA GCAGCCTCCA GGCGCGTGTT GTTCATGCGG ACCTGGCGTC GACCAGTACG TTCAGCTCAT CTGACGCGAC GCTGAATCAG ATATGGCAGA ACAACCAGCG GACCATGCTG AACAACCAGA TGAGCACGCC TACGGACAAT CCGGTGCGGG ATGAACGTAC GCCTCCGGGG ATGGACGTGC AGGCGTACCA CGATGCGTCG ACTGCCGAGT TCGGGATGGA CACCTACTAC GCCAACTATC TGCGGGACAT GCCGCCGGGG ACCGCGCTGC CCAGCGACGG CGGGAACGCT CAGCAGCCTG ATATGGGTGG CGACCAGGTG ACGCTGGCCT GGACGCTGTA TCAGCAGTAC GGCGATATCG CCACGCTCGC TGCGATGTAT CCGCTGATGA AGAAGTTCGT CGATACGAAC GCCACCAATG TGCCTGGCCA CATCTGGTCC 
ACCGGCTTTG GTGACTGGTG TCCGCCGGAC CACAGCTCGA ATGCCAACGG CGGTATGGGG AATCCGAGCG CCGGGGCGTG CACCAGCGAG GTGCCGATCG TCAATACGGC GCTGTCGTAT CTGCAGGCTT CGGACGTCGC CAAGGCTGCG ACGGCGCTGG GGCAGGCTGG GGACGCGGCG CACTACACGC AGCTGGCCGG CGATATCGCG AACGCTTTCA ACGCGGCGTT TCTGAACTCC GACCACGCGA GCTATGGGGA TGGACGCGAA ACGACGAGCA TTTTGCCGCT CGCGTTCGGG ATGGTGCCGG CCGCCGATGT GCCCGCGGTC GGGGCGCGGC TCGTGGACAC GGTCGTGCGT GGCAACGGCG GACACCTCGA TACCGGGATC TTCGGGACGC GGTATCTGAT GGACGCGCTG GCGGGCATCG GGCGTACCGA CCTTGCGATG ACGGTGCTGG ATCAGGGCAC GTATCCGGGC TACGGCTTCG AGATCGGCAA GGGGGCGACG ACGGACTGGG AGGAGTGGAC GTATGCCTCC TCGATGGAGT CCCATGATCA CGCGATGTTC GCCGGGATCA ATGCCTCCTT CGCCAGCAGG CTTGCGGGTA TCGAGCCGAC GGGCCCGGGT TACAGCACGG TCAGCATCGC GCCGCAGATT CCCGCCGGTC TTCAGAGCGT CGCGGCTTCT GTCAGCACGG TTCGGGGGAC GGTCGCCTCG TCGTGGTCGG TGAGCGGCGG CAAGGTCACG ATGGACGTCA CGGTTCCGGT CGGGGCGACG GCGAGCGTGC GGGTGCCGAG CTTCGGGCAG GGGAGCGTGA CGACGGTCAG CCCGGCGGGC GCCGGTGCGG TGCAGGTGTC CGCGGGCGGG ACTTCGAGCA CGTATACGGT CGGCTCCGGC AGCTGGGAGT TCACCGGCTC GCTCGTTCCG GCGTCGAGCA CCGTGCTGCC GGGGACGTGG ACTCAGTGTG CGGTCGAGAC CGGCAACTGC TCGGTCGCGG GCACCGAGAC GGTGGCGTTC GGCGCGCAGG GTAAGTACGA CTACGCGACC GTGAGTGGCA CCGTTGCCTG CTCCAACACG GTATTCGGCG ACCCGGACTC GGGCGTCTCC AAGGCCTGCT ATGCCGAGCC GGCGCCCGCG ACGACGCCCG GCGCGTGGCA ACAATGCGCG GCGGAGACGG CGACGTGCTC CTTCGCGGGC ACCGAGACCG TGGCATTCGG CGCCCAAGGC AAGTACACCT ACGCGACCTT GACCGGCGGC ACCCCTTGCA CCACCACGAT CTTCGGCGAC CCGGCGTACG GCATCCCCAA GTCCTGCTTC ATCGAAGCCC CACCCCCCGC CGCCACAACC TGGATCCCCT GCGCCGCCGA AACGGCCACC TGCACCCTCA CCGCCAGCCG CGACGTGGCC TTCGGCGCCC ACGGCGACTA TGCCTACCGC ACAGCGAACG CCCCGACAAC CTGCACCAAC GCAGTCTTCG GCGATCCGGC GCCGGGCGCG GTCAAGGCTT GCTACCTGCA GTGA
|
Protein sequence | MLIRRRFATA LALILAFAAV TPVASAAPSP SLSLSASSAG TEVSVSALQT DATTNPLGID DPHPSLSWKL ASQINGEYQS AYRIVVAGSQ SDASAGVGSV WDSGEVASGN SVGIAYGGPA LASEKTYYWA VQIWDEHGTA SGWSAPAQWE MGLLNPGDWQ GAQWISPTSA SAASPLLRKD FTLAKPVARA RAYVFGLGFY ELHLNGGKVG DRVLEPASTP YAQRDLYAAY DVTGTVKQGG NAVGLWLGNG YDANFSQYGF RWLGPKQAIV LIDVTFSDGT HQAVTSDGSW TWSPSPITAD GIYNGESYDA RMVQPGWDAA GFSASSLQPV RTVAAPGGSL VADAMPPIRV TQTLNAVKLT SPSPGVYVYD FGQNVSGWEV LRTQGAAGST VKMQTAEDLL ANGTIDTTTN RNALSTDSYT LAGTGSTETY EPRFTYHGFR YVEVTGDPQT PVVSSLQARV VHADLASTST FSSSDATLNQ IWQNNQRTML NNQMSTPTDN PVRDERTPPG MDVQAYHDAS TAEFGMDTYY ANYLRDMPPG TALPSDGGNA QQPDMGGDQV TLAWTLYQQY GDIATLAAMY PLMKKFVDTN ATNVPGHIWS TGFGDWCPPD HSSNANGGMG NPSAGACTSE VPIVNTALSY LQASDVAKAA TALGQAGDAA HYTQLAGDIA NAFNAAFLNS DHASYGDGRE TTSILPLAFG MVPAADVPAV GARLVDTVVR GNGGHLDTGI FGTRYLMDAL AGIGRTDLAM TVLDQGTYPG YGFEIGKGAT TDWEEWTYAS SMESHDHAMF AGINASFASR LAGIEPTGPG YSTVSIAPQI PAGLQSVAAS VSTVRGTVAS SWSVSGGKVT MDVTVPVGAT ASVRVPSFGQ GSVTTVSPAG AGAVQVSAGG TSSTYTVGSG SWEFTGSLVP ASSTVLPGTW TQCAVETGNC SVAGTETVAF GAQGKYDYAT VSGTVACSNT VFGDPDSGVS KACYAEPAPA TTPGAWQQCA AETATCSFAG TETVAFGAQG KYTYATLTGG TPCTTTIFGD PAYGIPKSCF IEAPPPAATT WIPCAAETAT CTLTASRDVA FGAHGDYAYR TANAPTTCTN AVFGDPAPGA VKACYLQ
|