Gene Information |
Locus tag | Caci_5220 |
Symbol | |
ID | 8336574 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 6002093 |
End bp | 6004264 |
Gene Length | 2172 bp |
Protein Length | 723 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 644958318 |
Product | hypothetical protein |
Protein accession | YP_003115920 |
Protein GI | 256394356 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1874] Beta-galactosidase |
TIGRFAM ID | |
|
|
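The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption that matches the numbers in this record) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 6002093
end_bp = 6004264

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A non-spliced CDS is read in consecutive triplets; the last triplet is the stop codon.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)      # 2172
print(protein_length)   # 723
```

Both results agree with the listed Gene Length (2172 bp) and Protein Length (723 aa).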
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.840911 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATGA GGGCTCTGTC CTTCCTGGCC GCCGGCACGC TCGCCGCCGT CGCCGGACTC GGTACCGCCT CCGCCGCCCC GGCGCCGTCC GCGACCGCCG CGGCGGCGCC GAGCCCGATG TGGGCCACCC AGCTGCAGTT CGACAACAAC GGGACGGCCT GGTCCCAGGC GAGCTTCGCG GCGTTGAAGG CCAAAGGCCT GACGACCGCG GAGATCGACA TGCCCTGGGG CACGATCGAG CCCTCGAAGG GCAGCTTCAG CTTCACCGAG CTGGATCAGG AGCTGGCGAA CGCCTCCGCG GCCGGGATCA AGCTGATACC GATCTTCTGG TCCTCCGGCT GGGGCGGCAG CCCGGCCTCC TGGGTCACCG GCCGCGAGGC CGACAGCACC GGGGCGAGCA GTCCCGCTCC CGTGTGGTGG GACCCGGTCA ATCAGCCCGC GTACTTCGAC TACGTCACCA AGACGGTCTC GCACATAGCC GCCAACGCCG GCTACGGCGG CAGCATCCTG GACTACGGAT TCCTCGACGC GCAGTGGGAC ATCAACGGCG GCGCCTCCGG GTGGGCTCCG GCCGACATCG CCGAGTTCCA CACCACCTAC CTGCCGAACA CCTACGGCAC CGTCGCGGCG TTCAACAGCA AGTATCAGAC CTCTTACGCC TCATTCAGCG CCGTCCCGGC CGCTGCCATC GGGCAGCCGC TATGGGGCGT TTACCAGGCG TTCCGAGCCT GGAGCGTGCA GGACACCTAC GGCCGCCTCA CCGCGGCGGT CCGCGCGGTC ACCGCCTCGA CGCCGCTGTA CTACTACTTC GGCGGGCACT TCGGGAACGC GGTGAACTAC GCCAACATCC CCGACATCTT CTTCAGCCTG GCCAAGCAGT ACTCGGCCAC GGTGATCGTC GACGCCGCGC AGTCCCCCGG CCTGGCGTTG ACCTTCGGCA GCCTGGCTCG CGCCTACGGC GTCCCGCTCG CGCAGGAGTG GACGGCTCCC AGCGACAGCA CGCAGCTGTC CGCGCAGGCG GTGCAGTGGA TGGCGAACTA CGCCATGGGC CTGCCGGAAG GCGGCGGCGA GGACTTCTTC ATCCACGACG GGACGCAGAA GGACGTCGTG GGCTGGCCGA TCTACACCTC CTGGCTGCCG TCGATGCAGC GCATCAGCGG CTCCTATCCG CAACAGCCGG TCGCCGTCTA CATGGACTTC TCCCAGGCCT ACGGCAACAC CGGCGGCGGC GCGGTCGGCA GCATGGAGGA CGCGATCTCC AACCTGTGGA ACGGCTACCA GGCCGGATTC GCGGTCGTCA CCAGCCAGGA GGTCGCCAAC GGGACCGTGA AGCTGTCCTC GTACAAGGCG ATCCTGCCGA TGAACGGCAC CGATGCGAAC CTCAGCGCCT ACCAAGCCGC CGGCGGCACG CTGCTGAGCA ACGGCTCGCA AATGGCTTCC TACTCCTCGG CCTACGCGAC GCTGGCCAAC ACCGGCGTGC TGCAGGTCGT GCCAGCCGTC GCCGCGAGCG GGACCGGCGC GACGGTGACG CTGGCGGACA TCACCTCGGG CACCGCCTAC AACGCCGCGG TGACCTTCAA GTTCGCAGGG CTGGGATTGG CAGCCGGGAG CTACCACGTC ACCGACGCCA GCGGGAACGC GGTACCGCAG AACCCTGTCA GCGGCGGGAT CTGCACGGCG CCGAACATCC AGCCGGCGCA GCTCGTGCAG TGGAACATCG TGGCCGGCGC GGCGCCGGCC GGCACGCCGG TTCCCGCGGC GTGCGGCGGC TCGGCGAGCC CGGTCATCAG CCTGCGAGCC 
CACGCGAACA ACGACATCGT GACGGCGGAC AACGCCGGAG CCAGCCCGCT GATCGCCAAC CGCACCGCGA TCGGTACCTG GGAGCAGTTC GACCTGATCA CCAACTCCGA CGGCAGCGTC AGCCTTCGCG CACACGCCAA CGGCGACATC GTCAGCGCCG ACAACGCCGG CGCCTCGCCG CTGATCGCCA ACCGGACCGC GATCGGCCAG TGGGAGTCCT TCGACCTGCT CACCAACGCC GACGGCAGCG TCAGCCTCCG GGCACACGCC AACGGCGACA TCGTCACGGC GGACAACGCC GGCGCCGCAG CGCTGATCGC CAACCGCACC GCCATCGGAC CCTGGGAGGA GTTCGACCTC ATCCACGACT GA
|
Protein sequence | MKMRALSFLA AGTLAAVAGL GTASAAPAPS ATAAAAPSPM WATQLQFDNN GTAWSQASFA ALKAKGLTTA EIDMPWGTIE PSKGSFSFTE LDQELANASA AGIKLIPIFW SSGWGGSPAS WVTGREADST GASSPAPVWW DPVNQPAYFD YVTKTVSHIA ANAGYGGSIL DYGFLDAQWD INGGASGWAP ADIAEFHTTY LPNTYGTVAA FNSKYQTSYA SFSAVPAAAI GQPLWGVYQA FRAWSVQDTY GRLTAAVRAV TASTPLYYYF GGHFGNAVNY ANIPDIFFSL AKQYSATVIV DAAQSPGLAL TFGSLARAYG VPLAQEWTAP SDSTQLSAQA VQWMANYAMG LPEGGGEDFF IHDGTQKDVV GWPIYTSWLP SMQRISGSYP QQPVAVYMDF SQAYGNTGGG AVGSMEDAIS NLWNGYQAGF AVVTSQEVAN GTVKLSSYKA ILPMNGTDAN LSAYQAAGGT LLSNGSQMAS YSSAYATLAN TGVLQVVPAV AASGTGATVT LADITSGTAY NAAVTFKFAG LGLAAGSYHV TDASGNAVPQ NPVSGGICTA PNIQPAQLVQ WNIVAGAAPA GTPVPAACGG SASPVISLRA HANNDIVTAD NAGASPLIAN RTAIGTWEQF DLITNSDGSV SLRAHANGDI VSADNAGASP LIANRTAIGQ WESFDLLTNA DGSVSLRAHA NGDIVTADNA GAAALIANRT AIGPWEEFDL IHD
|
| |
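The protein sequence can be related back to the gene sequence by translating codons. The sketch below is a minimal illustration, not the database's own method. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two tables differ in which codons may act as start codons, not in amino-acid assignments). `CODON_TABLE` and `translate` are hypothetical names introduced here, and only the first 30 and last 12 bases of the gene sequence are used, with the spacing removed.

```python
from itertools import product

# Codon table built in NCBI order (bases T, C, A, G).
# Assumption: table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks and the final bases of the gene sequence above.
gene_start = "ATGAAAATGA GGGCTCTGTC CTTCCTGGCC"
gene_end = "ATCCACGACT GA"

print(translate(gene_start))  # MKMRALSFLA
print(translate(gene_end))    # IHD
```

The two outputs match the first ten and last three residues of the listed protein sequence.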