Gene Caci_5220 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_5220 
Symbol 
ID8336574 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp6002093 
End bp6004264 
Gene Length2172 bp 
Protein Length723 aa 
Translation table11 
GC content69% 
IMG OID644958318 
Producthypothetical protein 
Protein accessionYP_003115920 
Protein GI256394356 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1874] Beta-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.840911 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATGA GGGCTCTGTC CTTCCTGGCC GCCGGCACGC TCGCCGCCGT CGCCGGACTC 
GGTACCGCCT CCGCCGCCCC GGCGCCGTCC GCGACCGCCG CGGCGGCGCC GAGCCCGATG
TGGGCCACCC AGCTGCAGTT CGACAACAAC GGGACGGCCT GGTCCCAGGC GAGCTTCGCG
GCGTTGAAGG CCAAAGGCCT GACGACCGCG GAGATCGACA TGCCCTGGGG CACGATCGAG
CCCTCGAAGG GCAGCTTCAG CTTCACCGAG CTGGATCAGG AGCTGGCGAA CGCCTCCGCG
GCCGGGATCA AGCTGATACC GATCTTCTGG TCCTCCGGCT GGGGCGGCAG CCCGGCCTCC
TGGGTCACCG GCCGCGAGGC CGACAGCACC GGGGCGAGCA GTCCCGCTCC CGTGTGGTGG
GACCCGGTCA ATCAGCCCGC GTACTTCGAC TACGTCACCA AGACGGTCTC GCACATAGCC
GCCAACGCCG GCTACGGCGG CAGCATCCTG GACTACGGAT TCCTCGACGC GCAGTGGGAC
ATCAACGGCG GCGCCTCCGG GTGGGCTCCG GCCGACATCG CCGAGTTCCA CACCACCTAC
CTGCCGAACA CCTACGGCAC CGTCGCGGCG TTCAACAGCA AGTATCAGAC CTCTTACGCC
TCATTCAGCG CCGTCCCGGC CGCTGCCATC GGGCAGCCGC TATGGGGCGT TTACCAGGCG
TTCCGAGCCT GGAGCGTGCA GGACACCTAC GGCCGCCTCA CCGCGGCGGT CCGCGCGGTC
ACCGCCTCGA CGCCGCTGTA CTACTACTTC GGCGGGCACT TCGGGAACGC GGTGAACTAC
GCCAACATCC CCGACATCTT CTTCAGCCTG GCCAAGCAGT ACTCGGCCAC GGTGATCGTC
GACGCCGCGC AGTCCCCCGG CCTGGCGTTG ACCTTCGGCA GCCTGGCTCG CGCCTACGGC
GTCCCGCTCG CGCAGGAGTG GACGGCTCCC AGCGACAGCA CGCAGCTGTC CGCGCAGGCG
GTGCAGTGGA TGGCGAACTA CGCCATGGGC CTGCCGGAAG GCGGCGGCGA GGACTTCTTC
ATCCACGACG GGACGCAGAA GGACGTCGTG GGCTGGCCGA TCTACACCTC CTGGCTGCCG
TCGATGCAGC GCATCAGCGG CTCCTATCCG CAACAGCCGG TCGCCGTCTA CATGGACTTC
TCCCAGGCCT ACGGCAACAC CGGCGGCGGC GCGGTCGGCA GCATGGAGGA CGCGATCTCC
AACCTGTGGA ACGGCTACCA GGCCGGATTC GCGGTCGTCA CCAGCCAGGA GGTCGCCAAC
GGGACCGTGA AGCTGTCCTC GTACAAGGCG ATCCTGCCGA TGAACGGCAC CGATGCGAAC
CTCAGCGCCT ACCAAGCCGC CGGCGGCACG CTGCTGAGCA ACGGCTCGCA AATGGCTTCC
TACTCCTCGG CCTACGCGAC GCTGGCCAAC ACCGGCGTGC TGCAGGTCGT GCCAGCCGTC
GCCGCGAGCG GGACCGGCGC GACGGTGACG CTGGCGGACA TCACCTCGGG CACCGCCTAC
AACGCCGCGG TGACCTTCAA GTTCGCAGGG CTGGGATTGG CAGCCGGGAG CTACCACGTC
ACCGACGCCA GCGGGAACGC GGTACCGCAG AACCCTGTCA GCGGCGGGAT CTGCACGGCG
CCGAACATCC AGCCGGCGCA GCTCGTGCAG TGGAACATCG TGGCCGGCGC GGCGCCGGCC
GGCACGCCGG TTCCCGCGGC GTGCGGCGGC TCGGCGAGCC CGGTCATCAG CCTGCGAGCC
CACGCGAACA ACGACATCGT GACGGCGGAC AACGCCGGAG CCAGCCCGCT GATCGCCAAC
CGCACCGCGA TCGGTACCTG GGAGCAGTTC GACCTGATCA CCAACTCCGA CGGCAGCGTC
AGCCTTCGCG CACACGCCAA CGGCGACATC GTCAGCGCCG ACAACGCCGG CGCCTCGCCG
CTGATCGCCA ACCGGACCGC GATCGGCCAG TGGGAGTCCT TCGACCTGCT CACCAACGCC
GACGGCAGCG TCAGCCTCCG GGCACACGCC AACGGCGACA TCGTCACGGC GGACAACGCC
GGCGCCGCAG CGCTGATCGC CAACCGCACC GCCATCGGAC CCTGGGAGGA GTTCGACCTC
ATCCACGACT GA
 
Protein sequence
MKMRALSFLA AGTLAAVAGL GTASAAPAPS ATAAAAPSPM WATQLQFDNN GTAWSQASFA 
ALKAKGLTTA EIDMPWGTIE PSKGSFSFTE LDQELANASA AGIKLIPIFW SSGWGGSPAS
WVTGREADST GASSPAPVWW DPVNQPAYFD YVTKTVSHIA ANAGYGGSIL DYGFLDAQWD
INGGASGWAP ADIAEFHTTY LPNTYGTVAA FNSKYQTSYA SFSAVPAAAI GQPLWGVYQA
FRAWSVQDTY GRLTAAVRAV TASTPLYYYF GGHFGNAVNY ANIPDIFFSL AKQYSATVIV
DAAQSPGLAL TFGSLARAYG VPLAQEWTAP SDSTQLSAQA VQWMANYAMG LPEGGGEDFF
IHDGTQKDVV GWPIYTSWLP SMQRISGSYP QQPVAVYMDF SQAYGNTGGG AVGSMEDAIS
NLWNGYQAGF AVVTSQEVAN GTVKLSSYKA ILPMNGTDAN LSAYQAAGGT LLSNGSQMAS
YSSAYATLAN TGVLQVVPAV AASGTGATVT LADITSGTAY NAAVTFKFAG LGLAAGSYHV
TDASGNAVPQ NPVSGGICTA PNIQPAQLVQ WNIVAGAAPA GTPVPAACGG SASPVISLRA
HANNDIVTAD NAGASPLIAN RTAIGTWEQF DLITNSDGSV SLRAHANGDI VSADNAGASP
LIANRTAIGQ WESFDLLTNA DGSVSLRAHA NGDIVTADNA GAAALIANRT AIGPWEEFDL
IHD