Gene Caci_2028 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_2028 
Symbol 
ID8333372 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp2296296 
End bp2298389 
Gene Length2094 bp 
Protein Length697 aa 
Translation table11 
GC content69% 
IMG OID644955178 
ProductBeta-galactosidase 
Protein accessionYP_003112789 
Protein GI256391225 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1874] Beta-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.119513 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCACGC GCGTAGAAGG ACTTTTGTAT GGAGCGGACT ACAACCCGGA GCAGTGGCCG 
GAGTCCGTCT GGAAGCAGGA CGTCAACCTG ATGCGGGACG CGGGCGTCAC GATGGTGACG
CTCGGGGTGT TCTCCTGGGC CCGGTTGCAG CCCGCCGAGG ACCAGTGGGA CTTCGAGTGG
CTGGACCGGC TGATGGACCT GTTCCACAGC CGGGGGATCG CCGTGGACCT GGCGACGGCG
ACGGCCTCGC CGCCGGCGTG GTTCGTGCGC CAGTACCCGC AGACCCTGCC GGTCACGGCG
GACGGCGTGC GCCTGGAGTT CGGCTCCCGG CAGCACTACT GTCCCTCCTC GCCGGTCTAC
CGCGAGGCGG CGACCCGGCT GGCCCGCACG ATCGCCGAAC GGTACGCCGA GCACCCGGCG
CTGGCGCTGT GGCACATCCA CAACGAGTAC GGCGACCACG TCGCCGAGTG CTTCTGCGAC
GTCTCGGCCG AGGACTTCCG CAGCTGGCTG CGCGAGCGCC ACGGCGACGA CATCGCGCAG
CTGAACTTCG CCTGGGGCAC CGAGTTCTGG TCCCAGCGCT ACTCGGACTT CGCCCAGATC
GAACCGCCGC GCACCGCGCC GGGCCCGATC AATCCGGGGC AGCTGCTGGA CTGGCGCCGC
TTCAGCTCCG ACGCCCTGCT CGCCTGCTTC CTGGCCGAGC GCTCCGTTCT GAAGGAGGTG
ACACCGGAGA TCCCGGTCAC GACCAACTTC ATGTCGATGA TGAAGGACCT CGACTACTGG
AAGTGGGCCG CCAATGAGGA CGTGGTCTCC GACGACGCGT ACCCGGACCC GGCCGACGGC
TCGGCCCATG TGCTCGCCGC GATGAACTAC GACCTCATGC GCTCCCTGGG CAACGGCCGG
CCGTGGCTGC TGATGGAGCA GGCGCCGTCG GCGGTGAGCT GGCGCGCGGT GAACGTGCCC
AAGACCCCGG ACCAGCGGCG CCTGTGGTCG CTGCAGACCG TGGCACGCGG CGCGGACGGC
GTCATGCACT TCCAGTGGCG CGCCTCGCGG GCCGGGGCGG AGAAGTTTCA CAGCGCCCTG
CTTCCCCACG GCGGAACCCA GTCGCGGGGG TGGCGCGAGA CGGTCCGGAT CGGCGAGGAG
CTGGCGCAGC TCAAGGAAGT CGCGGGGAGC CGGATCGAGA GCGCGGTCGC CATCGTCCTG
GACTGGAACT CCTGGTGGGC GCTGGAGGCC GAGGACCATC CCTCGGCGCG CGTGCGGCTC
AGGGAGCAGA TCCTGAGCTG GTACTCGGTG CTGCACCGCT GGAACCACGT CGTCGACTAC
GTGCCGCCCG ACGCGGATCT GAGCCCGTAC AAAGTGGTGC TCGCGCCGAA CCTGTACTCG
GTGAGCACCG AGAACGCCGC CCGGCTGACC GACTACGTGT ACGGCGGAGG GTACCTCGTC
GTCGGTTTCT TCAGCGGGAT CGTCGACGAG CTGGACCACA TTCATCAGGG CACTGATGGC
ACCGGCGGCG GGTATCCCGG ACCGCTGCGG GAGGTGCTCG GGGTCGCGGT CGACGAGTGG
TGGCCGATCG CGGACGGACG CTCGGTGCCG GTCACCTTCA GCGCCGACGA GGAGGCCGCG
AAGGCGAAGA AGAAGCACGT CAGCGCCAGC TCCGGCATCG GCTACCACCC GGCGCCGGCG
GCCGTGCGCT GGAGCGAGTG GCTCGGCACG ACCACCGCCA GATCAGTCGC GCACTACACC
GACGGACCGT TGAAGGGGCG TCCGGCGGTC ACCTGCAACG AGTTCGGCGA GGGCCGCGCC
TGGTACGTCA GCTGCGATCT GGGCGGGGAC ATCGAGAAGG TCCTCGGGGA GGCCGTCCGT
CCCGCCGTGG TGTGGCCGTC GCTGGCCTCG ACGCTGGCGT TCACCGGGGT CGAGGTGGTC
TGCCGCAGTT CGGAGACGCA CCACTACTAC TTCCTGCTCA ATCACAGCGA CAAGCCGGTC
GACCTGGGCT CGTCCCTGCC GTACGGCGCG GTGAACCTGC TGACCGGCAA TCGTCCCACC
CATCTGGCCG CGCAAGGCGT CGTGGTCCTG AAGGTAGGAA GGATAGGAAG GTGA
 
Protein sequence
MITRVEGLLY GADYNPEQWP ESVWKQDVNL MRDAGVTMVT LGVFSWARLQ PAEDQWDFEW 
LDRLMDLFHS RGIAVDLATA TASPPAWFVR QYPQTLPVTA DGVRLEFGSR QHYCPSSPVY
REAATRLART IAERYAEHPA LALWHIHNEY GDHVAECFCD VSAEDFRSWL RERHGDDIAQ
LNFAWGTEFW SQRYSDFAQI EPPRTAPGPI NPGQLLDWRR FSSDALLACF LAERSVLKEV
TPEIPVTTNF MSMMKDLDYW KWAANEDVVS DDAYPDPADG SAHVLAAMNY DLMRSLGNGR
PWLLMEQAPS AVSWRAVNVP KTPDQRRLWS LQTVARGADG VMHFQWRASR AGAEKFHSAL
LPHGGTQSRG WRETVRIGEE LAQLKEVAGS RIESAVAIVL DWNSWWALEA EDHPSARVRL
REQILSWYSV LHRWNHVVDY VPPDADLSPY KVVLAPNLYS VSTENAARLT DYVYGGGYLV
VGFFSGIVDE LDHIHQGTDG TGGGYPGPLR EVLGVAVDEW WPIADGRSVP VTFSADEEAA
KAKKKHVSAS SGIGYHPAPA AVRWSEWLGT TTARSVAHYT DGPLKGRPAV TCNEFGEGRA
WYVSCDLGGD IEKVLGEAVR PAVVWPSLAS TLAFTGVEVV CRSSETHHYY FLLNHSDKPV
DLGSSLPYGA VNLLTGNRPT HLAAQGVVVL KVGRIGR