Gene Caci_4420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4420 
Symbol 
ID8335774 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5020627 
End bp5022381 
Gene Length1755 bp 
Protein Length584 aa 
Translation table11 
GC content70% 
IMG OID644957523 
ProductBeta-galactosidase 
Protein accessionYP_003115125 
Protein GI256393561 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1874] Beta-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.653544 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.574053 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCAGTAC TCGACATCAC CGGCGACGGC TTCAGCCTCG ACGGTCAGCC CTTCCGGATC 
GTCTCCGGCG GCCTGCACTA TTTCCGAGTC CATCCGGCGC AGTGGTCCGA CCGGCTGCGC
AAGGCCCGCC TGATGGGCCT GAACACCATC GACACCTACA TCCCGTGGAA CCTGCACGAG
CGGCGCCCCG GCACGTTCGA CTTCGGCGGG ATCCTGGACC TGGCGGCGTT CCTGGACGCC
GCCGCCGCCG AAGGGCTGCA CGTCCTGCTG CGGCCCGGGC CGTACATCTG CGGGGAGTGG
GAGGGCGGCG GGCTGCCGTC GTGGCTGCTC GCCGACCCGG ATCTGGCGCT GCGCAGCACC
GATCCGGCGT TCCTGCAGGC GGTCGAGGCG TACCTCGACG CGATCATGCC GATCGTGCTG
CCCCGGCTGG GGACGCGCGG CGGACCGGTC ATCGCCGTGC AGGTGGAGAA CGAGTACGGG
GCGTACGGCT CCGACACCGC CTATATGGAG CGGCTGTACG AGGCGCTGAC GTCGCGGGGT
ATCGACGTAC CCTTCTTCAC CTCCGACCAG CCCAACGACC TGGCGGACGG CGCGCTGCCC
GGCGTCCTTG CCACCGCGAA CTTCGGCGGC AAGGTGACCG CCTCGCTCGC GGCACTGCGT
GCGCAGCAGC CGACCGGACC GCTGATGTGC GCGGAGTTCT GGAACGGCTG GTTCGACTAC
TGGGGCGGCA CGCACGCGCA GCGCTCCGCC GAGGACGCCG GCGCCGCGCT GGAGGAGATG
CTGCAAGCCG GCGCTTCGGT GAACTTCTAC ATGTTCCACG GCGGCACCAA CTTCGGATTC
ACCAACGGCG CCAACGACAA GGGGACGTAC CGCGCCACGG TCACGTCCTA CGACTACGAC
TCGCCGCTGG ACGAAGCCGG GGACCCGACG GAGAAGTACC GGCGCTTCCG CTCCATCATC
GGCAAGTACG AGACGGTGCC GGACGAGGAA GTCCCGGAGC CGGGGGAGAA GCTGGCGCCG
GTCTCGGTGG CTCTGACCGG GCGCGCGGCG TTGTTCTCCG AGGCGAGTTT GGCTTCCTTG
GGCGTGGCGC AGAACTCTGA GACACCGCTG ACGATGGAGC TGCTCGGTCA GGACTTCGGT
TTCGTGCTCT ACGAAACCCG GCTTCCCGCG GCGGGTCCGG CGACGCTGAC GTTCGACGAG
ATCGGCGACC GCGCGCAGGT GTTCGTCGAC GGTCAGCCGG TCGGCGTGCT GGAGCGCGAG
CGGCACGAGC ATGTGCTGTC GTTCCTGGTG CCGCGCGCCG ATGCGCAGCT GCGCGTGCTA
GTGGAGAACC AGGGTCGGGT GAACTACGGC CAGAAGCTCG CCGATCGCAA GGGTCTGATA
GGCGCGGTCC ATCTCGACGG CGCGCCGCTC ACCGGCTGGA CTTCGCGTCC GCTGCCGCTG
GACGACCTGA CCGGGCTGGC CTACGCCGAG CTCGACGGCC CGGCGGTCGG ACCCGGCTTC
CACCGAGGCA CGTTCGACCT CGACCGATGC GCGGACACCT ACCTGCACCT GCCCGGCTGG
ACCAAGGGCG TGGCCTGGAT CAACGGCTTC AACCTGGGTC GCTACTGGTC GCGCGGCCCG
CAGGGGTCGT TGTACGTGCC CGGACCGGTG CTGCGTGCCG GAACGAACGA GCTGGTCGTC
CTCGAGCTGC ACGGCGCGCG CGCCGCGGCG GCCGAGCTGC GGCCGGTCCC GGATTTGGGA
CCGACGGAGC TGTGA
 
Protein sequence
MAVLDITGDG FSLDGQPFRI VSGGLHYFRV HPAQWSDRLR KARLMGLNTI DTYIPWNLHE 
RRPGTFDFGG ILDLAAFLDA AAAEGLHVLL RPGPYICGEW EGGGLPSWLL ADPDLALRST
DPAFLQAVEA YLDAIMPIVL PRLGTRGGPV IAVQVENEYG AYGSDTAYME RLYEALTSRG
IDVPFFTSDQ PNDLADGALP GVLATANFGG KVTASLAALR AQQPTGPLMC AEFWNGWFDY
WGGTHAQRSA EDAGAALEEM LQAGASVNFY MFHGGTNFGF TNGANDKGTY RATVTSYDYD
SPLDEAGDPT EKYRRFRSII GKYETVPDEE VPEPGEKLAP VSVALTGRAA LFSEASLASL
GVAQNSETPL TMELLGQDFG FVLYETRLPA AGPATLTFDE IGDRAQVFVD GQPVGVLERE
RHEHVLSFLV PRADAQLRVL VENQGRVNYG QKLADRKGLI GAVHLDGAPL TGWTSRPLPL
DDLTGLAYAE LDGPAVGPGF HRGTFDLDRC ADTYLHLPGW TKGVAWINGF NLGRYWSRGP
QGSLYVPGPV LRAGTNELVV LELHGARAAA AELRPVPDLG PTEL