Gene Cag_1657 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1657 
Symbol 
ID3747675 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp2154547 
End bp2156325 
Gene Length1779 bp 
Protein Length592 aa 
Translation table11 
GC content50% 
IMG OID637774195 
Productbeta-N-acetylglucosaminidase 
Protein accessionYP_379952 
Protein GI78189614 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAGTT ATCATTCTCT CTTATTTCCC TTTTTTTCAT TGAAAAAAAA ATGGAGCCGC 
AAAAGCCGCC GACTCTTCAT GCTTTTTATG GTACTTACGT TGAGCGTTGG CAACCATGCA
AGCGCTCGTG TAGGGGGTGA ATCGTGGCGT GCGGAGCAAA TTTTTAGCAA GCAGAGCGAT
GAGCTTGAAG AGCAATTGCA CGCCATGAGC TTAGCCGACA AAGTTGGGCA AATGATTATT
GCCGACATTG AGCCAACAGC TTTTTCTCCC CGCAATAAAA AGGTGCTGCT GCTGAGCCGT
TTAGCGCAAG AAGGCAAAAT TGGCGGGGTT ATTTTTATGA AAGGCGATGC CAAAAGCACG
GGCGCCGTAG TGAATCACTT GCAAGCTTTA GCGCCGTTGC CACTGCTTTT TAGCTCCGAT
ATGGAACGAG GAGTTGCCAT GCGCATTAGT GGCACCACCG AGTTTCCGCC AAACATGGCG
CTTGGCGCTA CCGCTGATCC AAAGTTAGCT GAAGACATGG CAACGGCTAT TGCTCAAGAA
GCCACCTTGC TTGGAATGCA CCACAACTAC GCGCCAACGG TTGACCTTAA TAGTAATCCC
CGCAACCCCG TTATTAACAC GCGTGCTTTT GGCGATACCA TTCCGCTTAC CATTGTAATG
GCAAATGCCA TTATTAAGGG ATTGCAATCG CACGGCGTAC TTGCCACCGC GAAGCATTTT
CCCGGACATG GCAACGTTAC GGTGGATAGC CATGTGGCGC TTCCCGTATT ACAAGCTACT
CGTGAGCAAC TTGAGGCTTA CGAGCTTATT CCTTTTCGTG CAGCAATTGA GCAAGGCGTG
GCAACCATTA TGGTGGGGCA TCTTGCCGTG CCAGCACTAA CGGGCAACAT GGAGCCTGCA
ACCATTTCAC CTGCCATTGT AACCACGTTG TTGCGTCAAG AACTTGGCTT TAAAGGGCTA
ATTATTACCG ATGCGCTTAA CATGAAGGCG CTTTATAACG GCAGCAACGT TGCCACTCTT
TCGGTGCGAG CTGTGCAAGC AGGGAACGAC TTGCTGCTTT TTTCGCCCGA CCCCGAAGCT
ACCCATAGCG CAGTTGTGCA AGCCGTTGAA GCGGGGCAAA TTCCCCTTGA GCAAATTAAC
GCTTCGGTGC GACGCATTTT GCAAGCCAAA CAATGGCTAA AGCTTGAAAA GCATCGCGAG
GTAGATAGTG AGGATATTGA AGAGGATGCC AATCCAGCAA GCCATCGCGA ACTTGCCCGC
AAAATTGCTG AACATGCCGT AACGTTAGTG AGCGATGTGG AACGCAATGT GCCGCTTAAA
AAGAGTGAGC AGCTCCTTCA CCTTATTGTA CAAGATCGGG TGAATTACCA AACAGGGCGC
AATTACCTTC GCCAACTCAG CGAACGCTAT CCCACCATAA CTCATCTACG CATTAACCCT
AAAAGCGATG CGCTTGATTA TGCTATTGCC ACCGAACTTG CCATGAACGC CTCAAGCGTG
CTTGTAACAT CTTACGTGCA ATCGCTAAGT AGCAATGGCG AACTCAAACT TACTGCCGAA
CAGCAAAACT TTTTGCACTT ATTACCGACG GTGGTTCAGC GTGGTACGCC CATGGTGTTG
CTGTCGCTTG GCACGCCGTA TATTAGCAAC TATTTTCCAG AGTTTACAAG CTATCTTTGC
ACCTACTCGT TTGACGAGGA GAGCGAACGT GCGGCTCTGC AAGTGTTGCA GGGCGAGCTT
ACGCCTCGTG GTGTGCTGCC TATTGTGCTT GGGCAGTAG
 
Protein sequence
MSSYHSLLFP FFSLKKKWSR KSRRLFMLFM VLTLSVGNHA SARVGGESWR AEQIFSKQSD 
ELEEQLHAMS LADKVGQMII ADIEPTAFSP RNKKVLLLSR LAQEGKIGGV IFMKGDAKST
GAVVNHLQAL APLPLLFSSD MERGVAMRIS GTTEFPPNMA LGATADPKLA EDMATAIAQE
ATLLGMHHNY APTVDLNSNP RNPVINTRAF GDTIPLTIVM ANAIIKGLQS HGVLATAKHF
PGHGNVTVDS HVALPVLQAT REQLEAYELI PFRAAIEQGV ATIMVGHLAV PALTGNMEPA
TISPAIVTTL LRQELGFKGL IITDALNMKA LYNGSNVATL SVRAVQAGND LLLFSPDPEA
THSAVVQAVE AGQIPLEQIN ASVRRILQAK QWLKLEKHRE VDSEDIEEDA NPASHRELAR
KIAEHAVTLV SDVERNVPLK KSEQLLHLIV QDRVNYQTGR NYLRQLSERY PTITHLRINP
KSDALDYAIA TELAMNASSV LVTSYVQSLS SNGELKLTAE QQNFLHLLPT VVQRGTPMVL
LSLGTPYISN YFPEFTSYLC TYSFDEESER AALQVLQGEL TPRGVLPIVL GQ