Gene Jann_1944 details


Gene Information

Locus tag: Jann_1944
Symbol:
ID: 3934395
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Jannaschia sp. CCS1
Kingdom: Bacteria
Replicon accession: NC_007802
Strand:
Start bp: 1934646
End bp: 1935947
Gene Length: 1302 bp
Protein Length: 433 aa
Translation table: 11
GC content: 64%
IMG OID: 637904298
Product: Beta-glucosidase
Protein accession: YP_509886
Protein GI: 89054435
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase
TIGRFAM ID: [TIGR03356] beta-galactosidase
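
The start, end, gene length, and protein length fields are mutually consistent, which a little arithmetic confirms. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the last codon is a stop codon not counted in the protein length. Neither assumption is stated on this page, and the function names are illustrative.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1934646, 1935947)
print(length)                  # 1302
print(protein_length(length))  # 433
```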


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.781532
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATTTCA AACGCTCCGA TTTCCCGGAG GGCTTTCGCT TCGGGGTCGC GACCTCCGCC 
TACCAGATCG AGGGGCACGC CCAGGGCGGC GCGGGCCTGA CCCATTGGGA CAGTTTTGCC
GCCACGCCCG GCAATGTCGT GCGGTTTGAG GACGGCGCGC GGGCCTGCGA CCACCTGAAC
CGGCTGGACG AGGATCTGGA CCTGATCCGC GATCTGGGCG CGGACGTCTA CCGGTTCTCC
ACCTCCTGGG CGCGGGTGAT GCCGGAGGGG CGTGGAGCGG CCAACAAGGA CGGCCTCGAT
TTCTACGATC GCCTCGTCGA TGGGCTGCTG GAGCGGGGCA TCGCGCCCGC CGTCACGCTC
TATCACTGGG AATTGCCGCA GGCGCTGGCC GATAGGGGCG GCTGGCGCAA CGCCGACATG
CCCGATTGGT TTGCCGACTA CACCGAGACC ATCATGTCCC GCATCGGCGA CCGCACCTGG
TCCGCCGCTC CGATCAATGA GCCGTGGTGC GTCAGTTGGC TGTCGCATTT TGAGGGCCAC
CACGCGCCGG GACTGCGTGA TATCCGCGCC ACCGCGCGGG CCATGCACCA CGTGCTGGTC
AGCCACGGGC GGTCGATCCA GGTCATGAAA GGTCTGGGCG TGAAAAACCT CGGGGCCGTG
TGCAATTTTG AATGGGCCAT GCCCAACACC GACAGCGATG CCGACATCGC CGCCGCCGCG
CGCTATGACG CGATCTACAA CCGCTTCTTT CTGGGCGGGC TGTTCAAGGG CGACTACCCG
GCAGAGGTGA TGGAGGGGTT GGAGCCCCAC CTGCCCGATG GCTGGCAGGA CGATTTCGCC
ACCATCCGCT CACCGCTCGA TTGGGTGGGG GTGAATTACT ACACCAACAA ACGCATCAGC
GCGACCGATG ACCCCTGGCC CGCCTATGCC TATGCGCCCA CCCAAGGCCC CCTGACCGAC
ATGGGGTGGG AGGTCTACCC GCAGGGGTTG CAGGATTTTC TGACCCGCAC CGCCCGCGAA
TACACTGGTG ATCTGCCGAT CTATGTCACC GAAAACGGCA TGGCGTCCGC CACCACGCCC
GACCCCGACC GGATCGCCTA TCTGACCGAC CACCTGCACA GCGTTCAGGC CGCGATTGCC
GACGGCGCCC CCGTTGCGGG CTATTACGTG TGGTCCCTGA TGGACAATTA TGAGTGGGCT
TTGGGATACG AGAAACGCTT CGGCCTCGTC CATGTGGATT TTGAGACCTT GGCACGGACG
CCCAAAGCGT CCTATCACGC ATTAGCAAAT TGGTGGCGCT GA
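
The GC content reported under Gene Information is presumably the percentage of G and C bases in the gene sequence. The page does not define it, so treat that as an assumption. A minimal Python sketch of the calculation follows; the function name `gc_content` is illustrative, and the input may contain the spaces and line breaks used in the listing above.

```python
def gc_content(dna: str) -> float:
    """Percentage of G and C bases, ignoring whitespace in the input."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Usage: paste the full gene sequence as a (multi-line) string.
# Shown here on the first block of the sequence only.
print(round(gc_content("ATGGATTTCA")))
```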
 
Protein sequence
MDFKRSDFPE GFRFGVATSA YQIEGHAQGG AGLTHWDSFA ATPGNVVRFE DGARACDHLN 
RLDEDLDLIR DLGADVYRFS TSWARVMPEG RGAANKDGLD FYDRLVDGLL ERGIAPAVTL
YHWELPQALA DRGGWRNADM PDWFADYTET IMSRIGDRTW SAAPINEPWC VSWLSHFEGH
HAPGLRDIRA TARAMHHVLV SHGRSIQVMK GLGVKNLGAV CNFEWAMPNT DSDADIAAAA
RYDAIYNRFF LGGLFKGDYP AEVMEGLEPH LPDGWQDDFA TIRSPLDWVG VNYYTNKRIS
ATDDPWPAYA YAPTQGPLTD MGWEVYPQGL QDFLTRTARE YTGDLPIYVT ENGMASATTP
DPDRIAYLTD HLHSVQAAIA DGAPVAGYYV WSLMDNYEWA LGYEKRFGLV HVDFETLART
PKASYHALAN WWR
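
The protein sequence can be derived from the gene sequence by codon translation. The sketch below is a minimal Python illustration, not the database's own procedure. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in the start codons it permits). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...),
# paired with the amino acids of the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGGATTTCA AACGCTCCGA TTTCCCGGAG"))  # MDFKRSDFPE
```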