Gene Cphamn1_2440 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_2440 
Symbol 
ID6376135 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp2602647 
End bp2603768 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content50% 
IMG OID642684918 
ProductBeta-N-acetylhexosaminidase 
Protein accessionYP_001960816 
Protein GI189501346 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACAAA CCATAGTTCT CGTTTTCTCT CTCCTCCTGC TCACCGTGAA CGCATGTGCT 
GCGGATCTGC CCGAAGAGGA TGAGCTCAGC AGGAAAATAG GCCAGATGAT TATGGTCGGA
TTCAGGGGAA CCTCTCTTCA GGAAGCCCCT GGGCTCATGC AGGATATAAC AAAACGTCAC
CTTGGAGGCG TGGTTCTGTT CGACTATGAC GTTCCATCGA AATCTACCGG CAGAAACATC
ACCTCCCGTG AACAACTCCA AAAACTGACG ACCGGGTTAC AGCAGACATC GGCAGTTCCT
CTTTTTATCG CTATCGACCA GGAAGGCGGC AGAGTTTCCA GATTAAAGAC AACCTGCGGC
TTCCCGGCGA GCGTCACGGC TGCCGGGCTC GGCAAGCTGA ACAACACTGA CAGCACCTTT
CAGTCTTCGC TTGCAACAGC GAAAACCCTG CATAACAGCG GCATCAATGT CAATTTCGCT
CCCGTTGTCG ATCTCAACAG CAATCCTGAA AACCCCGTTA TCGGTTCTCT TGAAAGAAGT
TTTTCAGCGG ACGCTGCAAT TGTGTACAAA CATGCACGTG CAACAGTAGA GGCGTTTCAC
ACACAAAACA TTATTGCCGC TCTCAAACAC TTTCCCGGCC ATGGCAGCTC AACTACCGAC
ACCCATAAGG ATTTTACCGA TATTACCGGA AGCTGGAAAA AAAACGAACT TGACCCCTAC
AGGCGTCTGA TTGAAAACGG CTATACCGAT CTTGTCATGA CTGCTCATGT CTACAACGCC
AATCTTGACA ACCGTTACCC TGCGACACTC TCAAAGCAGA TCATATCAGG CCTGTTGCGC
GATTCCCTTG GATTTAACGG CCCTGTCATA AGCGATGACA TGCAGATGCA GGCGTTAGCC
GCGCATTACG ACCTTCGAAC CGCCATTACC CTGGCCCTCG AAGCCGGCGT AGACATTCTT
CTCTTTGCCA ATAACTCGGT CTATGATCCA GATATTGCCG AAAAAGCCGT ATCGATCATC
CGTTCACTGG TCGAAGAGGG AACGCTGAAC CCGAATCGTA TCGACGCCTC CTACAAGCGG
ATCATGAAAC TGAAAACGCA CTACCTGAAA ACATCGACAT GA
 
Protein sequence
MKQTIVLVFS LLLLTVNACA ADLPEEDELS RKIGQMIMVG FRGTSLQEAP GLMQDITKRH 
LGGVVLFDYD VPSKSTGRNI TSREQLQKLT TGLQQTSAVP LFIAIDQEGG RVSRLKTTCG
FPASVTAAGL GKLNNTDSTF QSSLATAKTL HNSGINVNFA PVVDLNSNPE NPVIGSLERS
FSADAAIVYK HARATVEAFH TQNIIAALKH FPGHGSSTTD THKDFTDITG SWKKNELDPY
RRLIENGYTD LVMTAHVYNA NLDNRYPATL SKQIISGLLR DSLGFNGPVI SDDMQMQALA
AHYDLRTAIT LALEAGVDIL LFANNSVYDP DIAEKAVSII RSLVEEGTLN PNRIDASYKR
IMKLKTHYLK TST