Gene Cphamn1_0174 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_0174 
Symbol 
ID6373828 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp169183 
End bp170934 
Gene Length1752 bp 
Protein Length583 aa 
Translation table11 
GC content49% 
IMG OID642682693 
Productglycoside hydrolase family 3 domain protein 
Protein accessionYP_001958630 
Protein GI189499160 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.655452 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGATTC GATATCTTAT GGCTGCAGTC TTTCTGCTGC TCTATACGTT TTCTTCTCCT 
CCCTCCAGAC CCGCTCTTGC AGAGGCCTTT CCGGCTTACA AGAACGCAAC AGCACAGGAA
ATATTCAGAG AAAAAGACAA GTGGGTTGAA AAGCAGCTCA GCGAGATGAC GCTTTCTGAT
AAAATCGGTC AGATGCTAAT CGCTCACAGC CCGGCAAAAT TCCGAAGTAC TGACGACAGT
TACTACAAGA AACTTTCCCT TCTGGTAAGC CAGGGTAAAG TCGGCGGGAT CATGTTTCTC
AAAGGCAATA CCAACGATGC CGCTGTTCTT GCCAACAGGT TTCAGTTTAT TGCTCCAAGA
CCGCTGCTTA TCAGTGCGGA TATGGAAAAA GGACTTGCCA TGAGACTTGA CGGCGCCACA
GAGTTTGCTC CAAGCATGGC CCTTTCGGCA ACAGGCAGAC CGGATCTTGT CTTCAAAATG
GCTGGCGTGA TCGCTCAGGA AGCCAAAGCA CTGGGCATCT ACCACAGTTA CGGGCCCAGT
GTCGATCTGA ACACTAACCC GCTCAATCCG GTGATCAATA CCAGGTCATA CGGTGATAAC
ATCCCCTTGA CCATAGAGAT GTCGAATGCG TTTATTGACG GACTGCAATC GAACGGTATC
ATCGCCACAG CAAAGCATTT TCCCGGACAC GGGGACGTCA CGGTCGACAG TCATATCAAT
CTTCCTGTTC TCAACGCGGA TAAAAAACGT CTGGAACGCG TTGAACTGAA ACCATTCATA
GCAGCTATAG ACCACGGAAT AATGAGCATC ATGATCGGCC ACCTCGCGAT CCCGGCTTAT
ACAGGCAGCA TGACACCGGC GACGCTCTCA TGGAGAATTG TCACAAAACT CCTGAGAAAG
GAACTGGGTT TCGATGGCCT CATCATTACC GACGCGCTGA ACATGAAGGC GCTCTATCAG
TCCTACACTC TTGAAGATAT TTCTTTACGT GCCGTTGAAG CAGGCAACGA CCTGCTTCTT
TTCTCACCTG ACCCGGAACG TACCCACACC ACCCTGCTCA ACGCTGTGAG AAGAGGCAAA
CTCTCGGAAA AACAGATCAA CAAGTCCGTA CGCCGGATAC TTCTGGCCAA AAGGTGGCTT
GGCCTTGATA AAAATCGCCT GGTCAACCTG AACAGTATCC ACGGCCAGAT GAACCTGAAA
AGCCATCGGG AACTTGCAGA GAATATCGCC GACAACGCTA TAACCGTCAT AAGAGACAAG
CATCAGGCCC TTCCTGTCAG GCAAGAAAAC AAAAACAACA TCCTGCATAT CGTTCTCGAA
AACAAGCGCT ACTCACTGTC GGGAGAATCT TTTTCAGACA AGCTGTACAG GGCATTCCAG
GCTAAAACCA TACGTCTGGA CCATAACTCC AGCGCCCGTG ACTATCTCGA CGCTGCTGAT
AAAGCCAAAC GCGCGTCGAC CATTATCGTC TCAACCTATG TTGAAGTGCT TTCCGGCACA
AAGTCACTGG CTGTAAGTAA AGGGCAGGAG GAATTCATCA GCACACTTGT TCGCGATCTG
CCGTCAAAGC GTTCATGTAT TATGATTTCA TTCGGAACGC CCTACCTGAT CAACCAGTTT
CCCGACATAC CTGCTTTCAT CTGTACCTAC TCATCTTCTG AGCTCAGTGA AGATTCCGCC
GTCAGGCTGC TGCAGGGAAA AATCAAGCCG ACAGGAAAAC TCCCCATATC CCTTACGGAA
AACCGGCGGT AA
 
Protein sequence
MLIRYLMAAV FLLLYTFSSP PSRPALAEAF PAYKNATAQE IFREKDKWVE KQLSEMTLSD 
KIGQMLIAHS PAKFRSTDDS YYKKLSLLVS QGKVGGIMFL KGNTNDAAVL ANRFQFIAPR
PLLISADMEK GLAMRLDGAT EFAPSMALSA TGRPDLVFKM AGVIAQEAKA LGIYHSYGPS
VDLNTNPLNP VINTRSYGDN IPLTIEMSNA FIDGLQSNGI IATAKHFPGH GDVTVDSHIN
LPVLNADKKR LERVELKPFI AAIDHGIMSI MIGHLAIPAY TGSMTPATLS WRIVTKLLRK
ELGFDGLIIT DALNMKALYQ SYTLEDISLR AVEAGNDLLL FSPDPERTHT TLLNAVRRGK
LSEKQINKSV RRILLAKRWL GLDKNRLVNL NSIHGQMNLK SHRELAENIA DNAITVIRDK
HQALPVRQEN KNNILHIVLE NKRYSLSGES FSDKLYRAFQ AKTIRLDHNS SARDYLDAAD
KAKRASTIIV STYVEVLSGT KSLAVSKGQE EFISTLVRDL PSKRSCIMIS FGTPYLINQF
PDIPAFICTY SSSELSEDSA VRLLQGKIKP TGKLPISLTE NRR