Gene Hhal_1261 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1261 
Symbol 
ID4710726 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1368955 
End bp1369995 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content69% 
IMG OID639855734 
ProductBeta-N-acetylhexosaminidase 
Protein accessionYP_001002838 
Protein GI121998051 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGGGCC CGCTCATGAT TGGAATCGCG GGGGGTGAAC TCCCAGCGCA GGCGCGAGAG 
CTCCTGCGGC ATCCGGCCGT CGGTGGGGTC GTTCTCTTCA CGCGTAACTT TAAAAATACG
CAGCAGCTTC ATGATCTGAC ACAGTCCATT CACGCCGTGC GTGAGCCGCC CCTGCTGGTC
GCTGTCGATC AGGAGGGCGG CCGCGTGCAG CGCTTTCGGG AGGGCTTCAC CCCGCTGCCC
GCCGCTTCCT GGTTCGGTCG ATTGTACGAT CTCGATCCCG ATCAGGCCCG GCAGAGTGCG
CACCGGGCGG GGTGGGTGAT GGCCGCGGAA CTGCGCGCCT GCGGGGTGGA TCTGAGTTTC
GCGCCGGTTT TGGATCTCGA TGGGGGGGTG AGCCACGTGA TCGGTTCTCG CGCCCTGCAC
CGTGAGCCGG CCGCGGTGGC GCGGCTCGGT CACGCCTGGA TGCGCGGCAT GCGTCAGGCG
GGGATGGCGG CGGTAGGCAA GCACTTCCCG GGGCACGGTT CGGTCGAGGC CGACAGCCAC
GTGGCCTTGC CCTGCGATGA CCGTTCACTG GCAGAGATCG CGCGTCGGGA TCTGGTACCC
TTTCAGCGCT TGGCGGCCTC GGGACTCCCA GGGGTCATGG CTGCGCACCT GGTCGTCCCC
GATGTCGACG ACCGGCCGGC CGGTTTCTCC CCACGCTGGA TCGGTGACAT CCTGCGCCGC
CGGGTCGGCT TCCTCGGTGC GGTCTTCAGT GATGACCTCG GCATGCGCGG GGCGGAGACC
GCCGGTACCA TGTTCGACCG GGTGGATGCG TGCCTGGGGG CGGGCTGCGA TGTGGCGCTG
GTCTGCGATC CGGCCGAGGC CGAGGCGCTG CTCGGCGAGG CGTCCGGGGA TCGCTGGCTG
GATCCCACGT CGGCGCTGCG TCTGGTACGC ATGCACGGCC GACCGGCGCC CGGGTGGCCC
GAGTTACGGG CGGACGGCGC CTACCAGACA GCGGTCTGCG AATTGACCGA GGGCGTTCCC
GGGTTCGGGG CGCCGGGGTG A
 
Protein sequence
MLGPLMIGIA GGELPAQARE LLRHPAVGGV VLFTRNFKNT QQLHDLTQSI HAVREPPLLV 
AVDQEGGRVQ RFREGFTPLP AASWFGRLYD LDPDQARQSA HRAGWVMAAE LRACGVDLSF
APVLDLDGGV SHVIGSRALH REPAAVARLG HAWMRGMRQA GMAAVGKHFP GHGSVEADSH
VALPCDDRSL AEIARRDLVP FQRLAASGLP GVMAAHLVVP DVDDRPAGFS PRWIGDILRR
RVGFLGAVFS DDLGMRGAET AGTMFDRVDA CLGAGCDVAL VCDPAEAEAL LGEASGDRWL
DPTSALRLVR MHGRPAPGWP ELRADGAYQT AVCELTEGVP GFGAPG