Gene Hhal_0137 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0137 
Symbol 
ID4710670 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp156150 
End bp157310 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content69% 
IMG OID639854595 
Productglucose sorbosone dehydrogenase 
Protein accessionYP_001001733 
Protein GI121996946 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2133] Glucose/sorbosone dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCCGAG TCACGCCGCG CGTCGCTGCC CTCGCCGGTC TCCCGCTGAT CGGCGGGTCC 
GCCCTCGCCG GGGAGGTGAT CGAGGCCAAC CACGACACCG AGTACCACGG CGTGCGCATC
GTGCAGGTGG CCACGGACCT GGAACACCCC TGGGGGCTGG CCTTCCTGCC CGACGGCGGC
ATGCTGGTCA CCGAGCGCCC GGGCCGCATC AACCGGATCG AGGACGGCCA GGTCGAGCGC
CTATCAGGCG GCCCCGAGAA CGTCTTCGCC CGCAACCAGG GCGGGATGCT CGATATCGCC
CTCCATCCGG ACTTCGATGA CAACCGCCAG GTCTACTTCA CGTACGCGCA CGGTGATGCC
GACGAGACCA CCGTAGCGCT GGCGCGGGCA CGCCTCGATG AAGACGCGCC CCGGCTGACC
GACCTCGAGG AGCTCTTGGT GGCCGATGCC GGGGCCAGCC CCGGGCGGCA CTACGGCTCA
CGGATCGATT TCAAACCGGA CGGGACCCTG CTCATGACCG TCGGTGACCG CGGCGATGAC
GAACACGATC CGGACAGCCA CCGCGCCCAG GACAACAGCA ATCACGTCGG GACCACCCTG
CGCCTGAAGG ACGACGGCTC GGTGCCCGCC GACAACCCCT TCGTTGAAGA CGACGAGGTG
CGCGACGAGA TCTACACCTA CGGCCACCGC AACGCCCAGG GTCAGTTCAT TCACCCGGAG
ACCGGTGAGA TCTGGCAGAG TGAGCACGGG CCGCGAGGTG GCGATGAACT CAACCGGGTC
CAGGCCGGGC ATAACTACGG GTGGCCGATC ATCTCCCATG GCCGCGACTA CGCCACCCAG
GAACCGATCG GGACCGGGCG CCATGCCGAG GGCATGGAAT CGCCCATCCG GGACTGGACC
CCGGCCATCG CACCCTCGGG GCTGGATCAC TACAGCGGCG AGGCGTTCCC GCGCTGGGAG
GGGGATTTCC TGGCCGGCGC GCTGGTGCGC CCGGCGGTGC GCCGCGTGGT CGTCGAGGAC
GACACGGTGG TCCACGAAGA GGAGATCCTG CGCGACGCCG TGGGTCGGGT CCGCGCCGTC
CAGGAGGGCC CCGAGGGACG GATCTATCTG CTGACCGACG AATCCGATGG CGGCATCTAC
CGCCTGGAAC CTGCCGACTG A
 
Protein sequence
MIRVTPRVAA LAGLPLIGGS ALAGEVIEAN HDTEYHGVRI VQVATDLEHP WGLAFLPDGG 
MLVTERPGRI NRIEDGQVER LSGGPENVFA RNQGGMLDIA LHPDFDDNRQ VYFTYAHGDA
DETTVALARA RLDEDAPRLT DLEELLVADA GASPGRHYGS RIDFKPDGTL LMTVGDRGDD
EHDPDSHRAQ DNSNHVGTTL RLKDDGSVPA DNPFVEDDEV RDEIYTYGHR NAQGQFIHPE
TGEIWQSEHG PRGGDELNRV QAGHNYGWPI ISHGRDYATQ EPIGTGRHAE GMESPIRDWT
PAIAPSGLDH YSGEAFPRWE GDFLAGALVR PAVRRVVVED DTVVHEEEIL RDAVGRVRAV
QEGPEGRIYL LTDESDGGIY RLEPAD