Gene Haur_3022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3022 
Symbol 
ID5734879 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3817801 
End bp3818913 
Gene Length1113 bp 
Protein Length370 aa 
Translation table11 
GC content52% 
IMG OID641280166 
Productglucose-6-P dehydrogenase subunit-like 
Protein accessionYP_001545788 
Protein GI159899541 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3429] Glucose-6-P dehydrogenase subunit 
TIGRFAM ID[TIGR00534] opcA protein 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGGGCG AAATTTCCAA TAGCCCACGT CCAGCCAATG TTGCTGAGAT TGAAGATCAA 
CTGCGCGATT TGTGGCGTGA ATTGGGCGAT CAGCATCGCG ATGAACATTA TGTGATGACC
CGCGTTTGCA CCATGACTGT GATTGGCTAT GGTGCAAACC AAACCCTCGC CAAACGAGTA
CGCACGGCCT TGCCGCAAGT GTTCGGCGTG CATCCATGCC GCGCAATTTT GATCGAAACT
GGCAGCGAGG CCGAAGAACT CAGCGCATGG GTCAGCACCG TTTGTCAACC AAGTAGCGAA
GAACACGAGC AAGTTTGTTG CGAACAAATT ACCTTCTGTG TTGGCGAGCA AATGCGGCGA
CGTTTGCCCG GCACAGTGCT ACCTTTGGTC GTTTCCGATT TGCCCTTGTT TGTCTGGTGG
CCTGGGCCGT TGCACCCTGC TAGCAATGTG CGTACCCAGT TGTTTGCTCA TGCTGATCGC
TGGATTATGG ATTCGGCGGA TTTTCTCGAC CCGCTGCCCG ATTTGGCGCG TTTGCACAAG
ATGGTTATCA GCGATCAAAC CGATGCGGTC AGCGATTTGA CTTGGGCACG CTTGACCCCG
TGGCGCACTG CGTTTGCCCA AATTTTCGAT GCAATGGCCA TGCGCCCAGT GCTCGAAAAC
CTTAGCAACA TCAAAGTAAC CACCGGACGA CATCAAGCAG CTGGCTTGCT GAGCATCGGC
TGGATGGCAA CCTGCCTTGA TTGGCAGTTG ATCAGCGCGA GTGGCAATAG CGAAACCTTG
CTTTGCAATT TCAAACATCG CAATGGCACG GTTACGATCA GCATGCAAAC CAGCGATCGA
ACTGGCGAGG AAGTGCCATT TATCGAAGTA CAGGCCGCCA AACATAAAGC CAGTATCACC
GTAGGCCGCA GCAGCAACAA ACAAGCCTTG CTAGCCAATG TGCATCTTGA TGGTGAAACT
CGTTGTCAGA TGAGCGAACT ACCAAGGCCC AGCGATAGCC TGTTGCTCTT GAATGAACTA
AATATGTATA GTCATGATCG AGTTTATGAA CGGGCTTTAG CGATTGTCGC CGCAATTGCT
CAAGCAACCG ATCCCCATGG AGCACATGTA TGA
 
Protein sequence
MMGEISNSPR PANVAEIEDQ LRDLWRELGD QHRDEHYVMT RVCTMTVIGY GANQTLAKRV 
RTALPQVFGV HPCRAILIET GSEAEELSAW VSTVCQPSSE EHEQVCCEQI TFCVGEQMRR
RLPGTVLPLV VSDLPLFVWW PGPLHPASNV RTQLFAHADR WIMDSADFLD PLPDLARLHK
MVISDQTDAV SDLTWARLTP WRTAFAQIFD AMAMRPVLEN LSNIKVTTGR HQAAGLLSIG
WMATCLDWQL ISASGNSETL LCNFKHRNGT VTISMQTSDR TGEEVPFIEV QAAKHKASIT
VGRSSNKQAL LANVHLDGET RCQMSELPRP SDSLLLLNEL NMYSHDRVYE RALAIVAAIA
QATDPHGAHV