Gene Haur_4562 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4562 
Symbol 
ID5736407 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5838866 
End bp5840110 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content52% 
IMG OID641281724 
ProductROK family protein 
Protein accessionYP_001547321 
Protein GI159901074 
COG category[G] Carbohydrate transport and metabolism
[K] Transcription 
COG ID[COG1940] Transcriptional regulator/sugar kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCGTC CTTCGGCATT GACAGCTGAT CTGAGTTTGA TGCGAGAACT CAACCGCGCC 
CTGGTACTCC AGTTAATTCG GCGTGAAGGA CGGATTTCGC GGGCAGATAT CGCCAAGCAC
ACCAAGTTAA GTCGTTCGAC CGTTTCAAGC ATCATTAACG ATCTGATTGA CGCAAGCCTT
GTTACGGAAA CGGGTATTGG TACCTCAAAA GGTGGCCGAC GGCCAATTAT CCTCGAATTC
AATTATCAAG CCAATTATAT TATTGGGCTT GATGTTTCGC GCAACGCCGT TAGCGCCGTT
ATTACCGATT TGAACGCCCG AATCTGTTCA CGCCGTCAAA TTTCCTTCAA TGTTAACGAC
GGCCCCACCG TTGGCATGCC GCTGATCAAA CAATTGATTA GCACGATGCT GACCGAATCG
CCGGTTGGTC GTGGCCGCAT TAGTGCAATT GGAGTTGGCG TGCCTGGCCC ATTGGATTTT
CGCAATGGCC GCACAATTGC CCCGCCAGTT ATGCCTGGTT GGGATAACGT GCCAATTCGC
GAAGAGTTAA GCCAAACCTT CCGTTTGCCC GTATCAATTG ATAACGATGC CAACTTGGCC
GCGATGGCCG AGTATCGTTG GGGCGCAGGC CAAGGCGCTC AAAACATGGT CTACTTATAT
ATGAGTAGCG CCGGGATTGG TGCTGGCTTG ATTATTGATA GTCATTTATT CCGTGGCTCG
ATCGGCAGCG CTGGCGAGGT TGGCCATACC ACTCTCAGCG TCGAAAACGA TGAATCATTT
GGGCCAATCA ACGCTGGCTC GCTCGAAGCC TTGGCTTCAC AAATCACGGT TTTACGGCTG
GCCCGCGAGC AAAAGCTGAT TAGTGCCGAT GATGATGTGC ATACCTTGGT GCGTAAAGCC
GAGAGCAGCC CTGAAATTCA AGCTATTTTG CGGCGAACTG CCCACTATCT TGGTGTGGCA
ATTGCCAGTA TCATTAATAT ATTCAATCCT GATCGGGTTG TGATTGGCGG GGTTATTCCG
GAAACCTCAC CATTATTAAT CGAGACGATT CGAGCAACGG TGGCGCGACG CGCTTTATCG
ATTGCAGTGA ATAATACCTC GATTGTACCA GGCGCACTAG GCCGCAATGT CGCAGCTTTA
GGAGCCGCAG CCCTTGCTAC CGAGCGCTTA TTTGCGCCGC CAGCCCTGGA ACGCCCCGCA
ACTTTAGGGG TGCATTCCAA CGAGGTTGGT TCGTTGGCAA GCTAA
 
Protein sequence
MNRPSALTAD LSLMRELNRA LVLQLIRREG RISRADIAKH TKLSRSTVSS IINDLIDASL 
VTETGIGTSK GGRRPIILEF NYQANYIIGL DVSRNAVSAV ITDLNARICS RRQISFNVND
GPTVGMPLIK QLISTMLTES PVGRGRISAI GVGVPGPLDF RNGRTIAPPV MPGWDNVPIR
EELSQTFRLP VSIDNDANLA AMAEYRWGAG QGAQNMVYLY MSSAGIGAGL IIDSHLFRGS
IGSAGEVGHT TLSVENDESF GPINAGSLEA LASQITVLRL AREQKLISAD DDVHTLVRKA
ESSPEIQAIL RRTAHYLGVA IASIINIFNP DRVVIGGVIP ETSPLLIETI RATVARRALS
IAVNNTSIVP GALGRNVAAL GAAALATERL FAPPALERPA TLGVHSNEVG SLAS