Gene Haur_4786 details


Gene Information

Locus tag: Haur_4786
Symbol:
ID: 5736630
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 6102162
End bp: 6103262
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 45%
IMG OID: 641281951
Product: heat domain-containing protein
Protein accession: YP_001547545
Protein GI: 159901298
COG category: [C] Energy production and conversion
COG ID: [COG1413] FOG: HEAT repeat
TIGRFAM ID:
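
The coordinate and length fields above are mutually consistent, which a few lines of arithmetic can confirm. This is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 6102162
end_bp = 6103262

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1101 366
```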


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGTATGG ATGTTGCGAT TGAGTATTTG CTGGGACGAC TAGCCAAAGA CACTCCAACG 
GAATCTTTTT CTGGTCTATT GCTGCAATTG ACTTGGCTTA TGGATGATCA GGGCGCAGAT
ATTCATCGAA TCATGCGCAA CTGGCTTGAC GCTGATCAGA TTGAAAAAGT CAAAGTTGCG
CTCGCGATCG ATGAAGTTTT TTTACTTGAA TCGGATGCAG AATATCAAGC AAGTATCACT
CGGATTGTTA ATCGTTGGCC CGAACTTGAG CCAATGTGTC GCAAATTTTA TCAGCGTTGG
GATGGTAGCA AACGTGGTAT CCCCTTACCG AATCGGCGTA AAGCGGCTCC CGTGCAATCG
CTCACCGACT GGCTACCGAT TCTTGCTATC TATCAGCATC CAGAGATGGA GAAGGCAAAA
GATGCTTTTT GGGCGGCTGG CTCAGCGAGT TTGCCTTATC TCTTGCCTCT GATTGATGAT
GGAACGACAT GGGCAGCCAG TGAGGCTTGT GAATTGGTTG GCATTCTTGG CGATCCAGTT
GCGATTCCAG TCCTGCGCAA CGCATTGGGA CGTGATTCCA ATATTCCTGA GCTTGATATT
GATATTTTGA CTGCGTTGCG TCAATTGCAT GATTCTGGCG GCCCATGGAT TTATCCGATG
CTCAATGATC CTTCGCCAAA ACGCCGCGGA TTTGCGTGTC AATATGTCGG CATAACCCAA
GATTATGAGG CGCTGCCTTT AATTATCAGC TTACTTAATG ATCCTGATCC TCGGGTTCGC
AGTTTTGCGG TGGATACATT AGCTGTGTTC AAAGATGGCA AACCATCAGC AGTAATCGAT
CCGATTTTCT ATAAACTTGT CATGGATCAT CAATCGGGGA TTCGATTGGA AGTACTACGA
TCATTTATGA AATGTTGTGA CCCATTCTAT ATTCCATTTT TTATCACAGC GTTACGCGAT
CCATACTATC TCTGTCGAAG TGTCGCAATA CAATCCTTGG GCAAAGTTGA TCCGATTAAT
TTAGGTTGTT ATCTTGATTC TATGGTAGAC GAGACAGATA GCGATGTTCA AAGGACAATT
AAGGCTTGGC GTGATTTTTG A
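
The GC content field in the gene information can in principle be recomputed from the sequence above. The following is a minimal sketch, assuming the sequence is pasted as plain text with the 10-base blocks separated by whitespace; `gc_content` is a hypothetical helper name, and the record does not say how the database itself rounds the value.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First 10-base block of the gene sequence: 4 G and 1 C out of 10 bases.
print(gc_content("ATGCGTATGG"))  # 0.5
```

Passing the full gene sequence instead of a single block gives the overall fraction, which can be compared with the reported percentage.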
 
Protein sequence
MRMDVAIEYL LGRLAKDTPT ESFSGLLLQL TWLMDDQGAD IHRIMRNWLD ADQIEKVKVA 
LAIDEVFLLE SDAEYQASIT RIVNRWPELE PMCRKFYQRW DGSKRGIPLP NRRKAAPVQS
LTDWLPILAI YQHPEMEKAK DAFWAAGSAS LPYLLPLIDD GTTWAASEAC ELVGILGDPV
AIPVLRNALG RDSNIPELDI DILTALRQLH DSGGPWIYPM LNDPSPKRRG FACQYVGITQ
DYEALPLIIS LLNDPDPRVR SFAVDTLAVF KDGKPSAVID PIFYKLVMDH QSGIRLEVLR
SFMKCCDPFY IPFFITALRD PYYLCRSVAI QSLGKVDPIN LGCYLDSMVD ETDSDVQRTI
KAWRDF
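
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. A minimal sketch of that translation follows, assuming the codon-to-residue assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its set of permitted start codons, which does not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative, not part of the record.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first, second, third base).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(seq: str) -> str:
    """Translate DNA to protein, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop the spaces between blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases of the gene sequence.
print(translate("ATGCGTATGG ATGTTGCGAT T"))  # MRMDVAI
# Last 21 bases, ending in the TGA stop codon.
print(translate("AAGGCTTGGC GTGATTTTTG A"))  # KAWRDF
```

Both fragments match the start and end of the protein sequence listed above.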