Gene Haur_5274 details


Gene Information

Locus tag: Haur_5274
Symbol:
ID: 5737232
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009974
Strand:
Start bp: 59182
End bp: 60741
Gene length: 1560 bp
Protein length: 519 aa
Translation table: 11
GC content: 45%
IMG OID: 641282438
Product: hypothetical protein
Protein accession: YP_001548029
Protein GI: 159901784
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGACTAC ACCATTGCAC TATTGGACAC TATCGTGTTC TTCGCAATCT TCAGATTAAA 
TTCAATAACA CGGATAACGA ACGTCTAGGG ATTGATTTTC TAGTAGGCCA GAATGGCAGC
GGCAAATCAA CTCTGCTTCA AGCTATTACT TTTATAATCC AGCAATTAGA AAAGGGAAGC
GATGTGCCCT TTCGCTTCTA TCTAGAGTAC GAACTTGGCT CAGCGAATGC CAAGCAGAGT
ATCCGCATTT ATAACTATGA ATTGGATGAA GATTCTAACC AAAAAAAGCT GGACAGGGCA
AGAGCAGACA CGCGTACAGT AAATACTGAA TGGCAGGAGC GTCATGGCTC ATTGGAAGAA
ATCCTACCAC GAGTAATAAT CCTTACAACA GGCAGAGAGC AGGAATGGCA AACACTTTTG
AAACCATCTC AGCAGAGCGT TACGCTTGAT CCGCTTGAAG TTCTATCGGG ATCACTAGCA
GATTCCGATA TTCACGCTCA GCATGAGCAA CGACTACTTA GTGAGCGGGT AGGGGCATCA
ATTGTTTTTG AACCAAGTGC TCAATCAGAA TCACAGAAAA TCACGTTTGT CCCAACCGAA
GCATTGCCAC TTGTAACACT ATGCGGGGTT TTAGCAGAAC TTTCGCATAA TGGCATACTT
AACTCAATGA AAGATATTTT CGAAGATGTT CGCATTCGGC AAGTATGTGC GTTCTCTCTC
CGCTTTCGAC TCAACCTTGC TGGCTCGAAT GAGCGACGTG AAATTCAGGA GCTTGGGCGA
AAGGCTACGC GTGCGGTGCA TATAGGTTCT GACTGGCTAC TCTATTTCGA CTTAACAGAT
CAGCAACAAT CGAGTATCCA AGATTTGTTG GGCGACCGGG GTGGAGCTTT CGCTTTCTAC
CAACAACTAC GGCGGTTGAG TAGGAATGCA ATTGCAGCAG AGCGGGTGCT TCAAGAGGTT
AATATCTTTG TTAAACGTGG CCCGAAGGAC GAAGATCCAG TTAAAGATCG CGAGATTGCT
CAGAATGCAC CATTGCACCC ACTTGCTTGG TTGAGTGATG GGGAACGTAG CTTCATAGGC
CGGATGAGTC TATTCAGTAT GCTCCGCGAT CAAGACCAAC TTATTCTGCT TGATGAACCT
GAAGTTCATT TTAATGACTA CTGGAAACGC CAGATAGTTG ATCGTTTAGC AACATTGCTC
GAAGATGGCA AGTGTCATGC CCTGATTACA TCTCACTCCA GCATCACGCT GACTGATGTT
CCACGTGAGG ATATTATTGT GTTGCATCGT GGAGAGCAAT ATACTCAGAG TGCTGGGAGT
CCTACGCTTA AAACACTGGC TGCTGATCCG AGTGATATTA TTATTCATGT TTTCGATTCG
CCTTATGCGA CAGGCCAATA TGCTGTTAAG AAAGTTAAAG CGATATTGGA TGAAGTTGGC
CAGCGTAATA ACAAGCAGGC ACAGCAAAAG CTTAAAGACT TGCTTAACGA AGTTGGGCCT
GGGTATTGGA GCTATCGAAT CCGGCGGGTA CTGTATAGGA TGAGTAACGA TGCTTCATAA
 
Protein sequence
MRLHHCTIGH YRVLRNLQIK FNNTDNERLG IDFLVGQNGS GKSTLLQAIT FIIQQLEKGS 
DVPFRFYLEY ELGSANAKQS IRIYNYELDE DSNQKKLDRA RADTRTVNTE WQERHGSLEE
ILPRVIILTT GREQEWQTLL KPSQQSVTLD PLEVLSGSLA DSDIHAQHEQ RLLSERVGAS
IVFEPSAQSE SQKITFVPTE ALPLVTLCGV LAELSHNGIL NSMKDIFEDV RIRQVCAFSL
RFRLNLAGSN ERREIQELGR KATRAVHIGS DWLLYFDLTD QQQSSIQDLL GDRGGAFAFY
QQLRRLSRNA IAAERVLQEV NIFVKRGPKD EDPVKDREIA QNAPLHPLAW LSDGERSFIG
RMSLFSMLRD QDQLILLDEP EVHFNDYWKR QIVDRLATLL EDGKCHALIT SHSSITLTDV
PREDIIVLHR GEQYTQSAGS PTLKTLAADP SDIIIHVFDS PYATGQYAVK KVKAILDEVG
QRNNKQAQQK LKDLLNEVGP GYWSYRIRRV LYRMSNDAS
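
The fields in this record can be cross-checked against one another: the coordinates span End - Start + 1 = 1560 bp, and a 1560 bp CDS holds 520 codons, that is 519 amino acids plus one stop codon. The following is a minimal Python sketch of such checks, not the database's own tooling. All function names are hypothetical. It assumes the sequence contains only A, C, G and T, and it relies on translation table 11 assigning the same amino acids to codons as the standard genetic code (the tables differ in which start codons they permit).

```python
# Illustrative helpers; the names below are hypothetical and not part of the source record.

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; table 11 uses the same
# codon-to-amino-acid assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(block: str) -> str:
    """Remove the spaces and line breaks used for 10-letter display blocks."""
    return "".join(block.split()).upper()


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(1 for base in seq if base in "GC") / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon.

    Raises KeyError on codons containing letters other than A, C, G, T.
    """
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    first_blocks = clean_sequence("ATGAGACTAC ACCATTGCAC")
    print(translate(first_blocks))        # MRLHHC
    print(span_length(59182, 60741))      # 1560
    print(1560 // 3 - 1)                  # 519
```

Passing the full gene sequence above through clean_sequence and translate should reproduce the listed protein sequence, and gc_content on the same string can be compared with the reported GC content.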