Gene Haur_2313 details

Gene Information

Locus tag: Haur_2313
Symbol:
ID: 5734215
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2953399
End bp: 2955843
Gene Length: 2445 bp
Protein Length: 814 aa
Translation table: 11
GC content: 51%
IMG OID: 641279454
Product: hypothetical protein
Protein accession: YP_001545081
Protein GI: 159898834
COG category:
COG ID:
TIGRFAM ID:
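
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The short Python sketch below checks that relationship. It assumes the coordinates are 1-based and inclusive and that the gene length includes the stop codon; the record does not state either convention. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(2953399, 2955843)
print(length)                     # 2445
print(protein_length_aa(length))  # 814
```

Both results agree with the Gene Length (2445 bp) and Protein Length (814 aa) values listed in the record.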


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.862901
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGACGTA TTTCAAGTTT GATCTTGCTG ATGATTCTGG TCGGCTGTGG GGTGGCTGCT 
CCAGCGCCCA CGAACCCTGC CGTAACCAGT GTGAGCGGCG AGGCCACCAA TTTGCCGCCC
GATTCAACCA AAACCAGTTT GCCAGTGATC ATGACCGGTG ATCCGAATGC TCACCCACGT
TTATGGCTAA CTCAAGCCGA TTTGCCGCGA TTGCGAGCGT GGGCCAGTGA TTCCAACCCA
ATGTATCGTG ATGGCTTGAA GTTGGTGGCT GAAGAGGCCA AAATTACCAT GGATGCTGGC
GATGTGCCAA TCCGTGATTG TGGCAGCACC GAATACGAAG AGTTTCCAAC CGAAATGTAT
GCTGAACTCT TCGCTTTTAT GTCGTTGATT GATCCCAATG AGGCGGCTCG CGCCGATTAT
GCGCAACGAG CGCGAACCCT GTTGATGTAC ATCATTAATA TTGCGGCGCA AGGCCCAGCT
GAAAATGATG ATTATCTGTG CCCTGAAACT CAATCGACAG GCTACCCGCC ATTTCGTAGC
CCACGCTTTT TTACCGAGGA TTCAAACCGC GCCCGTTGGC ATGGCGAGGC CTTTCCCTTG
GTCGTCGATT GGATTTACCC AGTCTTGAGT GCTAGCGATA AAGCGGCGAT TCGCGGGGTC
TTTTTACGCT GGTCAGATGA AATTGTCCAA CGTGCCTATC ATCATCCCGA GCCAGTTGGC
GTGATCAACG ATCCAAGTTT GATTGCCGAT ACAGCTCAAG TACGTTGGTC TGGCAATAAC
TATTTTGTGG CCCATATGCG CAATTTGGGC ATGATGGCAC TAGCATTTGA TGCCAATGAC
GACCCAAATA ATCAACTGCG CGATTATTTG AACAACGCCA CCGGTGCTTG GCTGTATATT
TTTGATCATC TGACCCGTAC CGATAGCAAG GGTGGCCTAT TGCCGGAAGG CTTTGAGTAT
AGCCCACAAA CCGCCAGCTA TGCGATTCAA TTTCTGTTGG CTTTGCAAAC CGCAGGCAAA
GATACCTGTG GCCCGCACTG CAAACTGACC GAAAATCCAT TTTGGGATGA TTTTGTTACC
AGCTACTTAC ACTCGTTGAG TTCCAACCCA ACCGAAGATC CTAATCATGG TTTGGTCTAT
CAGCCAGCCT GGTATGGCGA TGCCCAGAGC TATCACTTGG TCGATTTTAT CAATGCCTTT
GGTTCATTGG GGATTTACGA TCAACGGACT GGCAATGCTG CCCGTTTGGC CAAAGTGCGC
TGGATCGAAA CCGTCACCCC GCCAGGTGGA GCTGAGGGCT TGATCGAACG AGTTGGCAAC
CCCAATGATT TTCGCGATGC CATTATCTAT TTCATGCTGT TTGATCCGAC GGCTGCTGTG
GCAAGCGACC CACGGCCAAG CCTACCGCTG GATTTCTACG CTGAAGGCAT GCACAAAATT
TTCTCGCGTA CCAGTTGGGC CGATGATACA GGTTGGTTCA ATTTCAGCCT AAGTTGGAAT
TTTATCGACC ATCAACAGGC CGATGGCAAT CATTTTGAGT GGTTTCGCAA CGGCGAATGG
CTAACCAAAG CTCGCACTGG CTACGCTGAT ATCGCCGAGG GCATCGCCAG CTCCGAGTTT
CGCAACCTGA TCGCACTCGA AAATGATCAG CCTGATCGTG ATCCATCCGA TTGGCGGATC
GATCTATGGC AACGTGGCTC ACAATGGAAT TTAGTGCCAA CTGGTGATCC AAACTTAGTT
GCACATAGCC TTGATAGCCG TTTTACCTAT GCTTTGGGCG ATGCAACCAA CCTCTACAAT
AGCGAAAACG AGGCAACGAC CGATATTAGC CACGCTAGTC GTTCGATTGT CTGGCTCAAA
CCCGACACGA TTGTGACCTA TGATCGTGGC ACATCGCAAA CCGCCAACCG CTTCAAACGC
TGGTGGCTAC AATTGCCCAC ACCAGCAACC GTTAACGGCA ACCGCGCTAC GATGACAACT
GCTGGTGGAC AGACGTTGAA TGTAACCAGT TTATTGCCAG CGGGTGCGAC ATTAAGCGCC
GTCAACATTG CCGACCAACA ATCGGAGAAT ACTGCGGCCA GCGACGACCC CATGAAAGTG
CGATTGCGAA TTGATGCTCC TGGCAACCCA CAGGATGTAC GCTTTTTGCA GGTATTGCAA
GCTGCTAACA GTGGAGTCAC GCCAGCCAAC GTCAGCCTCA TTCAGGCTAG TTCTGGCAAT
TATGCTGGGG CACAAATTGG CTCGCAGGTG GTGCTCTTCC CAATTAATCT TGATCAGGCT
TTTGCCACAA TCGAATATAG CACCGCGAAC GCTGCCAACC TGCAATTAAT CACAGGATTG
CAACCAAACA CGGGCTACAC CGTCCAGCGC AATGGCAATA ATGTCAGTAT CAGCCAAGGT
GGCTCGCAAA TGAGCGATAG CGGTGGAGTT TTGGTAATAG AGTAA
 
Protein sequence
MRRISSLILL MILVGCGVAA PAPTNPAVTS VSGEATNLPP DSTKTSLPVI MTGDPNAHPR 
LWLTQADLPR LRAWASDSNP MYRDGLKLVA EEAKITMDAG DVPIRDCGST EYEEFPTEMY
AELFAFMSLI DPNEAARADY AQRARTLLMY IINIAAQGPA ENDDYLCPET QSTGYPPFRS
PRFFTEDSNR ARWHGEAFPL VVDWIYPVLS ASDKAAIRGV FLRWSDEIVQ RAYHHPEPVG
VINDPSLIAD TAQVRWSGNN YFVAHMRNLG MMALAFDAND DPNNQLRDYL NNATGAWLYI
FDHLTRTDSK GGLLPEGFEY SPQTASYAIQ FLLALQTAGK DTCGPHCKLT ENPFWDDFVT
SYLHSLSSNP TEDPNHGLVY QPAWYGDAQS YHLVDFINAF GSLGIYDQRT GNAARLAKVR
WIETVTPPGG AEGLIERVGN PNDFRDAIIY FMLFDPTAAV ASDPRPSLPL DFYAEGMHKI
FSRTSWADDT GWFNFSLSWN FIDHQQADGN HFEWFRNGEW LTKARTGYAD IAEGIASSEF
RNLIALENDQ PDRDPSDWRI DLWQRGSQWN LVPTGDPNLV AHSLDSRFTY ALGDATNLYN
SENEATTDIS HASRSIVWLK PDTIVTYDRG TSQTANRFKR WWLQLPTPAT VNGNRATMTT
AGGQTLNVTS LLPAGATLSA VNIADQQSEN TAASDDPMKV RLRIDAPGNP QDVRFLQVLQ
AANSGVTPAN VSLIQASSGN YAGAQIGSQV VLFPINLDQA FATIEYSTAN AANLQLITGL
QPNTGYTVQR NGNNVSISQG GSQMSDSGGV LVIE
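
The gene and protein sequences above are printed in blocks of ten characters, and the record lists translation table 11 and a GC content value. The following Python sketch shows one way to strip the block spacing, compute a GC fraction, and translate a coding sequence. It is not the method used to produce this record. It assumes the sequence contains only A, C, G and T, and it relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as start codons, which does not matter here because the gene begins with ATG). All names are illustrative.

```python
BASES = "TCAG"
# Amino acids in the conventional T, C, A, G codon order; '*' marks a stop.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]  # KeyError on non-ACGT input
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGCGACGTA TTTCAAGTTT"))  # MRRISS
print(translate("TTGGTAATAG AGTAA"))       # LVIE
print(gc_fraction("ATGCGACGTA"))           # 0.5
```

The two translated fragments are the first 20 and last 15 bases of the gene sequence, and they match the start (MRRISS) and end (LVIE) of the protein sequence listed above.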