Gene Haur_5217 details


Gene Information

Locus tag: Haur_5217
Symbol:
ID: 5737175
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009973
Strand:
Start bp: 313062
End bp: 317639
Gene Length: 4578 bp
Protein Length: 1525 aa
Translation table: 11
GC content: 39%
IMG OID: 641282381
Product: hypothetical protein
Protein accession: YP_001547972
Protein GI: 159901726
COG category:
COG ID:
TIGRFAM ID:
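
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 313062
END_BP = 317639
GENE_LENGTH_BP = 4578
PROTEIN_LENGTH_AA = 1525


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are inclusive (assumption)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of residues if the final codon is a stop codon (assumption)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    span = span_length(START_BP, END_BP)
    print("span from coordinates:", span, "bp")
    print("matches listed gene length:", span == GENE_LENGTH_BP)
    print("expected protein length:", expected_protein_length(span), "aa")
```

Under these assumptions the listed values agree: 317639 - 313062 + 1 = 4578 bp, which is 1526 codons, or 1525 residues plus one stop codon.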


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTGAAC TTGGTGGTCC AACAACGCAA TCTGGCGTTT TCTATCAAAA TTCCATCTCA 
GCACTCTATC TTGGACGACT CTGTGATGGT GTGGTTCGTC CTCAAAACGA ATTGGTTGTT
GAAGTTCGGG TTGAAGCCCC TGACTCGGTT GACGATACCA TTATTACATT TGAAGATCAG
CATCGACTCT ATATTCAGGC TAAAGAGCAT ATACAGAAGG GGACGAAACC CTGGAACAAA
CTCTGGAAGG ATTTTTCTAA ACAATTTCAG AAGTCCCAAT TTCAGCCCTC ACGAGACCGA
CTGGTATTGT ATACGGGTGA AACGCGGGAG TTAATTCACA ACCTACGTGA GATATGCGAA
CGATCATATG AAAGTCAAGA TTATAATGAA TGGTGGGGAA GACTATCACA GGCACAACAA
GGTTTATTGA AGGACATCAT CCCTCTTGTC TCTGCTGTTG ATGATGATGA TGATAAAAAA
TTTATACTTT CTCTTTTCCG TCATATCGAT GTCGAGGTTT TTACCTTAAG GCAAATAGAA
CGAGATCTTG TTGCTTTATG GATGCCTGCA TCCAATGAAA GCTATCAAAC AGTCTTCCGT
TTACTACGAG ATCATGTCGG TGAAAAGGGT CGCCAACGTG GAATATTTAC ATTAAAAATC
CTAAAATTAA TACTATTAGA TCATGAAATT ATAATAAATA CACCATCAAT TGAAATTACA
CGCACGTCTA TTCAACGATG TAATGCGTTA TTGATGCAGT GTAGGAATAC TGTTGCTTCA
ACAGGCGTGT TTATAAAGCG TGTCGTAGTA GATAAGATTG TTGATTGGAT TAAGAATACA
CCAGATAGCG ATAACGTTGC TGTCCTTCTT GATCAGGCTG GCATGGGCAA AACCGTGGTT
ATGCGCTCGG TACTATCCAC ACTTCAACAT CAGGATATTA TTACTCTTGC GATTAAGGCA
GACCAACACC TATCTGGTAT CGTTGAATAC CCTGAACTCC AAACCATGCT CGCATTCTCA
GAGCCTGTAG AAGCAATGAT TAGGCGATTA GCCATAACTG GCAAGGTTGT TGTTCTCATT
GACCAAATTG ATGCACTATC ACTTTCGCTT GCTCATGATC AGCGATCGCT AGATGTTGTT
CTTCAGCTCA TTGCTTATCT TCGAAATATC CCCAATGTAC GCATTGTTTT ATCGTGTCGA
ACATTTGATT ACCATAACGA TCCGCGATTT CGACAAACCC ATATAGCACA TCATTTCGTT
CTTGAAGAGC TTGAAAGCAC CGATGTAGCT GTTGTTCTTA AGAAGCTTGA TATCGAGTTT
GCCTTTCTTG CGGAGGTGGC TAAAAAGCTT TTAAAAACGC CGCTTCATCT CGATCTTTTT
TGTTTGGCTA TGGAACAGCA GGCAACGCCA CCTAATTACC AAAGGATGCT TCCGTCGCTT
ACTAATCTGC AAGATCTGTA TGGACTCCTA TGGCAGAATG TCATTTTAAG AGCAGGTATG
ACTGTTCCAT CTGCAACGGA TCGAGAGGAC GTTTTGCGGT TGATGACAGA ACAGATGCAT
AAACACCAAC GAGTAACTGT AGCACAGTCC TTTTTTATGA GTAGAGATAC AATGCATCTT
GAGAAAGCAG TTAATTGGCT AGGAAGTGCT GGCATTCTTG TTCAAGGTAA TGCAAGCTGG
AGTTTCTTAC ACCAAACCTT TTTCGATTAT TGTTATGCAA AACAATTTGT GGAAAGTAAT
AATAGTCTTA CTCAAATGAT CATATCTTCT GATCAAGGTC TATTTGTCCG CCCACAATTG
ATCCAAATTC TTACATTTCT TCGAGGAAGT AATCATCGTC TATATATTCA AGAAATCCAT
GCGCTTTTTC ACCTTCAATC GCTTCGCTTT CACATCAAAG ATCTTCTCTT TCGTTGGTTT
GGTGCTTGTA TAGTCCCAAC GAATGATGAG TGGTCTTTAG CACAACGAAT GCTCATTGAT
CCTCTACATC GCGGTCGTTT TTTAATGGCT ATTTATGGAA ATCTTGGATG GTTTGAACGT
ATGCGAGGTA TAAAGATACA AGCTCTGCTG AAGTATGATG ATCAAATTCT TGATAGTGAA
GTTATTCCGT ATCTGGATTC ACTTACCGAA GTTGCTCAGA CGGATATAAT GCAAATCCTT
CAACCATATA TTGGTCGTAG CAACGAGTGG AATAATCGGT TAGGTTGGGT TCTATCACAT
ATTCGCCATT GGCAAACACT ACATGCTGCT GAGGCTCTTG AAAACCTTAT TCGAATAACA
GATATTGAGT CATTCAAGAA TTCTTATGAT TTAGATGATG TAACTAAAGC ATACCCCCGT
GTAGGATGTC GAATTCTTCG AATTATCTTA GACAAGGTTC TTGATTCTTA CCTTATTGAA
TGCGAAAAAC ATGCAAAGAA ACAAAATTAT GATACGAAGT GGTTTATATC ATTGCCAAAC
CTTTCTCGTA GCTTAGAGGC TTTGAATTCA GGCATGCTTA TGGAGTCATT GCAAAATGCG
AGCAAGCTAG AGCCATCCAT ATTTCTGGAA GAACTACTGC CTTGGATTGA AAAAGTTTGT
GAAATATCAA ACCAAAGCGA CAATACTGAC AATAAGTATC CCGATGATAT TTTGTCACAT
GATTGGTACG ATACAACATA TCCTGTTTAT CATACTCTAA TTCATAGTAT TATTACATCT
TTGACAATAC TAGCAAAAAC CGAATCTAAT ATTTTCCATG TATATGCAGA TCGTCTTGCC
TCAAAACCAT TTAGGACAAC CCAACAATTA TTAATTTATG TCTATAGAGA TGTTGCAGAT
CAGTATACTG AAGATGCATT CAAATTTCTT CTAGGTGATA ATAGACGATT GATGCTGGGT
GATGCAGATA TTTATGATAG TCGTCAACTT ATTAATACGA TATACCCATT TCTGTCGCAT
CAACAACGTA ATGATCTTGA GCAATATATT TTATTATTTA ATCCCCATTG GAATTGGAAA
GTATATGGAT TAGACGCATT AAGATGGTGG CGACAAGAAC AGATGTTCCT CTTACAATCT
ATTCCTGTTC AGTATCTCTC TGATAAAGGT TTGGCTCGAC TACAAGAGTT AGAGCGGAAG
TTTCCAAACT ATCGGGTTTC ATCTGATCCA CAGAGGGGAG GAGGGTTTAC CATTCCCTCT
CCTATTCCTA AAACAAAAGC TCAAGCACTT TCCAACAAAG CATGGCTACG TATTATGAAG
GAGGACCAAA CAAATAATTG GTACAAACGC TCGAGCCATA GGGGTGGAAT TGGAGAGCTT
GCACATGTTT TAACCGAAAT GATTAAACAA GAACCAGATC GGTTTTATCA ACTAGCATCG
CAAGCTTTGG ACTTCATCCA TCCAATCTTT GCTCAGGCAT TTATTAGAGG ATTTAGTGAG
CTACATACTC ACCCGGATAG GTTTTTCAAG CTCGTGCGAC TCTTTGGAAA TAAACTTAAT
CTTGAAGGTA AACGAACGGT TACGTGGCTT TTGGAAGAGC GTATCAATGA AGGTATTCCA
GATGATGTAA TATCAATGTT GAAGTCATGG ATATATACAT CTGCTAATTC TGATGAGCAG
AATTGGGAAA ATAATGATGA TTTATATAGC GGATCTATCA ACACTGTTCG GGGTTGTGCA
CTGCGTGTTG TTATGCAGGC TCTCAGAAAA CAAGAAGATC AAATTACATT ACAACATCAA
TGGCAAGTTA TAGATGTTAT TGCAGCCGAT CCGTCACGGG TTTTGAGAGC AGGAATTATT
TATGAATTAA TGTTTCTTTT TAACAATGAC AGTGATCGGG CTATCCATAT TTTCGAACAA
TCAATACAGT TATATCCTAT TCTACTCCTT TCGCATCCAA CGCAGGAATT TATCTATTAC
GGTATGTTTT ACGATTGTTC AAAGATGTTA CCCTACATTA ATGCACTCTT GAACGTAAAT
AGCGAAGAAG CCCAACGACG TGGTGCTGAG CTTATATCGA TTGCAGCCAT CTCATCAAAA
GTGTTAACGC CAGAAGCTTC TATAGAAGTG CAAGAGATTG CGAGTCGGGT ATTAACTGGA
CGTGCCCTAT GGCGTCGTGG AGCAGCGCGT GTCTATGCAA ATAATCTCAT TTATGAGTCG
TTTGCTATCT GCCTTGATGC GTTATCTATG TTGCTTAATG ATGATGATGA TGATGTATGT
TCGATAATAG GAGATATTTT TGAGAATTTA CGTGAGGATC ATTTCATTCT ATTACACACT
TTTATTAAAA ATTATGCTAC CTCTAAATCA GTTACAACAA ATACACGCGC GTTTTATAAC
TATCTGAGTG AATATGGGCT TCTTGATCCT GATTGGACCT TATCAGTAAT TATGCTAACG
TTACACAATC CCTATGAAAA ACCGAGGAGT CAATGGGCAA CACCACGTGA AGAAGCGATC
AGAATCGTAA CCCGCATTTA TACTCATCCA CTTACAGATG TTGAATTGCG TAAGCAAGCT
ATGGATGTAT TTGATGCATT AATGGAGAGA TATACCGGTC AGGCACTCAA AGTATTGTCG
GAGTGGGATC AGCGGTAG
 
Protein sequence
MPELGGPTTQ SGVFYQNSIS ALYLGRLCDG VVRPQNELVV EVRVEAPDSV DDTIITFEDQ 
HRLYIQAKEH IQKGTKPWNK LWKDFSKQFQ KSQFQPSRDR LVLYTGETRE LIHNLREICE
RSYESQDYNE WWGRLSQAQQ GLLKDIIPLV SAVDDDDDKK FILSLFRHID VEVFTLRQIE
RDLVALWMPA SNESYQTVFR LLRDHVGEKG RQRGIFTLKI LKLILLDHEI IINTPSIEIT
RTSIQRCNAL LMQCRNTVAS TGVFIKRVVV DKIVDWIKNT PDSDNVAVLL DQAGMGKTVV
MRSVLSTLQH QDIITLAIKA DQHLSGIVEY PELQTMLAFS EPVEAMIRRL AITGKVVVLI
DQIDALSLSL AHDQRSLDVV LQLIAYLRNI PNVRIVLSCR TFDYHNDPRF RQTHIAHHFV
LEELESTDVA VVLKKLDIEF AFLAEVAKKL LKTPLHLDLF CLAMEQQATP PNYQRMLPSL
TNLQDLYGLL WQNVILRAGM TVPSATDRED VLRLMTEQMH KHQRVTVAQS FFMSRDTMHL
EKAVNWLGSA GILVQGNASW SFLHQTFFDY CYAKQFVESN NSLTQMIISS DQGLFVRPQL
IQILTFLRGS NHRLYIQEIH ALFHLQSLRF HIKDLLFRWF GACIVPTNDE WSLAQRMLID
PLHRGRFLMA IYGNLGWFER MRGIKIQALL KYDDQILDSE VIPYLDSLTE VAQTDIMQIL
QPYIGRSNEW NNRLGWVLSH IRHWQTLHAA EALENLIRIT DIESFKNSYD LDDVTKAYPR
VGCRILRIIL DKVLDSYLIE CEKHAKKQNY DTKWFISLPN LSRSLEALNS GMLMESLQNA
SKLEPSIFLE ELLPWIEKVC EISNQSDNTD NKYPDDILSH DWYDTTYPVY HTLIHSIITS
LTILAKTESN IFHVYADRLA SKPFRTTQQL LIYVYRDVAD QYTEDAFKFL LGDNRRLMLG
DADIYDSRQL INTIYPFLSH QQRNDLEQYI LLFNPHWNWK VYGLDALRWW RQEQMFLLQS
IPVQYLSDKG LARLQELERK FPNYRVSSDP QRGGGFTIPS PIPKTKAQAL SNKAWLRIMK
EDQTNNWYKR SSHRGGIGEL AHVLTEMIKQ EPDRFYQLAS QALDFIHPIF AQAFIRGFSE
LHTHPDRFFK LVRLFGNKLN LEGKRTVTWL LEERINEGIP DDVISMLKSW IYTSANSDEQ
NWENNDDLYS GSINTVRGCA LRVVMQALRK QEDQITLQHQ WQVIDVIAAD PSRVLRAGII
YELMFLFNND SDRAIHIFEQ SIQLYPILLL SHPTQEFIYY GMFYDCSKML PYINALLNVN
SEEAQRRGAE LISIAAISSK VLTPEASIEV QEIASRVLTG RALWRRGAAR VYANNLIYES
FAICLDALSM LLNDDDDDVC SIIGDIFENL REDHFILLHT FIKNYATSKS VTTNTRAFYN
YLSEYGLLDP DWTLSVIMLT LHNPYEKPRS QWATPREEAI RIVTRIYTHP LTDVELRKQA
MDVFDALMER YTGQALKVLS EWDQR
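
The protein sequence can be spot-checked against the gene sequence by translating individual blocks. The sketch below is a minimal pure-Python translation, not the database's own method. It uses the standard codon assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as start codons). The names `CODON_TABLE` and `translate` are illustrative, and only the first and last lines of the gene sequence are included.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used on this page) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence, copied from the record.
FIRST_BLOCK = "ATGCCTGAAC TTGGTGGTCC AACAACGCAA TCTGGCGTTT TCTATCAAAA TTCCATCTCA"
LAST_BLOCK = "GAGTGGGATC AGCGGTAG"

if __name__ == "__main__":
    print(translate(FIRST_BLOCK))
    print(translate(LAST_BLOCK))
```

The first block translates to the opening 20 residues of the protein sequence, and the last block to its final five residues followed by a TAG stop codon. Translating the last block in isolation works because the 4560 bases before it are a multiple of three, so the block begins in frame.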