Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5217 |
Symbol | |
ID | 5737175 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | - |
Start bp | 313062 |
End bp | 317639 |
Gene Length | 4578 bp |
Protein Length | 1525 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 641282381 |
Product | hypothetical protein |
Protein accession | YP_001547972 |
Protein GI | 159901726 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTGAAC TTGGTGGTCC AACAACGCAA TCTGGCGTTT TCTATCAAAA TTCCATCTCA GCACTCTATC TTGGACGACT CTGTGATGGT GTGGTTCGTC CTCAAAACGA ATTGGTTGTT GAAGTTCGGG TTGAAGCCCC TGACTCGGTT GACGATACCA TTATTACATT TGAAGATCAG CATCGACTCT ATATTCAGGC TAAAGAGCAT ATACAGAAGG GGACGAAACC CTGGAACAAA CTCTGGAAGG ATTTTTCTAA ACAATTTCAG AAGTCCCAAT TTCAGCCCTC ACGAGACCGA CTGGTATTGT ATACGGGTGA AACGCGGGAG TTAATTCACA ACCTACGTGA GATATGCGAA CGATCATATG AAAGTCAAGA TTATAATGAA TGGTGGGGAA GACTATCACA GGCACAACAA GGTTTATTGA AGGACATCAT CCCTCTTGTC TCTGCTGTTG ATGATGATGA TGATAAAAAA TTTATACTTT CTCTTTTCCG TCATATCGAT GTCGAGGTTT TTACCTTAAG GCAAATAGAA CGAGATCTTG TTGCTTTATG GATGCCTGCA TCCAATGAAA GCTATCAAAC AGTCTTCCGT TTACTACGAG ATCATGTCGG TGAAAAGGGT CGCCAACGTG GAATATTTAC ATTAAAAATC CTAAAATTAA TACTATTAGA TCATGAAATT ATAATAAATA CACCATCAAT TGAAATTACA CGCACGTCTA TTCAACGATG TAATGCGTTA TTGATGCAGT GTAGGAATAC TGTTGCTTCA ACAGGCGTGT TTATAAAGCG TGTCGTAGTA GATAAGATTG TTGATTGGAT TAAGAATACA CCAGATAGCG ATAACGTTGC TGTCCTTCTT GATCAGGCTG GCATGGGCAA AACCGTGGTT ATGCGCTCGG TACTATCCAC ACTTCAACAT CAGGATATTA TTACTCTTGC GATTAAGGCA GACCAACACC TATCTGGTAT CGTTGAATAC CCTGAACTCC AAACCATGCT CGCATTCTCA GAGCCTGTAG AAGCAATGAT TAGGCGATTA GCCATAACTG GCAAGGTTGT TGTTCTCATT GACCAAATTG ATGCACTATC ACTTTCGCTT GCTCATGATC AGCGATCGCT AGATGTTGTT CTTCAGCTCA TTGCTTATCT TCGAAATATC CCCAATGTAC GCATTGTTTT ATCGTGTCGA ACATTTGATT ACCATAACGA TCCGCGATTT CGACAAACCC ATATAGCACA TCATTTCGTT CTTGAAGAGC TTGAAAGCAC CGATGTAGCT GTTGTTCTTA AGAAGCTTGA TATCGAGTTT GCCTTTCTTG CGGAGGTGGC TAAAAAGCTT TTAAAAACGC CGCTTCATCT CGATCTTTTT TGTTTGGCTA TGGAACAGCA GGCAACGCCA CCTAATTACC AAAGGATGCT TCCGTCGCTT ACTAATCTGC AAGATCTGTA TGGACTCCTA TGGCAGAATG TCATTTTAAG AGCAGGTATG ACTGTTCCAT CTGCAACGGA TCGAGAGGAC GTTTTGCGGT TGATGACAGA ACAGATGCAT AAACACCAAC GAGTAACTGT AGCACAGTCC TTTTTTATGA GTAGAGATAC AATGCATCTT GAGAAAGCAG TTAATTGGCT AGGAAGTGCT GGCATTCTTG TTCAAGGTAA TGCAAGCTGG AGTTTCTTAC ACCAAACCTT TTTCGATTAT TGTTATGCAA AACAATTTGT GGAAAGTAAT AATAGTCTTA CTCAAATGAT CATATCTTCT GATCAAGGTC TATTTGTCCG CCCACAATTG ATCCAAATTC TTACATTTCT TCGAGGAAGT AATCATCGTC TATATATTCA AGAAATCCAT GCGCTTTTTC ACCTTCAATC GCTTCGCTTT CACATCAAAG ATCTTCTCTT TCGTTGGTTT GGTGCTTGTA TAGTCCCAAC GAATGATGAG TGGTCTTTAG CACAACGAAT GCTCATTGAT CCTCTACATC GCGGTCGTTT TTTAATGGCT ATTTATGGAA ATCTTGGATG GTTTGAACGT ATGCGAGGTA TAAAGATACA AGCTCTGCTG AAGTATGATG ATCAAATTCT TGATAGTGAA GTTATTCCGT ATCTGGATTC ACTTACCGAA GTTGCTCAGA CGGATATAAT GCAAATCCTT CAACCATATA TTGGTCGTAG CAACGAGTGG AATAATCGGT TAGGTTGGGT TCTATCACAT ATTCGCCATT GGCAAACACT ACATGCTGCT GAGGCTCTTG AAAACCTTAT TCGAATAACA GATATTGAGT CATTCAAGAA TTCTTATGAT TTAGATGATG TAACTAAAGC ATACCCCCGT GTAGGATGTC GAATTCTTCG AATTATCTTA GACAAGGTTC TTGATTCTTA CCTTATTGAA TGCGAAAAAC ATGCAAAGAA ACAAAATTAT GATACGAAGT GGTTTATATC ATTGCCAAAC CTTTCTCGTA GCTTAGAGGC TTTGAATTCA GGCATGCTTA TGGAGTCATT GCAAAATGCG AGCAAGCTAG AGCCATCCAT ATTTCTGGAA GAACTACTGC CTTGGATTGA AAAAGTTTGT GAAATATCAA ACCAAAGCGA CAATACTGAC AATAAGTATC CCGATGATAT TTTGTCACAT GATTGGTACG ATACAACATA TCCTGTTTAT CATACTCTAA TTCATAGTAT TATTACATCT TTGACAATAC TAGCAAAAAC CGAATCTAAT ATTTTCCATG TATATGCAGA TCGTCTTGCC TCAAAACCAT TTAGGACAAC CCAACAATTA TTAATTTATG TCTATAGAGA TGTTGCAGAT CAGTATACTG AAGATGCATT CAAATTTCTT CTAGGTGATA ATAGACGATT GATGCTGGGT GATGCAGATA TTTATGATAG TCGTCAACTT ATTAATACGA TATACCCATT TCTGTCGCAT CAACAACGTA ATGATCTTGA GCAATATATT TTATTATTTA ATCCCCATTG GAATTGGAAA GTATATGGAT TAGACGCATT AAGATGGTGG CGACAAGAAC AGATGTTCCT CTTACAATCT ATTCCTGTTC AGTATCTCTC TGATAAAGGT TTGGCTCGAC TACAAGAGTT AGAGCGGAAG TTTCCAAACT ATCGGGTTTC ATCTGATCCA CAGAGGGGAG GAGGGTTTAC CATTCCCTCT CCTATTCCTA AAACAAAAGC TCAAGCACTT TCCAACAAAG CATGGCTACG TATTATGAAG GAGGACCAAA CAAATAATTG GTACAAACGC TCGAGCCATA GGGGTGGAAT TGGAGAGCTT GCACATGTTT TAACCGAAAT GATTAAACAA GAACCAGATC GGTTTTATCA ACTAGCATCG CAAGCTTTGG ACTTCATCCA TCCAATCTTT GCTCAGGCAT TTATTAGAGG ATTTAGTGAG CTACATACTC ACCCGGATAG GTTTTTCAAG CTCGTGCGAC TCTTTGGAAA TAAACTTAAT CTTGAAGGTA AACGAACGGT TACGTGGCTT TTGGAAGAGC GTATCAATGA AGGTATTCCA GATGATGTAA TATCAATGTT GAAGTCATGG ATATATACAT CTGCTAATTC TGATGAGCAG AATTGGGAAA ATAATGATGA TTTATATAGC GGATCTATCA ACACTGTTCG GGGTTGTGCA CTGCGTGTTG TTATGCAGGC TCTCAGAAAA CAAGAAGATC AAATTACATT ACAACATCAA TGGCAAGTTA TAGATGTTAT TGCAGCCGAT CCGTCACGGG TTTTGAGAGC AGGAATTATT TATGAATTAA TGTTTCTTTT TAACAATGAC AGTGATCGGG CTATCCATAT TTTCGAACAA TCAATACAGT TATATCCTAT TCTACTCCTT TCGCATCCAA CGCAGGAATT TATCTATTAC GGTATGTTTT ACGATTGTTC AAAGATGTTA CCCTACATTA ATGCACTCTT GAACGTAAAT AGCGAAGAAG CCCAACGACG TGGTGCTGAG CTTATATCGA TTGCAGCCAT CTCATCAAAA GTGTTAACGC CAGAAGCTTC TATAGAAGTG CAAGAGATTG CGAGTCGGGT ATTAACTGGA CGTGCCCTAT GGCGTCGTGG AGCAGCGCGT GTCTATGCAA ATAATCTCAT TTATGAGTCG TTTGCTATCT GCCTTGATGC GTTATCTATG TTGCTTAATG ATGATGATGA TGATGTATGT TCGATAATAG GAGATATTTT TGAGAATTTA CGTGAGGATC ATTTCATTCT ATTACACACT TTTATTAAAA ATTATGCTAC CTCTAAATCA GTTACAACAA ATACACGCGC GTTTTATAAC TATCTGAGTG AATATGGGCT TCTTGATCCT GATTGGACCT TATCAGTAAT TATGCTAACG TTACACAATC CCTATGAAAA ACCGAGGAGT CAATGGGCAA CACCACGTGA AGAAGCGATC AGAATCGTAA CCCGCATTTA TACTCATCCA CTTACAGATG TTGAATTGCG TAAGCAAGCT ATGGATGTAT TTGATGCATT AATGGAGAGA TATACCGGTC AGGCACTCAA AGTATTGTCG GAGTGGGATC AGCGGTAG
|
Protein sequence | MPELGGPTTQ SGVFYQNSIS ALYLGRLCDG VVRPQNELVV EVRVEAPDSV DDTIITFEDQ HRLYIQAKEH IQKGTKPWNK LWKDFSKQFQ KSQFQPSRDR LVLYTGETRE LIHNLREICE RSYESQDYNE WWGRLSQAQQ GLLKDIIPLV SAVDDDDDKK FILSLFRHID VEVFTLRQIE RDLVALWMPA SNESYQTVFR LLRDHVGEKG RQRGIFTLKI LKLILLDHEI IINTPSIEIT RTSIQRCNAL LMQCRNTVAS TGVFIKRVVV DKIVDWIKNT PDSDNVAVLL DQAGMGKTVV MRSVLSTLQH QDIITLAIKA DQHLSGIVEY PELQTMLAFS EPVEAMIRRL AITGKVVVLI DQIDALSLSL AHDQRSLDVV LQLIAYLRNI PNVRIVLSCR TFDYHNDPRF RQTHIAHHFV LEELESTDVA VVLKKLDIEF AFLAEVAKKL LKTPLHLDLF CLAMEQQATP PNYQRMLPSL TNLQDLYGLL WQNVILRAGM TVPSATDRED VLRLMTEQMH KHQRVTVAQS FFMSRDTMHL EKAVNWLGSA GILVQGNASW SFLHQTFFDY CYAKQFVESN NSLTQMIISS DQGLFVRPQL IQILTFLRGS NHRLYIQEIH ALFHLQSLRF HIKDLLFRWF GACIVPTNDE WSLAQRMLID PLHRGRFLMA IYGNLGWFER MRGIKIQALL KYDDQILDSE VIPYLDSLTE VAQTDIMQIL QPYIGRSNEW NNRLGWVLSH IRHWQTLHAA EALENLIRIT DIESFKNSYD LDDVTKAYPR VGCRILRIIL DKVLDSYLIE CEKHAKKQNY DTKWFISLPN LSRSLEALNS GMLMESLQNA SKLEPSIFLE ELLPWIEKVC EISNQSDNTD NKYPDDILSH DWYDTTYPVY HTLIHSIITS LTILAKTESN IFHVYADRLA SKPFRTTQQL LIYVYRDVAD QYTEDAFKFL LGDNRRLMLG DADIYDSRQL INTIYPFLSH QQRNDLEQYI LLFNPHWNWK VYGLDALRWW RQEQMFLLQS IPVQYLSDKG LARLQELERK FPNYRVSSDP QRGGGFTIPS PIPKTKAQAL SNKAWLRIMK EDQTNNWYKR SSHRGGIGEL AHVLTEMIKQ EPDRFYQLAS QALDFIHPIF AQAFIRGFSE LHTHPDRFFK LVRLFGNKLN LEGKRTVTWL LEERINEGIP DDVISMLKSW IYTSANSDEQ NWENNDDLYS GSINTVRGCA LRVVMQALRK QEDQITLQHQ WQVIDVIAAD PSRVLRAGII YELMFLFNND SDRAIHIFEQ SIQLYPILLL SHPTQEFIYY GMFYDCSKML PYINALLNVN SEEAQRRGAE LISIAAISSK VLTPEASIEV QEIASRVLTG RALWRRGAAR VYANNLIYES FAICLDALSM LLNDDDDDVC SIIGDIFENL REDHFILLHT FIKNYATSKS VTTNTRAFYN YLSEYGLLDP DWTLSVIMLT LHNPYEKPRS QWATPREEAI RIVTRIYTHP LTDVELRKQA MDVFDALMER YTGQALKVLS EWDQR
|
| |