Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5003 |
Symbol | |
ID | 5736962 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | - |
Start bp | 4800 |
End bp | 9140 |
Gene Length | 4341 bp |
Protein Length | 1446 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 641282170 |
Product | hypothetical protein |
Protein accession | YP_001547761 |
Protein GI | 159901515 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTCTCG ACACCTTGCA GCAGCATCTG CTGTCGGCCT ATGACTCCAC CGCCCGCAGC TTGCGCTTGG TTTCCGCAAC GCTTGGTTCG GCCCCGATCG CGGAACTGTT AGGCCAACCC TATCTCCAAA TCCAGACACT GGAGATTACA GACACGGATG GCCCTCTCAC CGATGGTAGC GCTGTCGTGG TGAGCGGCCT GAGCACCCTG TTTGGTTTCA CCTTGGTCCA GGTCGCAGCC AGCTTCACGC TTGACGACCA AGCAATACCC CAGCTCGCCC TCAAGCTGCT GCTACCAGAC GGCACATCGG CTACGGCTTG GCGCTTCGCT GATAGCTTCC CCCAGTTGGC GGGCACAATC TTCGATGATG TCACCCTTTC TCCTGGAACG ATGCCGTTCC TGCTCTTCGC CTCGGCACCC TGCGACGATT CCGGTACTAG CTTCGCTGCT GGTCTGAGCT TTTATGGCGC ACTCACGCCG GGCGGCCCCA GTTTCGGCTA TGTCACTGCG GTGCTGGGGA CACTGGCCAG CGAGATCGCA ACTGGCCCGA TCATCCCGCT GCCCAACGGT CCAACTATTA ACCTTGCGCT AGACTTGGCA TCGAGCTTCA ACAAATATTT TCCCAGTTTG AGCCAAGAAT TACCGGTGGA ACTGCGGCTG CTGAGCGCCT TTGACCAGAG CATGCCGCCG ACCGCACCGC GCTCTATGGT CGGGCTTGAG CTTAGCACGA CCTTCGATCT TGGTACTGGC TCGATCCGAC TGGCCACCAT GCTGACGGGT ACCCGGATCG GTGTGCTCAA ATTGCGTGCC AGCATCGACA ATTTGCCATT GCCAGCACCT GGCCAACTGG TGGCGCTGCT GGGCGGTGAC GAGTTGGTTG GCTCGTTACC TGAGCGCTAC CAGAACCCCG ACACGGTGCG GCTCAGCGGC TTCGGCTTTG GTATCAGCTT GGCTACGCTC CAGCCGGTCA ATCTCTGGCT CCAGCTCGCC GCCCTCGAAG GGGAGGGCTG GGAGATCATC CCGGGCGACC TGACGCTGAA ACGGGTGCGA GCATGGTTTA CCGTCAATAA CCCGCTCGAA AGTACGCGCA GCGCCCAAAC CGCCATCTTC GCCGAGCTCG ACGTACCCAG TGCTAATCCA CTCTTTTCGA TGGAGGTGCA TGCGTATGCG CCCAACTACC GCATTCAGGC CGCGCTGGTC GAAGGCACGA CAGTTAAGCT AACCGACCTG CTCGCCGTGT ATCTGCCGCA GGTCACAGAC GCGCCGGAGT TTCTGCTGGA GGAGCTTGGG CTGGCGGTTG AGTTCGCTAG TCCAAAGAAC CGTCTGACCT TCGAGACGAC GATCGAGCAA GACACCCCGT GGATGCTGCC GTTGGGCGGA CTGGAGCCAC TCCAAGTTCA GTTCATCACT ATTGCCTTGG ATAACTTTAG CAACGGCGAC AGCATGGGCG GCCTGATCGA CGGCCAGCTT ACTATCTTGG GAGCCCAGAC CAGTTGCTTC TACCAGCTTC CTGGCTCGTT TCGCATCAGC GCCCACATCC CAGCCTTTGA CGTTAACCTG AAACGCATCG CCAGCGAATT GGCTGGCAGC GATTGGGTGC CGCCTAGCTG GCTGCCAGAT TTTACCCTGC CGCAGACGTA CCTCGCGGTC GAGCGCGATC GCGATGGCGA GCAGTCGATC TTTACGCTGC TGCTCCAGGC TGAGCCAGCG GGACTCGGCA CACTGGGCTT ACAGGTGCTA CGCGAGCGGG GCAGCTGGGG CTTCGCCGCT GGCGTAGACC TGATCGCCGA CCGCGTTTCA GATGTGCCTG GCTTGGCAGC GTTCAAGCCG TTCGACGATC TGTTCCAGTT CAGCAACCTA CTCTTGGTTA TCTCAAGCAT CGCGAGCCCA GCCTTTACCT TCCCCGACAT GAGCCACCTC GGGTCCAGCG GCAGTGGGGG TCGTCAGATC GTGCTGCCCA GCCAAGCCGG CGGGCTGCGG TCTGGCCTCA CGCTCTATGC AGATATCACA CTGAGCAACT CTCCGACACT GACGATGCTC CAGACCTTTC TTAAGATCTC GGGCGACATC GGTATCACCA TGCTCTTGGG TGAGAATCCG ATGCAGAATG CACGGCTGTT CGCCAGCGTT GACGTTCGCG TGCTCACCAC CCTGCGCATC GTGGCTGAGT TCGGTGCCAC ACTCCAAGGC AGTGAAACTG GGCTATTTCT CCAAGGCCAG GCACTGGTCG TGCTCGCTGA TCAGCCGCTG GAGTTTGACA TGGCGATGCT CCTCGTCGAT GACGGTGCAC TGATCGCCGG GAACCTCAAG GGCGGCCCGG TGAACTATCA CAGCCTACAA ATCAGCAATC TCGCGCTGGA GCTGGGCGTG GATTTCGAGG GCGTACCCAG CGTTGGCTTT GCGGCGACGA TCGACACGCC GACCTTCGAA AGTTCGCTGG CGATCTTCTT CGACTCAACC GACCCGGCCA ATAGCATGCT CGCCGGCGCG GTCAGCGACC TCACGCTGCG CGACGCGGTG GAGGTGCTTG TTGGTGGAGT GCTACCAGTG TCGCTTGGCG AGGCACTAGC CCAAGTCGGA CTCACGGGAA CAGGGCAATT TCTCCTGTCG GGCGACTTGT CCACAGCCTT GGACAGCTAC GACATGACGG CGGTCGCGGC GGCCCTCCAA GCGGTCGGCG TGACTTTTCC CCAGCAGGGT CAATCCGCGC TGTTGGTCGT GAACACGCCC GGCCATGTCT GGCACCTGAC CGATCTGACG ACCATGCGTC ACTTTGAGTT TGAGCGCCAG GGCAACCAGA TCGTGGGTGC GTTGGAGGCA CAACTCTACT GGGTACCAGG TTCACGTGGT GCCTACATTG GTCAGACCTA CTTTCCGAGC GGCTTTCTGC TGAGTGGTAC GATCGTCTTC TTTGGCCTAG AAGTCAGCCT GTCGTTCACA ATTAGCCGAG GGCAGGGCAT CAGTGCTGAG GCACAGCTAA GCCCGATCGC GATTGGCGGA TCGCTGCTCA CAATCACCAG TCTAGGCGGC AAGGCTGGCC CCCATCTCTC ACTTTCGACG GCGGCTCCAG AGCACTTTTT CCTGAGTGGC GACATTCAGT TGCTCGGTAT CATTGGTGTC GGCGTACAGA TCAGCATTGC GCAAGCCGGT CTGCTCTTCG ACCTGCGCGG TGAGCTGCTC CCGGGGATGA CCTTTGACCT GCACGCGAGT TTCACCAGCC TGCACAGTTT CAGTGCAAGC GGATCGGTGG TGCTGGGGGT CAGTCGCATG CTCGACTTCG GGCCGCTGGG GTCACTCAGC CTCAATGTCG GTATCAACGG ACAGCTAACG ATCGCCCACC AAGATGGTGT GAGCAGTGCT ACCTTCCAAG GTGGATTCCA GTTCGGCACG AACGCGTACA GGGTTGGCCC GATTACACTC GACATCAGCC GTGCGGCCCT CGCCGAACTG GGTGAGACCG TGGCACACCA TCTCGAGGCA GTTTTCCATC ACGTGATGCA CGACCTGCGA GGATGGTTCG AGTTGGTCAA ACAGAACCTA ATCCAAGGCA TCAGCGGGAT GGGCCAGATT ATCCACGTGT TGCGCCAGCA CTTCGGCCAA GACAAGCACG CCGTCGCGAC GCTGCTTAAG GAGTTGCTCG TCCAAGGTGT TGAGCCAATC GCCGATGCTC TGCGCAGCAC CTTCGAACTG GATAGCCGTG GGCTGGCACG CTTGCTGCAT GACACCGGCT ACGCGGTCGA GGATGTGACG CGGGCGCTGC GTACGGAATT CGACAAAAGC CACCGTGAGG CAGCGGCGAT CCTCAAGGAG GTTGGTTACT CGGCGGATGC GGTGGCACGG GCGCTCCTCC AGAAGTTCGA GAACGATCGA ACGCGGGCGA TCCAAGTGCT GCGCGAGGTA GGCTACGATG CCCGCGAGGT GACGGGTGTC ACCGTGGGGG TCTTCCAGCA GTCGGCGGCG GACACGGTGG CACTGCTCAA AGCGGCCGGC TACGAAGTCG AAGAGGCCGC CCGTGGCTTA CATCAGGCCT TGGCCTTGGA TAGCCGGGCT GTGGTGACGC TGCTCGGACA GTCAGGCTAT GACGTGAAGG CGGTCGGACG CGCGTTACAG CATACCTTCG ACAAAAATGC CAACGAGGCA TCGCGGATTC TCAAGGAACT GAATTACCCG ATCCACGATC TGGCCAAGGT TTTGCGCCAC GTCTATGATA AAGTAGCCAA GGAGGCTGGC AACGTGCTCA AGGAACTCGG TTATGCCAGT AAGGATGTCT CGGATTCGCT CCAAAAGGTC TTTGACTTAG GAGAAAAAGA AGTCGAGAAG CTCATCAAGG ACATTTTCTA G
|
Protein sequence | MTLDTLQQHL LSAYDSTARS LRLVSATLGS APIAELLGQP YLQIQTLEIT DTDGPLTDGS AVVVSGLSTL FGFTLVQVAA SFTLDDQAIP QLALKLLLPD GTSATAWRFA DSFPQLAGTI FDDVTLSPGT MPFLLFASAP CDDSGTSFAA GLSFYGALTP GGPSFGYVTA VLGTLASEIA TGPIIPLPNG PTINLALDLA SSFNKYFPSL SQELPVELRL LSAFDQSMPP TAPRSMVGLE LSTTFDLGTG SIRLATMLTG TRIGVLKLRA SIDNLPLPAP GQLVALLGGD ELVGSLPERY QNPDTVRLSG FGFGISLATL QPVNLWLQLA ALEGEGWEII PGDLTLKRVR AWFTVNNPLE STRSAQTAIF AELDVPSANP LFSMEVHAYA PNYRIQAALV EGTTVKLTDL LAVYLPQVTD APEFLLEELG LAVEFASPKN RLTFETTIEQ DTPWMLPLGG LEPLQVQFIT IALDNFSNGD SMGGLIDGQL TILGAQTSCF YQLPGSFRIS AHIPAFDVNL KRIASELAGS DWVPPSWLPD FTLPQTYLAV ERDRDGEQSI FTLLLQAEPA GLGTLGLQVL RERGSWGFAA GVDLIADRVS DVPGLAAFKP FDDLFQFSNL LLVISSIASP AFTFPDMSHL GSSGSGGRQI VLPSQAGGLR SGLTLYADIT LSNSPTLTML QTFLKISGDI GITMLLGENP MQNARLFASV DVRVLTTLRI VAEFGATLQG SETGLFLQGQ ALVVLADQPL EFDMAMLLVD DGALIAGNLK GGPVNYHSLQ ISNLALELGV DFEGVPSVGF AATIDTPTFE SSLAIFFDST DPANSMLAGA VSDLTLRDAV EVLVGGVLPV SLGEALAQVG LTGTGQFLLS GDLSTALDSY DMTAVAAALQ AVGVTFPQQG QSALLVVNTP GHVWHLTDLT TMRHFEFERQ GNQIVGALEA QLYWVPGSRG AYIGQTYFPS GFLLSGTIVF FGLEVSLSFT ISRGQGISAE AQLSPIAIGG SLLTITSLGG KAGPHLSLST AAPEHFFLSG DIQLLGIIGV GVQISIAQAG LLFDLRGELL PGMTFDLHAS FTSLHSFSAS GSVVLGVSRM LDFGPLGSLS LNVGINGQLT IAHQDGVSSA TFQGGFQFGT NAYRVGPITL DISRAALAEL GETVAHHLEA VFHHVMHDLR GWFELVKQNL IQGISGMGQI IHVLRQHFGQ DKHAVATLLK ELLVQGVEPI ADALRSTFEL DSRGLARLLH DTGYAVEDVT RALRTEFDKS HREAAAILKE VGYSADAVAR ALLQKFENDR TRAIQVLREV GYDAREVTGV TVGVFQQSAA DTVALLKAAG YEVEEAARGL HQALALDSRA VVTLLGQSGY DVKAVGRALQ HTFDKNANEA SRILKELNYP IHDLAKVLRH VYDKVAKEAG NVLKELGYAS KDVSDSLQKV FDLGEKEVEK LIKDIF
|
| |