Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5107 |
Symbol | |
ID | 5737065 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | - |
Start bp | 141871 |
End bp | 144846 |
Gene Length | 2976 bp |
Protein Length | 991 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641282272 |
Product | hypothetical protein |
Protein accession | YP_001547863 |
Protein GI | 159901617 |
COG category | [R] General function prediction only |
COG ID | [COG0433] Predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.286492 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATGGAC GCTTAATCAT GACGGTCTAT GGCCGCACGC TGGCTGGAAT TGGCGTGGTG CTGCTGGTGC TGGCGCGGCT GCTTAGTCCG TGGCTGCTCG GCCATCTCAC CAGCCCACGC CTTGCCCCGT TGGCCACGCT GGTCACACGC ATACATGCGA TCAGTGGGTG GCTGCTGGTG GGCTACGGGC TGTGGGTGCT TGTCACGCTC GTCGCATGGC TGCGGTTGCG GCGACGGCCT GCGGTGACCG GCCAGTGGTA TCTGCTGCGC ATTCCCAAGC CCATGACGAC CGACCCGCAT AGTGCCACGG CGGCCCAGCT TGGCCAACGT GAGGCGGCGA TGGCGGGCGA GATCGTGCGG GTCTTGCAAG CGATTGTCGC GCACGCCCCT GCCGATCGGC GGATTGCCCT CGAACTCTGG TACACCGCGC AGGGGATTGC CTGGAGCCTC TGGCTCTCGC ATCCCGATTT GTATGCGCCC GTGTTCAGTG CGATGCAGGG CTTTGTGCCC GGCGGCGATC TGCTGGCCCA GCCCGATCCG GTTGCCACCG CTGCTCCCGC GCTGGAGTGG ACGCAGCACT CCCTTGCGCA GCCTGCGATC TATCCGTTAC GCCGTGCCGA TGCGATGGTT GGCGACCCGC TCGATGCCCT GATTGGGGGC TTGCGACCGC AACAGGGCGT GACCGCGATT GGCCTATCAC TGACCCTTGG AGCCGTGCCT GATGGCTGGC AGCGCCAAAA CCGTGCGCTG GCGGATTGGC TCGCCCTTGA TGCCAAGGAG CAGCAGGCCA GTGCCAGCAA AGAGACCGCT GTCGCGATGT TGGCCAAGCT GCAAAGCCCA TGCGTGACCA TTGCGCTGCG CACCGTCGTG CTGGCCGAGA GCGCCGCACT CGCCACCGCC CAACACCAGA TCCTGCTCGG CGCACTCGAC ACCTATACCA TGCAGCACGA CTCCATGCGT CAAGCCTGGA CCCATGGCCG TATCCACACC ACGATCACCC CCGCCGTGCG CGACCGCCAT ACCGTGCGCT GGTTGGCCAT GCCGTTGCCG CCGCTCGTAC CAACTCCCAC GCTGCCCGTG CTGAGCCTGC CCGAAGTCAC GGCGCTCTGG CATCTGCCAA CGGGTCGCCA TTTGACCGTG GCAATCTGGC GCAATAATCG CTTTGCCCCG CCTAGCCCGG CCTTGATGGT CAGTCCCATC GGGGAGCCAC CCCCAGCGTT TGCTGCGACC ACTCCCTTAA AGGATCGCAT CGTGCGGATC ACGGGCGGGG TCGCAGCGGA TGGCCAACCC GTCTATTTGG GCTTTCCGCT CAAAAGCTAC ACCTTCCATC AGCAAATTAC CGCCTCAACG GGGGCGGGTA AAAGTCACGC GGCCAAGGTG CAATTGGGTG AATTGCTGCG GATTGGGGCA GGCTTTGGCG TGATCGACTT CAAGGGGGAT TTGGTCAATG ACCTGTTGAC CATGATCCCC GATGATCGGC TCGATGATGT GATCGTCTTT GACCCGCTTG ACCCTGACCA CTGCCTTGGA TTGAACCTGC TCGATCCCAC CTATTTAAAT AAGGATACCG AACCGGATTT TTTGGTCGAA TTGCTCGAAG CCTTGGTGGC GAGTACGGAT AGCCACTGGG GTGATTCGGC GGGGATGCAG GAAGCCCTGC GCTATGGGGC CTTGACCCTG ATGGAAGGCG AGCCAACGCC GACGATTGCC CACTTGTATT TGCTCTTTTC CAGTAGTGCC TACCGCGCGA CCGTCCTAGA GCGGGTGACT GACCCCGACT TGGTGTTGTT CTGGACCTAT CAATTTCCCA AGCTATCGGA CACCCAAAAA TCCTCAATGA CCGCGCTCCA GCGGCGCTTG AGTCAGTTTT TGATCAATCG CACGGTGCGG ATTACCTGCA ACCAGACCCG CACCACCATT CCCTTCCGGC AGGTGATGGA TCAGCGCAAG ATTGTGCTGG CCAAGTTGCC CGTCGAATTG ATTGGGGCCA CCGCTGGCGG GATTCTGGCC AATGTGCTGA TTAACCTCGT GTTGGCGGCG GCGTTTAGTC GCTTGGATAC GCCCGAGGAC GAACGCGAGC CGTGGGTGCT GGTCGTCGAT GAGTTTCAGG AGGCCATGCG GCGCGGCGAT CCGGCCAATT ACCAGAAGAT TCTGGAACGC TTACGCTCGT TTGGGATTGG CCTGGTCTTG ATGCACCAAG GCACGAGCCA ACTGCCTGCG GAGATGTTGG CCACGACGCT GGAGATTGTG CAGACGCGCT TGATCCTCTC GGCCTTTGGC CCCGATGCCG CCGTCTGGCA GCGCCAATAT CCCGATGCGC GGCTGACCAT GCAGGACTGG GCCGGGATTC CCCTCCGCGA CGAAGGCTAT GCGGCGATCT CGATCAATGG CTTTCGCCAA CCAATCTGCA CGATCACCCC GCTGCCGCTC TGGCCTGCCC TGCCGCAAGG GAGAACAACC AGAACCCTGA CCACGCTGGC TTCCCCACCG CGTGATCCCC TCGACGAACG GCTGCGGGCA CTGGCCCAGT TTGATCGTGC GACGCGGCTG CGCTTGCTCC GCGATGCCCC ACCCGCGCTG TGGGAGGCCA TCCTCGCCCG CATGGAGCAG GATCGGGCTG ACCGCCATGC CCAGCTGCTC GCGCCGACCT GTCCCTTGCC GCGTGCCGCC CGCGTGCTGG CCTTGAGCCA TGCCAAAGCC GCAACCCCAC GGGATCTGGC GCTCGCCGCC TTAACCCGCC TGACCCTGAG TATTCCGCGT GATACGGTGG ACGACGAACC GCCTGCCAAG GGCAAACGCG GACGCAGTGG CGCGGCGGAA TCTCCATCGG AAGCGGCTCC CGCAGGGAGT CCCGTGGCAA CGCCAGCAGT GGTGGTTAAT CCGCGCAAGG TCGGCGTGGC CACCGTGATG CAGGCCGAGG CGACGGCCAC CAATCTCATG CACTTGTTAG CGGAGAATCC TGATGCGGGT TTATAA
|
Protein sequence | MHGRLIMTVY GRTLAGIGVV LLVLARLLSP WLLGHLTSPR LAPLATLVTR IHAISGWLLV GYGLWVLVTL VAWLRLRRRP AVTGQWYLLR IPKPMTTDPH SATAAQLGQR EAAMAGEIVR VLQAIVAHAP ADRRIALELW YTAQGIAWSL WLSHPDLYAP VFSAMQGFVP GGDLLAQPDP VATAAPALEW TQHSLAQPAI YPLRRADAMV GDPLDALIGG LRPQQGVTAI GLSLTLGAVP DGWQRQNRAL ADWLALDAKE QQASASKETA VAMLAKLQSP CVTIALRTVV LAESAALATA QHQILLGALD TYTMQHDSMR QAWTHGRIHT TITPAVRDRH TVRWLAMPLP PLVPTPTLPV LSLPEVTALW HLPTGRHLTV AIWRNNRFAP PSPALMVSPI GEPPPAFAAT TPLKDRIVRI TGGVAADGQP VYLGFPLKSY TFHQQITAST GAGKSHAAKV QLGELLRIGA GFGVIDFKGD LVNDLLTMIP DDRLDDVIVF DPLDPDHCLG LNLLDPTYLN KDTEPDFLVE LLEALVASTD SHWGDSAGMQ EALRYGALTL MEGEPTPTIA HLYLLFSSSA YRATVLERVT DPDLVLFWTY QFPKLSDTQK SSMTALQRRL SQFLINRTVR ITCNQTRTTI PFRQVMDQRK IVLAKLPVEL IGATAGGILA NVLINLVLAA AFSRLDTPED EREPWVLVVD EFQEAMRRGD PANYQKILER LRSFGIGLVL MHQGTSQLPA EMLATTLEIV QTRLILSAFG PDAAVWQRQY PDARLTMQDW AGIPLRDEGY AAISINGFRQ PICTITPLPL WPALPQGRTT RTLTTLASPP RDPLDERLRA LAQFDRATRL RLLRDAPPAL WEAILARMEQ DRADRHAQLL APTCPLPRAA RVLALSHAKA ATPRDLALAA LTRLTLSIPR DTVDDEPPAK GKRGRSGAAE SPSEAAPAGS PVATPAVVVN PRKVGVATVM QAEATATNLM HLLAENPDAG L
|
| |