Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5210 |
Symbol | |
ID | 5737168 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | - |
Start bp | 301391 |
End bp | 305734 |
Gene Length | 4344 bp |
Protein Length | 1447 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641282374 |
Product | hypothetical protein |
Protein accession | YP_001547965 |
Protein GI | 159901719 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.124073 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCACAGC AGCCGACCGC CGCCTATCAT CGTGAGAGTG AACGCTTGCT CCACCTGCGC CATACTGGGT TTGTCGGGCG GGTCGCTGAA CTCGCCGATC TTCAGTCACT GATCGACGAT CTCCGCCCGA CGGGTGGCTA TGTCGTCGTG ACGGGCGACG CAGGGGTCGG CAAAAGCAGC CTGCTCGCGC ATGCCATTAT CCAGGCAGGG ATTGACCAAA CGCCCTATCA TTTTATTGCC CTCACCCCTG GTCGTGCCTA TCAGGTGGAC TTGCTGCGCC ATGTGATTGG CCAATTGGTG CGCAAGTACC CGGAAGCTGA GGCCGCGTTT TCCGCCGAGA CCTATCCCGC GCTACGCCTC ACCTTCCTGC GCATCGTGCA TGATCTTGCG GCGCAGGGTG TGCAGGAAAC GATCTATCTT GATGGGCTTG ATCAGTTGCA GCCCGACCTT GATGGACTGC GGGACATCAC CTTCTTGCCA CTCCAGCTCC CGCCCGGCAT CGTGATCGTG CTGGGGTCGC GCCCGGATGA GGCGCTTGAT CGCTTGGGGC TTGACCACGG GGTCGTCTAC GCAGTTCCCC CCTTGCGTGA GGCCGATGTG CTCCAGCGGT GGCAGCAGGT GCAGCCGATG CTGACCCCGG CCAGCGTTCA GCCGCTGGCC CGTTCGGTCG CAGGGAATGC CTTGCTGGTC GCGCTTGCCG CCACGCTCCT CGGCCACGCA GCGCCGACCG TTGTCCCGCC ATGGCTGGCC GAGGCAAGTC GCGACCCAAG CACCCTCTTT CGGGTGAGCC TCGACCGGAT CAAGCACCAC GACCCAATCC CGTGGCAGCG CATCATTCGC CCACTGCTCG CCGTCTTGGT CGTGACCCAA GAACCGTTGC TCCCATCGAG TATCGCAGCC CTGATTGAAC AACCCGTGGA GCAGGTGCAG GCCGTGCTGG GTCTGGTCAG TGAATGGGTG AGTGTCGCTG CTGATCAGCG GCTAGCGTTA CGCCATTTAA TGTTCTATGC CTATTTAGCA CAGCACGAAT TCGCTGCCGC CGACGTGCGG GTATGGCATC AGCGGTTGGC GACGTGGTGT GGGATAGGTC TGGCGACGAT CTGGGAGGAG AGCGGCGATC CAGCGGAACA GGCCCGACGC TGGTATGCGC GACATCACTA TGTGACCCAT CTAGCCTTGG CGGAACACTG GGCAACGCTG TGGCAGGTGA TTGATGCAGG CACGTATGGT GCGCACAAGG TGCGGTTTGA TCCCACGCGG CGACTGTATG GATTGGATAT GGATCGTGCC CGTGAGAGTG TGATCGCCGC GGGTGTCGGT GCAGCGATTG ACGAGCAGCT GGCCGAATTG CCCCGACTGT GGCGGTATAG TCTGCTGCGC ACGAGCCTGA CCGCCGATGC GGATGGATGG CTCGATGACA TGTTTGTGAT TGTCGCTGCC AGTGGCCGAC TGTCGGAAGC CGAGGCCCAG ATTGCACTCT GTTCTGACCC GAAACGCCAG GTGCAGTTGT GGTCGCGTAT CCTACCATAT GCGGAGCTGG AGCGACGGGT GGCTCTCATC CAACAGATGG AACTGGTCGC ACGGAATCTG CGCGATGCCG ATGACCGTGA TCGTGCGCTT GGGACGGTCG GGATGGCCTA TATCGACCAT GGGATGTTTG CGGTGGGGTA TCCGCTGATC CGTGCGCTCG CACATAAGCG TGATTCACGG TTGTATGCGG CCGTCGTGGA TCGGGTTGAT CATGGCGATA GTGCGCACGC CCAAGAGGTG ATGGCAGCGA TTCAAACACC AACATACCAA GTCCAAAGTG CCCTGCTCAT CGCCAAGGCC TTGCTGACGG GTGGAGCCTA CGAGGAGGCC CATGCGCTTC TGATGCAGAT TCGTCCCATG GCTCGGGATG GCGATCTGAA CACACTCCTG TGCCTGCTTG CTGATATCCA GTGGGCTTGG GGCAATCTCT CCCAGTCGCA GCTCCTGCTT GCTGAGGCCG AAGGATTGAT TCCGTCGATT GCTCATTGGG AGCATCCTGC TCGGTTTCGT ATGATCCTGG ATGGGTATTT AAGGCATGGG GATCGGGTCA ACGCCCTGCG GTGCACGCAG CAAGGTACTG ATGATAGCGT GCGTTATGAA ATCGTGAAAC TCTATCTTAC GCATGCCGAT GCGGTAACGG CTTCCCAAGT CGCTCCCATG ATAGCCCATG ATGGGTATCG TGACGGAGCC TATAAGGCCC TCATTCTCTG GTATTGTCAA CAGCAGGATC TCACGACGGC CCACGCGCTG CTGGCCTTGA TCGTTGAAGA TCAGCAACAC ATTGAAGGTG CCTATGTCGT GGCAAAGGCC TATGCCGAAC GCCAGCAATT TGACGCAATG GATGCATGCT TAACCATGGC ACTCGACGAT GATTTGGGAG AACTTAACTT TTTTGATTTT GATTGTGTCT TCGATATCGT CGAATTGTAC GCACGTTTTG GGTTTCATGC GCGTGCATGC GCGATTTTAG CGCGAGTCTT TCCCCTGATT CGTGAGCTTC CTCGCGATTG GCCCGAAGAG TTGCCGGAAT GGATTCGGTT TGCGGAAATC ACCAACCGCT ATGCCTATGC AGCCTTCTAT GATGACCTTG TCCGGGCCGT CTTTATCCCA AAAGAACCCC ATTCCTATGC GACGGCGGTT GGCACCATGG CCCAAGGCTA TGCCATGCAT GGTGATTTTA CGCGTGCTGA GCAGCTCGCT TCTGCACTTA CGTCCCCCGA TATGGCGATA ACGACGCTGC ATGCCTTGGC GAGGATCGCC CAGGCGGCTC AGAGACCACC CCTTGCTGCC CACTATTTAA GCGCGGCGCA TGCCCGTTTG AACACGATCG CCGATGCGAC TGCGCAGATT GCATGGTGTG GGCGGCTTGC AACGACGGCA TGGACAATGG GGTTGACGGA CGCGGCACGC ATGCTGATCG ATGACGGTCG GCAGCGATGG GCACAGCTCC CCATGCACCA GCAAGGGGTG AGTTCGCAGA GACTCGTTGA AGGGTATCAG GGGCAGGAGA ACCTTGGTGA AGCCCTCGCG ATTACCCAAT TAATGAAGGA CTCGCCGTAC TATCAGCAGT TGATCGAGAC GTTGATCGCT GGGTGTATAA AGGCCCAGGA TCTCGATCAG GCGTATGCCA TCCTGAAACA ATCCCATATC GCTGTCCCAC AGTACGTCGC CGCACTCTGT ACGATTGCGA TCACGGCCCG TGAACAGGGA AACATGGCGC TTGCGGCGCA TACATGCGCC GAAGCACTGC TTGCCTGTGA CCGCTATCCT GTTCGGCGGG AGCGATTGAA TTTCGTAAAA GACTTGGCGA TAAGCCAGTT AACCCATGGG AGCGATGCCT GCTTACCGCA ACTCTTGGCG ATGCTGCGAT CCAATGACCC ATGCTGGGAT CTTCCCGCCG ATGTTGCGAT GCGGTGTGCG ATCGCTGCAG TCTATGCGGA TCAGGGTGAT CGTGCGTCGT TTGCGGAGTG GCTGGGTGCT GCGTATGACG GTACACACCA CGTGTTGCAA TCAGAAGATG CTAGTCGCTG GGTGAGCATT GCGTATGAAA CGCTTGCCAA AACGTATCTG GCGTATGCTG ATGAAGCCGC CATCCAGGCG TTTCTTGCGG ATTTCGCGGC GGTCGTTCCA CGCTGTGTGG AGCGGGGTGA TGCGAGTGTT GCGATGAAGG TCTTAAGTCA AACCTATGCC CACTATGCCA TCCAATACCA GCCGTCATGG CGTGAAAAAG CCTATCAGGT GGCTCGCGCC ATTCCAGACC TGTGGGAGGG TGAGAATACG CTTGCAGTGG TGGCAACAGC CTATGTTCGC GTCCATGATC ATGCGGGGGT GAACATGATC ATCCATGAAA TGCGACAACG GAAATTCACG CGCCATGAGG CCATGATTCA ACAACACCAT AATCTGTGTA CGATTGCCTT GGCCTATGCC GAACAGGGCG ATCATGCCCA CGCCGCTGCC CTGATTGCGC CACTTGCCCC ATCGTCTCAT CGCGATCGCG TGCTGAAGGT GCTGATTGAG GACCATCGTC AGGCTGACCA GTGGGATGCG GTCTATCAAC TTGTGGATAC CTATTATGAT CGCACCGAAC GGGTAGCGGT TGTTCAACAG ATTATCACCG CCTATCGTGA ACGACAGCGT ATCCGTGAAA GTATCGCGCT TGTCCACACC ACGTGGCGCA GATGTAACCA TGCCGATGAA TTATGGAGTA TGAGCGCAAG CATCCTGCCA TTCCTGCTGA ACGATCCATC GCTTGGCCTA GCCTTGCTTG ATGCGGTGCC ATGGGTTGAG CAGCAACTGC GACGGTATGT CTAA
|
Protein sequence | MSQQPTAAYH RESERLLHLR HTGFVGRVAE LADLQSLIDD LRPTGGYVVV TGDAGVGKSS LLAHAIIQAG IDQTPYHFIA LTPGRAYQVD LLRHVIGQLV RKYPEAEAAF SAETYPALRL TFLRIVHDLA AQGVQETIYL DGLDQLQPDL DGLRDITFLP LQLPPGIVIV LGSRPDEALD RLGLDHGVVY AVPPLREADV LQRWQQVQPM LTPASVQPLA RSVAGNALLV ALAATLLGHA APTVVPPWLA EASRDPSTLF RVSLDRIKHH DPIPWQRIIR PLLAVLVVTQ EPLLPSSIAA LIEQPVEQVQ AVLGLVSEWV SVAADQRLAL RHLMFYAYLA QHEFAAADVR VWHQRLATWC GIGLATIWEE SGDPAEQARR WYARHHYVTH LALAEHWATL WQVIDAGTYG AHKVRFDPTR RLYGLDMDRA RESVIAAGVG AAIDEQLAEL PRLWRYSLLR TSLTADADGW LDDMFVIVAA SGRLSEAEAQ IALCSDPKRQ VQLWSRILPY AELERRVALI QQMELVARNL RDADDRDRAL GTVGMAYIDH GMFAVGYPLI RALAHKRDSR LYAAVVDRVD HGDSAHAQEV MAAIQTPTYQ VQSALLIAKA LLTGGAYEEA HALLMQIRPM ARDGDLNTLL CLLADIQWAW GNLSQSQLLL AEAEGLIPSI AHWEHPARFR MILDGYLRHG DRVNALRCTQ QGTDDSVRYE IVKLYLTHAD AVTASQVAPM IAHDGYRDGA YKALILWYCQ QQDLTTAHAL LALIVEDQQH IEGAYVVAKA YAERQQFDAM DACLTMALDD DLGELNFFDF DCVFDIVELY ARFGFHARAC AILARVFPLI RELPRDWPEE LPEWIRFAEI TNRYAYAAFY DDLVRAVFIP KEPHSYATAV GTMAQGYAMH GDFTRAEQLA SALTSPDMAI TTLHALARIA QAAQRPPLAA HYLSAAHARL NTIADATAQI AWCGRLATTA WTMGLTDAAR MLIDDGRQRW AQLPMHQQGV SSQRLVEGYQ GQENLGEALA ITQLMKDSPY YQQLIETLIA GCIKAQDLDQ AYAILKQSHI AVPQYVAALC TIAITAREQG NMALAAHTCA EALLACDRYP VRRERLNFVK DLAISQLTHG SDACLPQLLA MLRSNDPCWD LPADVAMRCA IAAVYADQGD RASFAEWLGA AYDGTHHVLQ SEDASRWVSI AYETLAKTYL AYADEAAIQA FLADFAAVVP RCVERGDASV AMKVLSQTYA HYAIQYQPSW REKAYQVARA IPDLWEGENT LAVVATAYVR VHDHAGVNMI IHEMRQRKFT RHEAMIQQHH NLCTIALAYA EQGDHAHAAA LIAPLAPSSH RDRVLKVLIE DHRQADQWDA VYQLVDTYYD RTERVAVVQQ IITAYRERQR IRESIALVHT TWRRCNHADE LWSMSASILP FLLNDPSLGL ALLDAVPWVE QQLRRYV
|
| |