Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4549 |
Symbol | |
ID | 5736394 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5821936 |
End bp | 5824524 |
Gene Length | 2589 bp |
Protein Length | 862 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641281711 |
Product | CHRD domain-containing protein |
Protein accession | YP_001547308 |
Protein GI | 159901061 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCTGAATC GGTTTCGAAC ACCTCGCATG TTCTCGTTCG TAACATTGCT CTCGCTATTT AGCCTTGTCG TTGGTCTAAA CTTAGCCCAA GCAGCCCCGC GCACCACTCA AGGCGGTGGT GGCTTTACCC CAGGCAATTT GGTGGTAGTA CGGGTTGGTA ATGGCAGTGG CACGCTTTCA AATGCTGCAA CTCCAGTTTT TCTCGATGAG TACACCCCCA ACGGCGATTT TGTGCAAGCT GTTGCTATGC CAACTGCCTT GAATGGCAAT AATCGCCGCC TTACCATGTC GGGTTCGGCA ACCTCGGAGG GTGCACTCAG CCTTTCAGGC GATGGTCAGT GGTTATTGCT AGCAGGCTAC GATGCTGATG TTGGTACGAC GAGTGTTGCA TCAAGCAATA GTGCCACGGT CAATCGGGTG GTTGCCAAAG TCAGCCTCAG TAGCCTAATT GATACAACCA ACGTGATTAC TGATGCCTAT AGCGCTAACA ATATTCGTGG CGCAACAGGT AACGGCGACA ACCTTTGGGC AACCGGAACT GGTAATCCGG GCGGAGTACG TTTTCTGAAT GGGATGGGCA CAACCACAAC CTTGATCACC CCAACCAACA CCCGCGTTGT ACATACCTAT GGTGGTAATC TCTACTTCTC GTCATCATCA GCTACATCGC GTGGTATTTT CCAAATTGGC AATGGCCTGC CAACCAGCGC AGGCCAAACG ATTACGCCAA TCGCTAGCGC AACCTCGCCC TATGGCTTTG TCTTTTTAGA TCGTGAGCCA AGTGTGGCAG GCGTTGACAC GTTGTATGTG GTTGATGATA CATCAACAAA CTTGCGTAAA TTCTCCTTCG ATGGCACAAC CTGGACATTG CAAGGTGTTT GGGCTGAGCC AAGCACTGGG TTTCATATTG CAGCCCAAGA TAATGGAGCC AACGGCGTTG ATTTATACCT GACTCGCGGA ACCAATAGCC TGAGCAAATT GACCGATACC GCAGCTTATA ACCAACCAAT TAATGCTGCG GCCCTAAATC CCATCGTTCC GGTTGGGGCC AACACAGCGT TTCGTGGAAT TATCGTTGTG CCCAGCCCAG TTCAAGCTAC GCCAACTCCC ACTGAAACTG CGACAGTAAC GCCGAGCGAA ACCCCATCGC CAACCGCCGT GCCAAGCTGT TTGCGCTTTA TGGTTAGTTT GGAAGGCTCG CAAGAAGTAC CACCAAGTGG CAGCAATGCA ACTGGCGGTG GCACAGTCGA TGTTGATACC GTCAACAATA TTTTGAGCTA TAACCTCAGC TACCAAGGCT TGAGCGGCAC CGAAACGGCT GCGCATATTC ACGGTTTTGC TCCACGCGGA GCCAATGCTG GCGTTTTAGT TGGCTTGAGC ACTGGTTCGC CCAAAGTTGG CATCTTCAAC TACAGCGAAG CCCAAGAAGC CAATATTTTG GCCGGCCAAG CCTACGTCAA TATTCATACC GATAGTTTCC CTGGTGGCGA AATTCGCGGT CAAATCGATG GTGCAACGAT TAATTGTCCA CCGCCAACCG CTACGCCAAC CGAAACGACT ACTCCAACTG CCACTGAAAC CCCATCTGCT ACGCCAATTC CAAGCTGTTT GCGCTTTATG GTTAGTTTGG AAGGTTCGCA GGAAGTGCCG CCAAGCGGCA GCACTGCAAC TGGCGGCGGC ACAGTCGATG TTGATACCGT CAACAATATT TTGAGCTATA ACCTCAGCTA CCAGGGCTTG AGCGGCACCG AAACGGCTGC GCATATTCAC GGTTTTGCCC CACGTGGAGC CAATGCTGGA GTCTTAATTG GCTTGAGCAC TGGTTCGCCC AAAGTTGGCA CCTTCAACTA CGCTGAGAAT CAAGAAGCCA ATATTTTGGC CGGCCAAACC TACGTTAATA TTCATACCGA TAGTTTCCCT GGTGGCGAAA TTCGCGGCCA AATCGATGGT GCGACGATTA ATTGTCCACC GCCAACCGCA ACCCCGACCG AAACTGTTAC GCCAAGCGCT ACGGCAACTG CGACCGCAAC CGCAACAGGC ACGGCCACAC CAAGCCTAAC CCCAATCGTT ACACCACCAC CAAGCTGTAT CACCATGACG GTAAGTGGCA GTGGTTCGCA AGAAGTACCA CCGAATAACA GCAATGGCAT GGTGATGGGC ATGATTGAAA TTAATACGGT TGCCAACACG ATCAACTACA ATCTCAGCTA CCACGATTTG AGCAGCGCCG AAACTGCCGC GCATATTCAT GGTTTTGCGC CGCGTGGCTC GAATGCTGGC GTATTATTCA ACTTGCCGCT TGGTGCTAGC AAAGTTGGCT CGGTCAGCTA CGCCGAAAAT CAAGAAGCCA ATATTTTGGC CGGCCAGACC TACATTAATA TTCATAGCAG CAATTTCCCG GGTGGCGAGC TTCGCGCCCA GCTTGATGGT GCAACCGCCT TATGTGCCAC CCCAACACCT ACGGCGACAG GGACTGCTAC GAACACGCCA ACCAACACAG CGACGGCGAC AGTGACCAAT ACCCCAACTA ATACCGCCAC TGCTGTGCCG CCAACCTTTA GAATCTACCT GCCATTGACC ATGAAATAA
|
Protein sequence | MLNRFRTPRM FSFVTLLSLF SLVVGLNLAQ AAPRTTQGGG GFTPGNLVVV RVGNGSGTLS NAATPVFLDE YTPNGDFVQA VAMPTALNGN NRRLTMSGSA TSEGALSLSG DGQWLLLAGY DADVGTTSVA SSNSATVNRV VAKVSLSSLI DTTNVITDAY SANNIRGATG NGDNLWATGT GNPGGVRFLN GMGTTTTLIT PTNTRVVHTY GGNLYFSSSS ATSRGIFQIG NGLPTSAGQT ITPIASATSP YGFVFLDREP SVAGVDTLYV VDDTSTNLRK FSFDGTTWTL QGVWAEPSTG FHIAAQDNGA NGVDLYLTRG TNSLSKLTDT AAYNQPINAA ALNPIVPVGA NTAFRGIIVV PSPVQATPTP TETATVTPSE TPSPTAVPSC LRFMVSLEGS QEVPPSGSNA TGGGTVDVDT VNNILSYNLS YQGLSGTETA AHIHGFAPRG ANAGVLVGLS TGSPKVGIFN YSEAQEANIL AGQAYVNIHT DSFPGGEIRG QIDGATINCP PPTATPTETT TPTATETPSA TPIPSCLRFM VSLEGSQEVP PSGSTATGGG TVDVDTVNNI LSYNLSYQGL SGTETAAHIH GFAPRGANAG VLIGLSTGSP KVGTFNYAEN QEANILAGQT YVNIHTDSFP GGEIRGQIDG ATINCPPPTA TPTETVTPSA TATATATATG TATPSLTPIV TPPPSCITMT VSGSGSQEVP PNNSNGMVMG MIEINTVANT INYNLSYHDL SSAETAAHIH GFAPRGSNAG VLFNLPLGAS KVGSVSYAEN QEANILAGQT YINIHSSNFP GGELRAQLDG ATALCATPTP TATGTATNTP TNTATATVTN TPTNTATAVP PTFRIYLPLT MK
|
| |