Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4721 |
Symbol | |
ID | 5736565 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 6030293 |
End bp | 6031981 |
Gene Length | 1689 bp |
Protein Length | 562 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641281886 |
Product | hypothetical protein |
Protein accession | YP_001547480 |
Protein GI | 159901233 |
COG category | [R] General function prediction only |
COG ID | [COG0661] Predicted unusual protein kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAAAC CTGAGTTGGT GTCGTTGCCA AATGTTCCAC GCCGCCGTCG TTTCTTTCGC GTCGCGTGGT TTTTTGTGCG CTTAGTTGCC CATATTATCG TCTGGGATTT GTTGCTCGGA CGGCAGTTTT GGTTTCGCTG GTACGCGGAT CGCACGCGGA TCGAGCGCTA TCGGCGTTTT TCGCGGCGCT TCCGCGATTT GGCGATCGAT CTTGGCGGCG TGATGATTAA GCTTGGGCAA TTTGCTTCAA CTCGCGTCGA TGTGTTGCCG CCAGCCGTGG TTGAAGATTT GATCGGCTTG CAAGATGAAG TTTCGCCCGT ACCATTTCGC TTGATTCAAG CCACAATCGA ACACGAATTA GGCCAACCAC TCGATCATAT TTTTAAAACT TTTGAGCGTG AACCAATCGC TGCTGCGTCG TTTGGCCAAG TGCATTTCGC CACGCTGCAC AACGATCAGC CGATTGCCAT CAAAATTCAA CGCCCACAAA TTGAGCAATT TGTTGAAATC GATATTGCTG CACTGCGCTG GGTTGCTAGT TGGATGCAAT ATTATGGCCC AATTCGCCGT CGCACCGATT TACCAGCGCT GATCGAAGAG TTTTCGCGAA TTACTCTGCG CGAACTCGAT TACCTAAGCG AAGCTGATCA CGCCGAGCGC TTTCAGCGTA ATTTTGCTGG CAACGATCAT ATTTATGTGC CAAAAATTCA GCGCGATTAT TCAACTGAAC GAATTTTGGT GATGGAGCGA ATCGAGGGCA TTAAAATTTC GGAATATGCG GCGCTCGATG CAGCTGGAAT AGATCGGCTG GATTTGGCCG AAAAGCTTTA TTTGGCCTAT TTGCAACAAT GCTTTACCGA TGGCTTTTTT CATGCCGACC CACACCCAGG CAATTTGTTT GTGCGGCCTG TCGGCGAGCG CTTGGCGAAT GGCAAACAAC CCTTTGTAAT CACCTTCCTC GATTTTGGCA TGGTCGATTC AATTCCCCAA AGCGTGATGG ATGGCCTCGC CACGATTGCA GCTGGCGTGG TAATGCGCGA ACCACAACGT ATGATCGATG GCGCACGCTC GATTGGCGTG GTCATGCCCA ATGCCAATGA TCAACAATTA CGCCAAGCCT TGGAAATTTG GTTTTCCTAT ACCTATGGCC GCACAATTCG CGAGTTGCAA CAAATCGATG TTGAGGGTTT TGTCGGCGGA TTAAGCGAAT TGCTCTATGA TTTGCCGTTT CAGTTGCCTC AATCATTACT CTTTTTGGGA CGCACGGTAG GGATTATCGG TGGCGTGGCG GCTGGTTTAG CGCCCGATTT TGATATTTTC AGCGTGACTA AACCCTTTGC CTTACGCTTT ATTCGTGAGC AAACCAGTGG CCGCGATCTG CGCGAACGGG TAATTAACGA AGGCCGCGAA TTAATTACCG ACCTCAGCCA AATTCCGCGC CATGCCAAAC AATTTTACGT CAAGGCTGCT CAAGGCGATT TGCAAGTGCG CACCGAAATT GTCAAACTAG AGCGCACCAC CAAACGAATC GAGCGGGCAT TGAGCCGTCT TACCGCAGGG ATTGCGGCAA GCGCCTTGAT CATCAGCGCG AGCATTCTCC AAGCCCAACA GATTTATAGC CCTTGGATGT GGTGGTTGGC TGGTGGTTTG TTGATTTGGT CATTGTTGCC ACGCTTCAAT CAAAATTAA
|
Protein sequence | MSKPELVSLP NVPRRRRFFR VAWFFVRLVA HIIVWDLLLG RQFWFRWYAD RTRIERYRRF SRRFRDLAID LGGVMIKLGQ FASTRVDVLP PAVVEDLIGL QDEVSPVPFR LIQATIEHEL GQPLDHIFKT FEREPIAAAS FGQVHFATLH NDQPIAIKIQ RPQIEQFVEI DIAALRWVAS WMQYYGPIRR RTDLPALIEE FSRITLRELD YLSEADHAER FQRNFAGNDH IYVPKIQRDY STERILVMER IEGIKISEYA ALDAAGIDRL DLAEKLYLAY LQQCFTDGFF HADPHPGNLF VRPVGERLAN GKQPFVITFL DFGMVDSIPQ SVMDGLATIA AGVVMREPQR MIDGARSIGV VMPNANDQQL RQALEIWFSY TYGRTIRELQ QIDVEGFVGG LSELLYDLPF QLPQSLLFLG RTVGIIGGVA AGLAPDFDIF SVTKPFALRF IREQTSGRDL RERVINEGRE LITDLSQIPR HAKQFYVKAA QGDLQVRTEI VKLERTTKRI ERALSRLTAG IAASALIISA SILQAQQIYS PWMWWLAGGL LIWSLLPRFN QN
|
| |