Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4184 |
Symbol | |
ID | 5736046 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5336724 |
End bp | 5337743 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 641281339 |
Product | hypothetical protein |
Protein accession | YP_001546944 |
Protein GI | 159900697 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACCGC TGATCGCTGG CGAATGGCAA GGCCAAGCTG CCCAACGCTG TTTTCAGGAG TTCGAGAGCG TAATTGTGCC AACCTATCAG CGCTTGATTA ATGTATTTGA CACTAGTGCC CAAGTAACCC TTGAGATTGC CAAATTAATG CAAGCGGCTG AAGCCGAAGC TGCGGCTCTC TTTTCTATAA ATGCCAGCTC ATCTAAAAAT CGACTAATGG TAGACCCATT AGAACTCCCA GATTTACCGC ATACAGAACC TTTAGATGGC AGTAAAACTT TAAAGCCTGA ACTCATCACA ATGCCATCGC CACCGCCACC GCCGACTCAA CCTAATACGG GCGACGGAAG TGGGACACAT GGGGCAATGG GGCAACCAAC TGAGCAGGAT AAAGCCAATC ATTCTAAAAT GAGTAGGGCT GCAACTCTTG CACGATTCTC CGGAGTTTTT TCAGAAACCG GCTATCATAT GCAACATTTC CTGAATAATT CTGGAGAGCC ACTCACTGTT TCCGTCGATG ATATGCTTGA TGATATGCCT GTATTTAAAA GTAAGGTTGA ATTTAGGTAT GAAAATGAAA TTATTCCCCA AATTAATCAA AAACTTCGAT CTGATTATCA TGGCGAACCA TTAGAATTTC ATGTAACAAT TCCTTGGAAA TCCAATTTTT ATCCTGATCG AAGTGAGAAT AAAAATTGGT ATTACGCAGT AGGAGGATTT AGTTATGCCC AAACTGCTTA TGTTCGAGTA ACTCCAGCTC TAGATGGAAC TCCAAATGTC GAAGTTATTT CTCAAGTACA TATGTTTGAT CGATATAACT GGGATCAAGG TAAAGCGGTT ACCATTCCAT CTTCAGGAAT TGAATGGATT GATAATAGTA CAATTGCAAA TGATCATATT ACAGATGAAG AAATGGGAAG GCTTCATGGG ACAGGGATCG CGCAAGAATA CGATCTAACA GGGACATCGA GTGGTCAGAA GCATTTTTAT ACCTATGATT CATTTATTGG TCTAAAATAA
|
Protein sequence | MQPLIAGEWQ GQAAQRCFQE FESVIVPTYQ RLINVFDTSA QVTLEIAKLM QAAEAEAAAL FSINASSSKN RLMVDPLELP DLPHTEPLDG SKTLKPELIT MPSPPPPPTQ PNTGDGSGTH GAMGQPTEQD KANHSKMSRA ATLARFSGVF SETGYHMQHF LNNSGEPLTV SVDDMLDDMP VFKSKVEFRY ENEIIPQINQ KLRSDYHGEP LEFHVTIPWK SNFYPDRSEN KNWYYAVGGF SYAQTAYVRV TPALDGTPNV EVISQVHMFD RYNWDQGKAV TIPSSGIEWI DNSTIANDHI TDEEMGRLHG TGIAQEYDLT GTSSGQKHFY TYDSFIGLK
|
| |