Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3336 |
Symbol | |
ID | 5735206 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4205405 |
End bp | 4206484 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641280483 |
Product | TrkA domain-containing protein |
Protein accession | YP_001546100 |
Protein GI | 159899853 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0569] K+ transport systems, NAD-binding component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.819763 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACCAG CCAAAACCCC TAGTCGCAAA GCGCGTTGGC GAAGACTGAT CGCGGCCTCG TTGCGCGATG GTTGGATTGT GTTTCGCGAT TCGGGAATTT GGCTGGGTTT ATGGCTTTTG TTGTGGTTGG GGTTTACCCT GGCAATTTGG GCAGGCACGC GCCCAGTTTT AGCTTTTAAC CAAGCGCTCT ATCAAGCATT TAGCCAAATG ACCCTCAACC CCGTGCCACT GCCTGAGCCA TGGTGGCTGC AAGTATTGTT TTATCTCGCT CCAGCCTTGA ATATTATTTT GCTGGCGCGA GGTGCGCTCA ATATGGGTAT TTTGCTGTTC GATAAACGCA ATCGGCGGGA GGCTTGGCAA ATGGCGTTAG CATCAACCTA TCGTGATCAT ATTATTGTGT GTGGTTTGGG CAAAATTGGC TATCGGGTGG TTGGGCAATT GCTGGCCAGC GGCTGCGATG TGGTGGTAAT TGATGCTCAC AACGATGGGC CGTTTCACGA ATTGGTGATG GGTCAGCATG TGCCAGTGAT TATTGGCGAT GCTCGTCAGC CTGAGCTGCT GCACGAAGCA GGTTTGCGCC ATGCCACCTC GCTCACTGTG GTTACTGGCG ACGATTTGAC CAACCTCGAT ATTGCCTTGA CCGCCCGCGA ATTGCATCCT GATATTCATA TTGTGATGCG GGTTTTTAAC GATTCATTGG CGAGCAAACT TAGCTCAGCC TTTCATATTC AGACCGCATT TAGCACCTCG GCGCTGGCCG CCCCAACTCT TGCCGCTGCG GCCTTGGGTC GCGGCATTAC CAACGCCTTG TATGTGGCAG GCAAGTTGCT CTCGACGGTG GAAATTACGG TAGCCCGCGA TGGCATTTTT GATGGACGCT TGATCCAGAC GGTTGAAAAT CAGCACGATA TTTCGGTGCT TTATCGGCGT GGGCGCAATG GCGAAGATTT ACGCCCACGC GGCGATGAAC GTTTGAGCAG CGGCGATCAA TTGGTGATTA TCGGGCCGTT GGCAGCGATT AATCAGATTC AAACGCTGAA TAAACCGAAT GCCGCGCCGC ATGCGCCCTA TCGACTTTGA
|
Protein sequence | MKPAKTPSRK ARWRRLIAAS LRDGWIVFRD SGIWLGLWLL LWLGFTLAIW AGTRPVLAFN QALYQAFSQM TLNPVPLPEP WWLQVLFYLA PALNIILLAR GALNMGILLF DKRNRREAWQ MALASTYRDH IIVCGLGKIG YRVVGQLLAS GCDVVVIDAH NDGPFHELVM GQHVPVIIGD ARQPELLHEA GLRHATSLTV VTGDDLTNLD IALTARELHP DIHIVMRVFN DSLASKLSSA FHIQTAFSTS ALAAPTLAAA ALGRGITNAL YVAGKLLSTV EITVARDGIF DGRLIQTVEN QHDISVLYRR GRNGEDLRPR GDERLSSGDQ LVIIGPLAAI NQIQTLNKPN AAPHAPYRL
|
| |