Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1037 |
Symbol | |
ID | 5732941 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1183535 |
End bp | 1184614 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641278172 |
Product | hypothetical protein |
Protein accession | YP_001543813 |
Protein GI | 159897566 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGAAA TAGCTAACAA GCTTGCCATT GTGCGTGCAG CGATTGCACG TTATAAATAT GGCGCTGTGC GGTTGCGCGG TGTCGATTGG TTTGCATGGG CCACAGCGGG CTTAGATAAT GTGGTGATTT TAACGACTGA AACTGGGATT GCCGAGGTGG TGATTACTGC TGATCAGGCC TTGATCGTGA CTGATTCAGT TGAAGTTGAT CGCCTACGCG ACCAAGGTAT TCCCGCCGAA TATCAACTTT GGGCAACCAG TTGGTCTACG CCATATTTGC TCAATCAAGC AGTACGCGAG TGGAGCAATG CAACCTTGGT GGCCTCAGAT CGGCCAGCTC AAGGCGAGGT TGGCCTACCA CCTGAATTAG TTGCGGCTAA ATTGCGCTTA GTGCCCGAAG AAATTGCCCG TTATCAAGCG CTGGGTCGCA CGACTACCCA GATTATGACC AAGGTACTGA GTGCTGCAAA ACCAGAGTGG ACAGAATTTC AATTGGCGGG AGCCGCCGCC GCCGAGCTTT GGAGCCATGG CATTCACCCA GCCTTGACCT TGGTTGGTGG CGAGCGCCGT TTGCCGTTGT ATGGCCACTT GCCAGCCACC CACGAGCCAA TCGGCCAACG CGCGATGTTG GTAGTTTGTG CGCGGCAAGG CGGTTTGTAT GCCAATGTCA GCCGCTACAT TCATTTTCGA CCTGAAACTG CCGCCGAACG GGCCAGCTAC GAGCAAGTGA TTGCAATTGA AGCCGAGATG ATCGCCGCCG CCCAAATTGG CAGTACGGTT GGCGCAGTCT ACGATGCAGC CGTAGCGGCT TATACCAAGC GTGGTGTCGT CGAGCAGATG CAGCGGTTGC ATCAAGGTGG CACAACTGGC TATCGTTCAC GCGAAGTTGT GGCGCGGCCA GGCGAACCAA CGATCATCGA AGCCAACACA GCAATGGCTT GGAATCCCAG TTTAGCTGGC GTGAAAATTG AAGATACAAT TATTCGCAGC GCTACTGGAA TCGAAGTTCT TTCGATTGAT CCTGCTTGGC CAAGTGTGGA GTATGCAGGT TTACGCCGAG CATTACCATT AGTCATTTAG
|
Protein sequence | MTEIANKLAI VRAAIARYKY GAVRLRGVDW FAWATAGLDN VVILTTETGI AEVVITADQA LIVTDSVEVD RLRDQGIPAE YQLWATSWST PYLLNQAVRE WSNATLVASD RPAQGEVGLP PELVAAKLRL VPEEIARYQA LGRTTTQIMT KVLSAAKPEW TEFQLAGAAA AELWSHGIHP ALTLVGGERR LPLYGHLPAT HEPIGQRAML VVCARQGGLY ANVSRYIHFR PETAAERASY EQVIAIEAEM IAAAQIGSTV GAVYDAAVAA YTKRGVVEQM QRLHQGGTTG YRSREVVARP GEPTIIEANT AMAWNPSLAG VKIEDTIIRS ATGIEVLSID PAWPSVEYAG LRRALPLVI
|
| |