Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2777 |
Symbol | |
ID | 5734658 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3535022 |
End bp | 3536308 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641279920 |
Product | hypothetical protein |
Protein accession | YP_001545543 |
Protein GI | 159899296 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGATA ATTTGAGCCA CATGATTATT GATGCATGGC AAAACGGGGC CAGCCTCGAA ACACTCTGCC AGCAGTATCC TCACCACGCT GAGGCTATCC AACAATTAAT TAATCCATTA ATTCAGTTAC AACGGGTCAA TCCACCAACT ATGCCAGCAC GAGCAAGCCA TGCACAAGCT GATTTTATGC GTTTAGCGCA ACACTATCGC GCTCAAACTG CCCCAAAGCC TAAACCACGA CGGCGCTTAT TAACTCAACG CTGGGTTTGG GCCACAGCGA CAATCCTGCT CTTGGTTTGT TTGAGTGGCA ATTTGGTCTT ATCGGCCTCG GCTAGCGCCT TGCCAGGCGA TAGTTTGTAT GGGATCAAAC GTTGGAGCGA ATCGATCAGC TTGGTGTTTA CGCCCAGCGC TGAACAATTA ACTGCACGGA TCGATCTGGT CAACGAACGC CAGCATGAGA TCGCTAGCTT GGTAGCATTA AATAAACCAG TGCCAAGCGA ATTGCTTGAT GAAGTCGTCA ACGAAACTCA GTCAATTGAG TTGGCGCTTG CGCCGACCCA TACGAATGAC CCACGACGGA GCAAATTGAG CCAAGTCAAT CAGCAACTGC AAACGACAAT TGCAATTATT CCAGTGGAAA ATGCGACTGA CAATCTCAAA CATCATGATC TGATTGAGAC GCTTGATCAA AGCCGTCAGC GCATCGATAT TGCCAATCTG ACGACTATTC CGACCCAACC AGCGAAGGTT TTGGTTGGTC CAACAGCAAC CAAGCAGCCA AGTATCGCGC CAGCGGCAAC TGATCTGCCA CATATTGCAA ACCCAAAGCT CCCGCCAAAG CCACATACAC CAACGCAGGA GCCAACCGAA GTCGTTTTAC CAACCTCAAC GAGCACACCA TGGCCAACCG CGAAACCAAC GCGGGTGCCG CCGCCTGTGA TCAAACCAAC TGCCACTGCG ACTTCGCTAC CCACATCAGC CCCAACCGAT GTGCCAATGC CCACATCAGC CCCAACCGAT GTGCCAATGC CCACATCAGC CCCAACCGAT GTGCCAGTGC CCACCGAAAT TCCGAGCATC GGGTTACCAA CCGCCACGCC AACGATTGTG CGTGAGCAAC CAACTGCGCC TGTCCCAACC AAAGATCCTG GCGATATTAA GCCAACCGAA GTTCCACCAA CTTCACCACC AACTTCACCA CCACCAACGA AGGAGCCACC GCCGCCTTCA CCAACGAAAG AGCCAGAGGA TACCAAAGGT CAGCCAAACC CCATTCTCCA ATATTAA
|
Protein sequence | MNDNLSHMII DAWQNGASLE TLCQQYPHHA EAIQQLINPL IQLQRVNPPT MPARASHAQA DFMRLAQHYR AQTAPKPKPR RRLLTQRWVW ATATILLLVC LSGNLVLSAS ASALPGDSLY GIKRWSESIS LVFTPSAEQL TARIDLVNER QHEIASLVAL NKPVPSELLD EVVNETQSIE LALAPTHTND PRRSKLSQVN QQLQTTIAII PVENATDNLK HHDLIETLDQ SRQRIDIANL TTIPTQPAKV LVGPTATKQP SIAPAATDLP HIANPKLPPK PHTPTQEPTE VVLPTSTSTP WPTAKPTRVP PPVIKPTATA TSLPTSAPTD VPMPTSAPTD VPMPTSAPTD VPVPTEIPSI GLPTATPTIV REQPTAPVPT KDPGDIKPTE VPPTSPPTSP PPTKEPPPPS PTKEPEDTKG QPNPILQY
|
| |