Gene Information |
Locus tag | Haur_4051 |
Symbol | |
ID | 5735909 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5168589 |
End bp | 5169791 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641281202 |
Product | hypothetical protein |
Protein accession | YP_001546811 |
Protein GI | 159900564 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAAAGC TTAAAAATTA CGTAGTCTCC TTGGATGATG TGTCCACCGT CGAAGAACTC AAGGCCGCCT TGCGTGAACG GCGTTTGGTT GGCGTGACCC AACTAACCCT CAATGGGCGC AATTGGGGTG ATGCTGTCGC GCATAGCTTG GCTCGCTCAC CGCATATGGC CACCGTGCGC GTGCTTGACC TCAGCGAAAA TCAATTAACC GATGAAGCTG TGATTGCCTT GGCGCAATCG CCCCACTTAA GCCAATTGCG CACGCTGCGC TTGCAACGCA ACCAAATTGG CGATGCTGGC TTGATTGCCT TGGCGCAATC GCCCTACCTG AGCAACCTCA ACGAATTAGA GCTGTATCAA AATGCAATTG GCGCGGCGGG TGCTCAAGCA CTCAGCCATT CGACCAGCCT AACCCAATTA ACCAAACTAC GGCTTTATCG CAATCAACTG GGCGCAGCTG GCGGCGCGGC CTTGGTCGAA GGTCAAGCCT TGGTTAATCT CCAGTTGCTC GATATTGATC ACAACGATTT GGGCGATGCT GGAATTAGTG CTTTGGCCGC CGCCCAACAT TTGACCCAAC TCACCTCGCT CAATTTAAGC AACAACAATA TTGGGCCAGC AGGGGTGAAG GCGCTGGCCG AAGATGAACA ATTGGGCAAC GTAACCCAGC TTGATCTCAG CGATAATGAT GTGGGGTTTG CTGGAGCCAA AGCCTTAGCC GAATCGCCCT ATCTGCGCAG CATTGTGACG CTCAATTTGG CCTCAACCAA TATCGATGCT GCTGGCATTA GCGCCCTTGC CAATTCGCCA GTGCTTGCCA ACACCGAACG CCTTGATTTG CGCTCGAATG CAATTGGCGA TGCTGGGTTG AGTGCCCTAG CTCAATCGAG TCATGTCACT AATCTCAAGG CCTTGTTGCT CAACGATAAT GCGATTGGCG ATGCTGGAGT GCAAGCCTTG ACCACCAGCA CCAGTCTTGG CAATTTGGCG GTGTTGTATC TCGCCGATAA CGCAATTGGT GATGCTGGCG CAACCAGTTT GGCCAACACA ACCAACTTGG GCCAATTGCA AGAGCTTGAT TTATGGGGCA ATCAGCTAAC CCGTAGCAGC CAAGAAGCCT TTAGTCAATC GCCAACGCTG AATAATTTGG TGCTGCTCGA TTTAGGCATC AGCGACGACG AAGACGAGGA CGATATCTTC TAA
|
Protein sequence | MAKLKNYVVS LDDVSTVEEL KAALRERRLV GVTQLTLNGR NWGDAVAHSL ARSPHMATVR VLDLSENQLT DEAVIALAQS PHLSQLRTLR LQRNQIGDAG LIALAQSPYL SNLNELELYQ NAIGAAGAQA LSHSTSLTQL TKLRLYRNQL GAAGGAALVE GQALVNLQLL DIDHNDLGDA GISALAAAQH LTQLTSLNLS NNNIGPAGVK ALAEDEQLGN VTQLDLSDND VGFAGAKALA ESPYLRSIVT LNLASTNIDA AGISALANSP VLANTERLDL RSNAIGDAGL SALAQSSHVT NLKALLLNDN AIGDAGVQAL TTSTSLGNLA VLYLADNAIG DAGATSLANT TNLGQLQELD LWGNQLTRSS QEAFSQSPTL NNLVLLDLGI SDDEDEDDIF
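The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the source record. It assumes the start and end positions are 1-based and inclusive, and that the final codon (TAA in the gene sequence above) is a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 5168589
END_BP = 5169791


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


GENE_LEN = gene_length(START_BP, END_BP)
PROTEIN_LEN = protein_length(GENE_LEN)

print(f"Gene length: {GENE_LEN} bp")
print(f"Protein length: {PROTEIN_LEN} aa")
```

Under these assumptions the computed values agree with the Gene Length (1203 bp) and Protein Length (400 aa) fields listed in the record.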