Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4491 |
Symbol | |
ID | 5736342 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5751451 |
End bp | 5752899 |
Gene Length | 1449 bp |
Protein Length | 482 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641281654 |
Product | hypothetical protein |
Protein accession | YP_001547251 |
Protein GI | 159901004 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCGAGG CACATTTATT TGGGTTTGGC GGAGCACTCT GGTTAGCCTT GTTTATTGCC ACCCGTACAC CCCAACGCTC GCTGCTGTTT TGGGCCAGCA TTATCAGTCT ATTTGGCTTG ATGGGCTTTT TTGGCTCAGG TGTGCTCGGC GGCGAAGGGC TGAATCTTGA GCAACTGGTC AGTTTAGAAC GCGGGTTTTG GTGGAGTTCG GTACTGCCAA TTACGACTTG GCTGATCGTA TGTAGTTTGA TTCACCAAAC CTTGCAACCA AGTATTGCCA CGATCGTCAA GCCCGAACGC TGGGTCGTCG GCGTTATTGG ATTAATCCTC ATTGGCTTGG GCAGTTCCAG CAATTGGTTG ATGAATTATG CCGAGCCAGT GCTGCTTGCT AGTGGTAGTC AAATTATCGG CACAGGCCCA GCCTACCCAA TCTATAGTGC CTATGTGATG GGCTGTGTCA GCGTGGCGCT TTGGCATTTG GTGGCAAGTT GGCGCATTGC CGAAACAGGT ATGGCGCGGC GCAGTTTAGC TAGTTTGGTG TTGGGGGCTT TAGGGTTCTT AATTGGCACA AGTAGTTTGT TGGCTCGCTT AATCAGCACG GGTACATGGC CACTTTTCTA TGGGTATATG CCGATTTTTG CTGGCTTGTT GATTACGGGT TTTGGGTTGG TGCGATTTGG CTTATTGCTC CAAGGCCAGA ATGTGCTGCG CGATTTGATT TATAGTTTTT GCGAAATTAG CATCCTCGCC TTGATTTATT TAATTAGTGT GAATATTTTA GATCTGCTGC GGCCTAGCCA ATTGGCCTTG CTTTTGGCGT GTGTGATCAT CAGCCACACA GGGTTGGATC GTGGGCGACG TTGGCTTGAT CGTTTGTTCT TTTCGCGAGC TGAACAAGAG GCTCGTAGCC AATCGCGCGA ATTTGCGCTT GCCTTGGCCT CAACTCCTAC GCCAACCCCC GCTCCAGTAA TTGTTGATGC GAAGCCCGAT AAAGCCTGGA ACGATGCAGT GCGGCGAGCG ATCAGCGGCT TGAAAAATCC AGTTCAATTA GCCCAAAATC CCTTGCTAAG CAGCGCTTTG GTTAGCCATA GCGTGCAGAG TAAAGCGCTA GAGGATAATC GGCTGAATCG CAGTGCAATT GCCCGCGAAA TATTATTGCA AGCAATCGAG CAACTGCGGC CTGATGCCAG CCAAGCCTTA GGCAGTGGCG ATGCTTGGCG TTGGTATAAT GTGCTGTATC TGCCCTATGT GCGCGAAATC AACCGCAAAA CTGCGATCGA TTGGCTACGG CGCGGCCTCA GTGACCCATT AATTGATGCG AGTGTGTTAA GTTGGCTAGC TGATATTGAT GAAGATACCT TTTATAAGTG GCAGCGCCGC GCCTCAGATT TGATCGCGGC TCAATTGTGG GAGCAACAGT TGAAGTTGGT GCAACCAGTT ATTGCGTAA
|
Protein sequence | MVEAHLFGFG GALWLALFIA TRTPQRSLLF WASIISLFGL MGFFGSGVLG GEGLNLEQLV SLERGFWWSS VLPITTWLIV CSLIHQTLQP SIATIVKPER WVVGVIGLIL IGLGSSSNWL MNYAEPVLLA SGSQIIGTGP AYPIYSAYVM GCVSVALWHL VASWRIAETG MARRSLASLV LGALGFLIGT SSLLARLIST GTWPLFYGYM PIFAGLLITG FGLVRFGLLL QGQNVLRDLI YSFCEISILA LIYLISVNIL DLLRPSQLAL LLACVIISHT GLDRGRRWLD RLFFSRAEQE ARSQSREFAL ALASTPTPTP APVIVDAKPD KAWNDAVRRA ISGLKNPVQL AQNPLLSSAL VSHSVQSKAL EDNRLNRSAI AREILLQAIE QLRPDASQAL GSGDAWRWYN VLYLPYVREI NRKTAIDWLR RGLSDPLIDA SVLSWLADID EDTFYKWQRR ASDLIAAQLW EQQLKLVQPV IA
|
| |