Gene Information |
Locus tag | Haur_0655 |
Symbol | |
ID | 5732555 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 752546 |
End bp | 753919 |
Gene Length | 1374 bp |
Protein Length | 457 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 641277784 |
Product | hypothetical protein |
Protein accession | YP_001543431 |
Protein GI | 159897184 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
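The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive (the record does not state this, but its numbers are consistent with it) and that the CDS ends in a single stop codon. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: codons minus one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(752546, 753919)
print(length)                     # 1374
print(protein_length_aa(length))  # 457
```

Both values agree with the Gene Length and Protein Length fields of this record.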
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTATGT TTCCACCAAC CGCCACCAGT AGCATCATTA GTCAGCACCT GCATGGGCAA CGCGTGGCTG ATGATTTTCA GCGCTTGCAG CAGTTTCGCG CGGCGTGGGA AGCCTATCGC AAACAGGATA CGCCCCCGCT GCCAATCGAA GCCGGACGAC CCAACGACAA CGTGCGGGTC AATCTGATTA AGCTGATTGT CCAGACCTCA GTCTCGTTCT TGTTTGGCGA TCAGGTGGCG TTTAGCGTGC CTGATGCCCC GCCTGACGAC CCACGCATCC AGTATCTTGA TCGTATCTGG AATGCGGCCA AAAAGATGAC GTGGCTCTAC AAACAAGGCA TCAACGGGGG CTTGTGTGGC CACATCTTTG CGCGGATCTA TCCAGAGCAG CCGCTGCCGC GCATCGTGAT TCTCGATCCG GCGACCGTGA CCGTGCGCTG GGCGGCGGAC GATCTGGATC GGGTGGAATG CTACCTGATT GACTACAGTG CCTACGAATG GCAGGGCAAT CGTGAAGTGC TCGTGAGTAT CCGCCAACGC ATCGAGCGCA CTGATACCGG CTGGCGCATT CTTGACCAAC GCAGCCAGCC CGATAAACAG GCCTTTGATA CCCTGCATGA AGCGCCATGG CCCTATCCGT TCCCGCCGAT CGTCGATAGT CAAAACCTCG CCGCGCCGAA TGAATACTGG GGCGAGGCCG ATATCACGCC CGACCTGATT GCCCTGGTCA ATGCGGGCGA TTTCACCTTA TCCAATCAAC AACGGATTGT GCGCTTCTAT GCCCAGCCGC TCACCGTGGC CAAGGGGATG GACCCTGCGC AGCTCAAGCA TGGGCGCGAC CGCACGGTCT TTATTCCGCC CAATGCCGAC CTGAGCATTG TCGAGATGCA GAGCGAGTTG GCGGCCAGTC TTGACCACCA CAGCCGGATC AAGAGCAGTA TTCACGAGGT AGCCCAAACA CCAGAGATTG CGACGGGCAA GGTCGAGGAT CTGGGCAACC TGAGCGGCTT AGCCTTGCAA ATCCTCTATG GCCCCTTACT GCAAAAGACC GTGCAAAAAC GCATGCTCTA TGGCGATTTC CTGAGCGAAC TCAATCGCCG CTTGCTGATG ATTGGCGGCT TTCCCGAGAC CACGACCACG GTGGTATGGG ATTCGATGCT GCCCACCGAC CCACAGGCCG AACGCGTCGT CGCACAGGCC GATCGCACGC TCGGCGTAAG TGAAGCCACG CTGTTGACCC AACTGGGCTA TGACCCCGCC CATGAACGCG AGCAGCGGCA GGCCGAAACG CCGCCCGACC ACGACCCCAT GACATCCAAA GGAGATGCTG ATGATTCGAA TCCTGATGCC GCAGCGCCGA CTGAACGCCG ATGA
|
Protein sequence | MGMFPPTATS SIISQHLHGQ RVADDFQRLQ QFRAAWEAYR KQDTPPLPIE AGRPNDNVRV NLIKLIVQTS VSFLFGDQVA FSVPDAPPDD PRIQYLDRIW NAAKKMTWLY KQGINGGLCG HIFARIYPEQ PLPRIVILDP ATVTVRWAAD DLDRVECYLI DYSAYEWQGN REVLVSIRQR IERTDTGWRI LDQRSQPDKQ AFDTLHEAPW PYPFPPIVDS QNLAAPNEYW GEADITPDLI ALVNAGDFTL SNQQRIVRFY AQPLTVAKGM DPAQLKHGRD RTVFIPPNAD LSIVEMQSEL AASLDHHSRI KSSIHEVAQT PEIATGKVED LGNLSGLALQ ILYGPLLQKT VQKRMLYGDF LSELNRRLLM IGGFPETTTT VVWDSMLPTD PQAERVVAQA DRTLGVSEAT LLTQLGYDPA HEREQRQAET PPDHDPMTSK GDADDSNPDA AAPTERR
|
| |
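The sequences above are printed in space-separated blocks of ten. A minimal sketch, assuming plain Python with no external libraries, of how such a field could be cleaned, checked for GC content, and translated. The amino acid assignments are those of the standard code, which translation table 11 shares; alternative start codons permitted by table 11 are not handled, which does not matter here because the gene starts with ATG. All names below are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first base T, C, A, G; then second; then third).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the last 24 bases of the gene sequence in this record.
print(translate("ATGGGTATGT TTCCACCAAC CGCCACCAGT"))  # MGMFPPTATS
print(translate("GCAGCGCCGACTGAACGCCGATGA"))          # AAPTERR
```

The two printed fragments match the start and end of the protein sequence field. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.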