Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5009 |
Symbol | |
ID | 5736968 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | - |
Start bp | 15565 |
End bp | 16749 |
Gene Length | 1185 bp |
Protein Length | 394 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641282176 |
Product | hypothetical protein |
Protein accession | YP_001547767 |
Protein GI | 159901521 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000000000976322 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTCTCT TAGAACGCAA ACGTGCAACC GTCTATGACC ATGTGCGGGT GAACCAACCC CTGCGTGATC TCCACTACCA ACTGCCTGCG TGGATCACGA TTGCTGACCA ACCCTACCTT GCAGTGGGCG ATGACAATGG TAATGGGGCC AAAAAGATTG CGGTGTTGGA TGCCAAATCG CGGTTGATTA CGACCCGCAC GCCCACCGCC TATAAATTGG CCAAGGCCAT TCGGGCGGGT CAAGGGGTGA TCACCTATCG GGTCAATGGC GGCGATAGTT TTTGGATTGG CGATGATGCC TTACGCTTTG ATGGCGATGC GTTGCCCATT GGTGGCACCA GCCAACGCCT CTCCGATACC CGCCAGCGCT CGTTTAATGC GGCCTGTATG GTCGAAACCC TGATCAAAGC GCGGTATAAG CCGGGTGTCT ACCCCTTAGC CGTGGGCTTC GCCATCCCGA ATGAAGAAAT TGAGTCGCGG GACAATGACA AAATGGGGGT GAATCCCGAG ACCCGCACGG CCCTCAAGAC CCATTTGAAT GGGCAAACCT TTGTTGTGGA GCGCACCGAT GCCTTGGGCG TGGTAACCAA CTGGACGCTG CGCTATGAAA AGATCATCCC GCAAGCCCAG TCGATCGGGA CGTTGTATGC GTGGTCACGC ACCGTTGATG GCTCGTTAGA GGCCGACGGG ATTCGTCGCG TCTCGATTGT CGATATCGGT GGAGGTGATA CCCAACTGAC CGAAGTGGAA CTGAATCCCT ACCGCATGAG TGCCGAACGC TTGGGTGCGG GCACCATCAG TATTGCCCGG GAGTTGGCGG CGAAGTTTCA TCGGTTGCGG TTGAGTGATG CCCAAGCCCA ATATGCGCTC GAAACGCAGT TGTTAGAGGA GTCGGGACGC GAATTTCCGA TTGAAAGCGA AGTCAATGCG GCGATTCAAA GTGCCGGACA AGACTTAGTT GGCCGGATGC TGAAGGTACT CCAGCAGCCG AGCGCCTACG TGATCATTAC CGGAGGTGGG GTGAAATTGC AAGGGTTGCG GCGCTTGATT GAAGAACGGG CCGAGGCATC CGGCAAAACG GCTCCCCGCA ACTACACGAT CATTGATCCG AGCGTCGCAG ATATCCTGAA TGCGACGGGG GCACTGCTGG CGGTGGTCTA TGCGGCGGCA GGGAAAGGAG CCTAA
|
Protein sequence | MTLLERKRAT VYDHVRVNQP LRDLHYQLPA WITIADQPYL AVGDDNGNGA KKIAVLDAKS RLITTRTPTA YKLAKAIRAG QGVITYRVNG GDSFWIGDDA LRFDGDALPI GGTSQRLSDT RQRSFNAACM VETLIKARYK PGVYPLAVGF AIPNEEIESR DNDKMGVNPE TRTALKTHLN GQTFVVERTD ALGVVTNWTL RYEKIIPQAQ SIGTLYAWSR TVDGSLEADG IRRVSIVDIG GGDTQLTEVE LNPYRMSAER LGAGTISIAR ELAAKFHRLR LSDAQAQYAL ETQLLEESGR EFPIESEVNA AIQSAGQDLV GRMLKVLQQP SAYVIITGGG VKLQGLRRLI EERAEASGKT APRNYTIIDP SVADILNATG ALLAVVYAAA GKGA
|
| |