Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_1743 |
Symbol | |
ID | 4710468 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 1913771 |
End bp | 1915069 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 639856211 |
Product | hypothetical protein |
Protein accession | YP_001003309 |
Protein GI | 121998522 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGGAGC CACAGACCAG CCCGTTGTGG CTCGCCGTCC ACCTGCCTAC GCTCACCGCC GAGGCTCCCT CGGCCTCGGC GGCCCCCACC CTGGAGCAGA TCGCCCTATG GGGGCTGGAA CTCACCCATC AGGCGAGCCT CGAGCCCCCG GACACCGTCT TCCTCGAGGT CGGCGGCAGC CAGCGCCTGT TCGGCGGCCG GGCCGCGATC CAGCAGCGCG CCCAAGAGGG CCTGCGCGAA TTGGGCCAAC CCCGGGCCGC ACTGGCCGCT GCCCCCACCC CGCAAGGCGC CCGACTGCTC GCCCGGGCCA GCCCCGGGAT CTGGCTCACC GACCGCCAGT CGCTCCGCCG CGCGCTGATG CCCCTACCGT GCCACCTCCT CGATCCGACA CCGGCCCAGC AGTCCGCACT GAGCACCTTG GGCCTGACGC GGTTGGGCGA TTGCCTGCGC ATGCCCCGCA GCGGTCTGCG CCGCCGCATC GGCGACCCCC CTATCCGCAC CCTGGAGCAG GCCCTCGGCG AGCGCCCCGA GCCGCGCCGC TGCATCCCAC CACCCCAGCG CTACCGTGGT CGCCTGGAAC TCCCGGCCCC GACCGCAGCC ACTCAGGCAA CCGGCTTCGC CCTGCAGCGC CTGCTGCGCG CCCTAGTCGG CATGCTTCGC GGCCTCGATG CCGGGATCCA GCAGGCCGCG GTCGCCCTGG AGCACCCGGA TGGTCCGGAT ACGCGGCTGA CCCTGGGCTT CCTGCGTCCG ACCCGCGACC TGGAGCACAT GGCCCACATC GCCCGCCACC GGCTGGAACG CCAGGCACTC CCCGATGTGG CCACCGCCGT CCGTCTCGAG GCGGATCAAC TCCTGCCCTA CCAAGGGACC AGCGGCGACC TCTTCGAGCG TACGGGCGCC GATGGCGAAG CCGTGCGCAC CCTCAGCGAG CGACTCATCG CCCGCTTGGG GGCCGATTGC GTCCAACGCC TGGCTACCTA CCCCGACCCA CGTCCGGAGC GTGCCTGGTG CCGCCTACCG TTGGAACACC CGGACCGGCC CCCCGCCGCA ACGCTGCCGC GGCCGGTCTG GCTCTTGCCA CACCCGCGGC GTCTACTCAG CGGCGAGGGC GGAGAACCGC ACTGGGGCGG CCCGCTGTGC CTGGAAGCGG GTCCGGAACG GATCGAAAGC GGCTGGTGGG ACGACGAGGA CGTGGCCCGC GATTACTACG TCGCGCGCGC CCCGGTCGGC AGCCGGCTCT GGGTCTATCG CGACCGTCGC CCGCCCTACG GTTGGCACCT GCACGGCTTC TTTGCCTGA
|
Protein sequence | MPEPQTSPLW LAVHLPTLTA EAPSASAAPT LEQIALWGLE LTHQASLEPP DTVFLEVGGS QRLFGGRAAI QQRAQEGLRE LGQPRAALAA APTPQGARLL ARASPGIWLT DRQSLRRALM PLPCHLLDPT PAQQSALSTL GLTRLGDCLR MPRSGLRRRI GDPPIRTLEQ ALGERPEPRR CIPPPQRYRG RLELPAPTAA TQATGFALQR LLRALVGMLR GLDAGIQQAA VALEHPDGPD TRLTLGFLRP TRDLEHMAHI ARHRLERQAL PDVATAVRLE ADQLLPYQGT SGDLFERTGA DGEAVRTLSE RLIARLGADC VQRLATYPDP RPERAWCRLP LEHPDRPPAA TLPRPVWLLP HPRRLLSGEG GEPHWGGPLC LEAGPERIES GWWDDEDVAR DYYVARAPVG SRLWVYRDRR PPYGWHLHGF FA
|
| |