Gene Information |
Locus tag | Hhal_0005 |
Symbol | |
ID | 4710076 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 4692 |
End bp | 5675 |
Gene Length | 984 bp |
Protein Length | 327 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639854461 |
Product | aminodeoxychorismate lyase |
Protein accession | YP_001001602 |
Protein GI | 121996815 |
COG category | [R] General function prediction only |
COG ID | [COG1559] Predicted periplasmic solute-binding protein |
TIGRFAM ID | [TIGR00247] conserved hypothetical protein, YceG family |
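
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are illustrative and do not come from the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that includes its stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the record for Hhal_0005.
length = gene_length_bp(4692, 5675)
print(length)                     # 984
print(protein_length_aa(length))  # 327
```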
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0000992333 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | GTGGCTCTGG TGTTTGTTGT GTCGTCGGGA GTCGGCGGAT GGCTGTTCTA CGAGCTCGAT CACCGTCCAC TGGAGGTCAG CGCCCCCCCC GAGATCCTCG AGGTCCCCCG CGGAGGGTCC TTGCACGCCA TCTCCCGGGG TCTCGAGTCC CGTGGCTGGA TCCCCGGATC GACGCGCCTC GCGTTGCGAA TCTACGGTCG CCTCAGCGAC ATCTCGGGTG AGCTCAAGGC CGGCGAGTAT GTCGTCGAGC AGGGTATGAG CGTGCGTCAG CTGCTGGCGC GGATCCGTGC CGGGCGGGTC AAGCTGCACC GCCTGACCGT CGTCGAGGGT TGGACGTTCG CGCGGCTGCG CCAGGAGCTG GGCCAACACG AGGCCGTGGA GCAGACCCTG GACGGGGTGG AAGACGAGCA GATCATGGAG GAGCTGGGGC TCGAGGCGTC TCACCCCGAG GGGATGTTTT TCCCGACCAC CTACCGCTTT CCGCGTGGCG CGACCGACCG TGATCTGCTC CGGGTCGCTG CTCGGCAGAT GCGCCAGGAG CTGGCGCGGG TGTGGAGCGA GCGCCACCCG GAGGTGCCCC TGGACGAGCC CTACCAGGCG CTGATCCTCG CCTCGATTAT TGAGCGCGAG ACCGGGCGCG ATGATGAGCG CCGCAAGGTG GCGGGGGTGT TCACCCGGCG CCTGGAGCAG GGTATGCGCC TGCAGACGGA TCCCACCGTG ATCTACGGTC TGGGCGATGA CTACGACGGC CGCCTGCGCC GGGCTGACCT GCGACGCGAT ACGCCCTACA ACACGTATAC GCGCCACGGT TTGCCGCCGA CCCCGATCGC CCTTCCCGGG CGGGCGTCGT TGGAGGCCGC CGTGGACCCG AAACCGGGTA GTGCGCTGTA CTTTGTGTCA CGCGGGGATG GCAGCCACCA CTTCTCGGAT ACGCTGGACG AGCATAATCA GGCGGTGCGA CGCTACATCC TGGAGGAGAA GTGA
Protein sequence | MALVFVVSSG VGGWLFYELD HRPLEVSAPP EILEVPRGGS LHAISRGLES RGWIPGSTRL ALRIYGRLSD ISGELKAGEY VVEQGMSVRQ LLARIRAGRV KLHRLTVVEG WTFARLRQEL GQHEAVEQTL DGVEDEQIME ELGLEASHPE GMFFPTTYRF PRGATDRDLL RVAARQMRQE LARVWSERHP EVPLDEPYQA LILASIIERE TGRDDERRKV AGVFTRRLEQ GMRLQTDPTV IYGLGDDYDG RLRRADLRRD TPYNTYTRHG LPPTPIALPG RASLEAAVDP KPGSALYFVS RGDGSHHFSD TLDEHNQAVR RYILEEK
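
The gene sequence begins with GTG, yet the protein begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is an accepted alternative start codon and is read as methionine at the initiation position. The following is a minimal sketch of that rule, not the database's own pipeline. It hand-builds the codon table instead of using a library, and the names `translate_cds` and `gc_content` are illustrative. It also assumes the sequence contains only A, C, G and T, apart from the spaces used for grouping.

```python
# Amino acids for all 64 codons in NCBI order (bases T, C, A, G).
# Table 11 uses the same codon-to-residue mapping as the standard code;
# it differs in which codons may initiate translation.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons listed for translation table 11.
START_CODONS_TABLE_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Drop the whitespace used to group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading a table 11 start codon as M and stopping at '*'."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS_TABLE_11:
            protein.append("M")  # initiator methionine, even for GTG
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
print(translate_cds("GTGGCTCTGG TGTTTGTTGT GTCGTCGGGA"))  # MALVFVVSSG
```

Passing the full gene sequence to `translate_cds` and `gc_content` in the same way allows the result to be compared with the protein sequence and GC content fields of the record.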