Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_2208 |
Symbol | |
ID | 4709200 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 2422735 |
End bp | 2424120 |
Gene Length | 1386 bp |
Protein Length | 461 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 639856683 |
Product | para-aminobenzoate synthase, subunit I |
Protein accession | YP_001003774 |
Protein GI | 121998987 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0392675 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTCCTTAC CCCTGCAATC TCGATCCCTG CCGTACATCG GTGACGGGGC AACGGCGATG CGCGTCCTGG GGGGCGAACC GTGGTCAGCG TGGCTCGACT CGGGCCACGG CGGCTGTGCC GGGGCGCGTT ACGACATCCT CGTCGCCCGG CCGACGGTCA CGCTGATCGC AGCCGGCGGG CAGACCACCA TCCGGCGCGG CGAGCGGGTC GAGCGCCGGC ACGGCGATCC GCTGGCCCAC CTGGCTGCTG AGCTCGATGC CCTCGGCCCG CTCCCCGTTG ACCCTCGGTG GCCGTTCACC GGCGGCGCGG TGGGGTATTT CGGCTACGAC CTGGGGCGCC GCCTGATGGG GGTTCCGGGT GCCGATCCCG CGTTGCCCGA GATGGCCGTG GGGATCTACG AGCACGCGGT AATCACCGAC CATCGCCACG AATGCAGCAC TGCGGTGGGG CGACGCCTGG ATGAGGCCTG GCTGGCGGAC GTGGCCTGCC GCCGGGAGAC GGGGGCGAGA CCGCAGCCGT GGTCGACCGC AGGTCCGGTC CTCCGTGAAC CGGACGCCGA TGGGTACGCG GCCGCTTTCC GTCGGGTGCA GGGGTATCTG CACGCCGGTG ACTGCTACCA GGTCAATCTG GCCCGGCGCT TCTCGGTGCC CTGCTGCGGG GATCCCCAGG CGGCGTACCT CGCCCTGCGC GCAGCCTCGT CGGCGCCCTT TGCGGCGTGG CTCCGCTTCC CCGGGGGCGA TGTGCTCAGC CTCTCGCCGG AGCGCTTCTT GCACATCGAC GGCGACGGGC GGGTGACCAC CGAGCCGATC AAGGGCACCC GGCCGCGGTT CACCGATCCC GCCGAGGATG AGGCGGCCCG CCGGGACCTG CTGGGCAGCG CCAAGGATCG GGCCGAGAAC GTGATGATCG TCGATCTGCT GCGCAACGAC CTGGGCAAGG GGTGCGAGGT GGGCAGCGTG CGGGTGCCGT CGCTCTGCCG CGCCGAGCGC TTTGCCAGCG TGCACCACTT GGTCAGTACT GTCACCGGGC GCCTGGCCCC GGGGCGGCGC GCCACCGATC TGCTGCGCGA TTGCCTGCCC GGTGGCTCCA TCACCGGGGC GCCGAAGCGC CGTGCCATGG AGATCATCAC CGAGCTCGAG CCGGGACCGC GCGGGGTCTA CTGTGGGGCC ATCGGTTACC TGGGGCTGGA TGGCCGCATG GACACCAGCA TTGCCATTCG CACCGCGACG TGCAGCGACG GCAGTATGAC CTACTGGGCC GGTGGCGGGG TGGTGGCGGA CTCCACCGCT GCCGCCGAGC TCCAGGAGAC GGAAGACAAG GCCGCTGGTT TTCTGTCGCT GGCCGAGGGC GGCCAGGCCG CGGCAGGGGT CAGGCCTCGC CGCTGA
|
Protein sequence | MSLPLQSRSL PYIGDGATAM RVLGGEPWSA WLDSGHGGCA GARYDILVAR PTVTLIAAGG QTTIRRGERV ERRHGDPLAH LAAELDALGP LPVDPRWPFT GGAVGYFGYD LGRRLMGVPG ADPALPEMAV GIYEHAVITD HRHECSTAVG RRLDEAWLAD VACRRETGAR PQPWSTAGPV LREPDADGYA AAFRRVQGYL HAGDCYQVNL ARRFSVPCCG DPQAAYLALR AASSAPFAAW LRFPGGDVLS LSPERFLHID GDGRVTTEPI KGTRPRFTDP AEDEAARRDL LGSAKDRAEN VMIVDLLRND LGKGCEVGSV RVPSLCRAER FASVHHLVST VTGRLAPGRR ATDLLRDCLP GGSITGAPKR RAMEIITELE PGPRGVYCGA IGYLGLDGRM DTSIAIRTAT CSDGSMTYWA GGGVVADSTA AAELQETEDK AAGFLSLAEG GQAAAGVRPR R
|
| |