Gene Hhal_2208 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_2208 
Symbol 
ID4709200 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp2422735 
End bp2424120 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content72% 
IMG OID639856683 
Productpara-aminobenzoate synthase, subunit I 
Protein accessionYP_001003774 
Protein GI121998987 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0392675 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTCCTTAC CCCTGCAATC TCGATCCCTG CCGTACATCG GTGACGGGGC AACGGCGATG 
CGCGTCCTGG GGGGCGAACC GTGGTCAGCG TGGCTCGACT CGGGCCACGG CGGCTGTGCC
GGGGCGCGTT ACGACATCCT CGTCGCCCGG CCGACGGTCA CGCTGATCGC AGCCGGCGGG
CAGACCACCA TCCGGCGCGG CGAGCGGGTC GAGCGCCGGC ACGGCGATCC GCTGGCCCAC
CTGGCTGCTG AGCTCGATGC CCTCGGCCCG CTCCCCGTTG ACCCTCGGTG GCCGTTCACC
GGCGGCGCGG TGGGGTATTT CGGCTACGAC CTGGGGCGCC GCCTGATGGG GGTTCCGGGT
GCCGATCCCG CGTTGCCCGA GATGGCCGTG GGGATCTACG AGCACGCGGT AATCACCGAC
CATCGCCACG AATGCAGCAC TGCGGTGGGG CGACGCCTGG ATGAGGCCTG GCTGGCGGAC
GTGGCCTGCC GCCGGGAGAC GGGGGCGAGA CCGCAGCCGT GGTCGACCGC AGGTCCGGTC
CTCCGTGAAC CGGACGCCGA TGGGTACGCG GCCGCTTTCC GTCGGGTGCA GGGGTATCTG
CACGCCGGTG ACTGCTACCA GGTCAATCTG GCCCGGCGCT TCTCGGTGCC CTGCTGCGGG
GATCCCCAGG CGGCGTACCT CGCCCTGCGC GCAGCCTCGT CGGCGCCCTT TGCGGCGTGG
CTCCGCTTCC CCGGGGGCGA TGTGCTCAGC CTCTCGCCGG AGCGCTTCTT GCACATCGAC
GGCGACGGGC GGGTGACCAC CGAGCCGATC AAGGGCACCC GGCCGCGGTT CACCGATCCC
GCCGAGGATG AGGCGGCCCG CCGGGACCTG CTGGGCAGCG CCAAGGATCG GGCCGAGAAC
GTGATGATCG TCGATCTGCT GCGCAACGAC CTGGGCAAGG GGTGCGAGGT GGGCAGCGTG
CGGGTGCCGT CGCTCTGCCG CGCCGAGCGC TTTGCCAGCG TGCACCACTT GGTCAGTACT
GTCACCGGGC GCCTGGCCCC GGGGCGGCGC GCCACCGATC TGCTGCGCGA TTGCCTGCCC
GGTGGCTCCA TCACCGGGGC GCCGAAGCGC CGTGCCATGG AGATCATCAC CGAGCTCGAG
CCGGGACCGC GCGGGGTCTA CTGTGGGGCC ATCGGTTACC TGGGGCTGGA TGGCCGCATG
GACACCAGCA TTGCCATTCG CACCGCGACG TGCAGCGACG GCAGTATGAC CTACTGGGCC
GGTGGCGGGG TGGTGGCGGA CTCCACCGCT GCCGCCGAGC TCCAGGAGAC GGAAGACAAG
GCCGCTGGTT TTCTGTCGCT GGCCGAGGGC GGCCAGGCCG CGGCAGGGGT CAGGCCTCGC
CGCTGA
 
Protein sequence
MSLPLQSRSL PYIGDGATAM RVLGGEPWSA WLDSGHGGCA GARYDILVAR PTVTLIAAGG 
QTTIRRGERV ERRHGDPLAH LAAELDALGP LPVDPRWPFT GGAVGYFGYD LGRRLMGVPG
ADPALPEMAV GIYEHAVITD HRHECSTAVG RRLDEAWLAD VACRRETGAR PQPWSTAGPV
LREPDADGYA AAFRRVQGYL HAGDCYQVNL ARRFSVPCCG DPQAAYLALR AASSAPFAAW
LRFPGGDVLS LSPERFLHID GDGRVTTEPI KGTRPRFTDP AEDEAARRDL LGSAKDRAEN
VMIVDLLRND LGKGCEVGSV RVPSLCRAER FASVHHLVST VTGRLAPGRR ATDLLRDCLP
GGSITGAPKR RAMEIITELE PGPRGVYCGA IGYLGLDGRM DTSIAIRTAT CSDGSMTYWA
GGGVVADSTA AAELQETEDK AAGFLSLAEG GQAAAGVRPR R