Gene Cpha266_0499 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_0499 
Symbol 
ID4569269 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp547701 
End bp549548 
Gene Length1848 bp 
Protein Length615 aa 
Translation table11 
GC content50% 
IMG OID639765098 
Productpara-aminobenzoate synthase, subunit I 
Protein accessionYP_910980 
Protein GI119356336 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase
[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTGATC GGGAATCTTG CAGGATTCAG CGGCTTCTTC TCCATGAGGG GACAGTCTGG 
CTGGACGAAT CGTATGGTGC TCGACATGAG GGCGAAAGCC TGCTTTTTTC CGAACCTCTT
GAGATGCTCG CTCTTTTTTC GGCAAGCGGT ATCAGGGAGT TTTTTACTCT TCTCGAACAA
AAACTCGACA GCGGGTATTG GCTTGCCGGA TGGATGGGCT ATGAGGCAGG ATACGGTTTT
GAGTCGGAAC GCTTTCTTGC GAACTGCGAT CCAGAGAGAG CTGCGCCTCT TGCCTGGTTT
GGCGTTTATC GGCCCCCGGA GCGATTTGCC TCTATTGTGA GCGAGAGGAT GTTTTCTCAG
GTAGCTGTCG ATGAACCGTT CATGATCAGC GATTTTCGTT TTAATCGCAC TCCTGCCGAG
TATTTTCAAG ATGTTGAGAG GGTGAAACGG GAGATTGCCA AAGGAAATGT TTATCAGGTT
AACGTTACCG GCCGGTTTCG TTTTTCTTTT CATGGTTCAG CTCAGGCATT GTTCGGTGCG
TTGTATCCGC AGCAGCCTTC GGTGTATTCC GCTTTTATCA ATACCGGTCG GCATCAGGTG
CTCTCTTTTT CTCCCGAGCT TTTTTTTCGG CGCCGGGCTT GTACGATTGA AGCTATGCCG
ATGAAAGGGA CGGCTCCGAG AAGCGGCGTT GTTGAAGAAG ATAATCGACT GAAGAGGCAA
TTATCGCTCT GCGAGAAGAA TCGGGCTGAA AACCTGATGA TTGTCGATCT GTTGCGTAAC
GATCTTGGCA GAATATGCAG ATCAGGTTCA GTTGAGGTTT CGGGGATGTT TGCCACTGAG
ACCTATCCTA CGCTTCACCA GATGGTGTCA ACTATTCGCG GGGAACTGAA GGATGAAAAC
GGTCTTTTCG AGACGTTTCG CGCTCTGTAT CCCTGCGGTT CCATTACGGG AGCGCCAAAA
ATCAGCGCAA TGCAGTTGAT CCGTGAACTT GAGCCGGGCT TGAGGGGGTG CTACACCGGT
ACCATCGGGT ATATCAATCC TCGGAGAGAT ATGGTTTTCA GTGTTGCGAT TCGTACGCTT
GAACTGTCCG GCAATGAGGG CGTCTATGGT TCGGGCGGCG GCATCGTCTG GGATTCCGAT
CCGGAGGAGG AGTACCAGGA GTGCATGCTG AAGGCAAAAA TTCTTGATGA TGTTACCGCA
GAGAGCGTTG AGCTGTTTGA GAGCATGCTT TTTTCCGGCA GATTTCTCTG GATGCAGGAG
CATCTTGGAC GGCTTGCGGC TTCGGCAAGA GTTCTCGGAT TTGCTTTCGA TGCTGCAGAG
GCTGCAGCCC GACTTGCGGC CCTTGCTGAT ATTCTTTTCA AGGCAGGCGG ACGTTTCAAG
GTCAGACTCG CTCTTGAAAA AAACGGGCGG ATGGCCATCA GCTATGAATC GCTGCTGTCT
GATACAGGTC GCGTTCCCCT GAAGCTCTCT CTTGCGGATG GATTTATCGA CTCATCAAAT
TCGCTTCGAT TTCATAAAAC CAGTTCGCGA AAACGGTACG AACGGTATTA CCGGAAAGCT
CTTGAGAGCG GTTATGACGA GGTTGTGTTT CTCAATGAAC GCCAGGAGGT GACGGAAGGG
GCGATCAGCA ACATTATTAT TCTCAAAAGC CGACGCTATG TTACTCCCTC TCTTGGTTCA
GGATTGCTTG ACGGCATTTA TCGACGTTAT TTTCTTTCTC TTCATCCGAA TGCTTCAGAA
CAAGTGCTCA CGCTGAAGGA TCTTTTCGAG GCTGATCAGG TTTTTATCTG CAATTCGGTT
CGAGGATTGC GTCCCGTCGT GTTCGATGGG TTTGTTATCT CCATGTAA
 
Protein sequence
MRDRESCRIQ RLLLHEGTVW LDESYGARHE GESLLFSEPL EMLALFSASG IREFFTLLEQ 
KLDSGYWLAG WMGYEAGYGF ESERFLANCD PERAAPLAWF GVYRPPERFA SIVSERMFSQ
VAVDEPFMIS DFRFNRTPAE YFQDVERVKR EIAKGNVYQV NVTGRFRFSF HGSAQALFGA
LYPQQPSVYS AFINTGRHQV LSFSPELFFR RRACTIEAMP MKGTAPRSGV VEEDNRLKRQ
LSLCEKNRAE NLMIVDLLRN DLGRICRSGS VEVSGMFATE TYPTLHQMVS TIRGELKDEN
GLFETFRALY PCGSITGAPK ISAMQLIREL EPGLRGCYTG TIGYINPRRD MVFSVAIRTL
ELSGNEGVYG SGGGIVWDSD PEEEYQECML KAKILDDVTA ESVELFESML FSGRFLWMQE
HLGRLAASAR VLGFAFDAAE AAARLAALAD ILFKAGGRFK VRLALEKNGR MAISYESLLS
DTGRVPLKLS LADGFIDSSN SLRFHKTSSR KRYERYYRKA LESGYDEVVF LNERQEVTEG
AISNIIILKS RRYVTPSLGS GLLDGIYRRY FLSLHPNASE QVLTLKDLFE ADQVFICNSV
RGLRPVVFDG FVISM