Gene Cag_0422 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_0422 
Symbol 
ID3747688 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp495126 
End bp496898 
Gene Length1773 bp 
Protein Length590 aa 
Translation table11 
GC content50% 
IMG OID637772952 
ProductPara-aminobenzoate synthase, component I 
Protein accessionYP_378738 
Protein GI78188400 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase
[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.744844 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGTTTC ATAACCCACA TGAAGTGTTG ATGCTGAATG CGTTCGATGG AGTAGAAGAT 
TTTTTTAAGA AGATAGAAGA GAGGGTTGCG GCAGGATTTT TTGTGGCGGG ATGGTTAAGC
TATGAAGCTG CCTACGGCAT GGATAGCGCA CTTGCCGAAA TGGCAACGGC GCAAACGTGG
CAAGCGCCAC TTGCGTGGTT TGGGGTATAT AAAGCACCAC AACGCTTTAC GGCTGATGAG
GTGGCACAAC TCTTTCCACC ATCCTTAACG ACTGCCATTA CAGCACCGCA TTGTTCTACC
ACTGAGATTG ACCATGCCGA GCAAGTTGCC GCTATTCGAG AAGAGATTGC GGCTGGCAAG
GTGTATCAAG TCAACTTGAC TGCTCGCTAT CACTTTAGCA TGGCGGGAGA GGCACCTGCG
CTTTTTGCAG CGCTGCGCCA ACAGCAACCC GCCTCCTACA CGGCATTTCT TAATTGCGGA
GAACGCACCA TCCTCTCCTT TTCACCTGAA CTTTTTTTCC GAACTGATGG CTGCGCCATT
GAAACGCGCC CCATGAAAGG CACTGCACCT CGTGGCAGTT CAGCGGAAGA AGATGCCCAT
TTGCGCTTGC AGCTTCAGCA ATGCGAAAAA AATTGTGCCG AAAACTTGAT GATTGTGGAC
TTGCTCCGCA ACGATTTAGG GAGAATTTGT ACCCCCGCCA CCATTAAAGC TACGAAGCTT
TTTGCTACCG AAAGCTGGCC TACGCTTCAC CAAATGATCT CCACAATTTC GGGTGAACTG
CGCAATAACG TCAGTTTATA CGAACTTTTT CAAGCGCTCT ACCCCTGCGG CTCCATTACA
GGAGCACCAA AAATAAGCGC CATGCAGTTA ATTCAGCAGC TTGAACAATC GCCACGCGGC
ATTTATACAG GCGCTATTGG CTACATAACG CCGCCATCAG CTCAAGTATC TGCACAAACC
ATGCGCTTTA GCGTAGCAAT CCGCACCCTT GAGCTGCAAG GGCAGCACGG CATCTATGGC
TCTGGTGGAG GTATTGTGTG GGATTCCGTT GCGGCTGATG AGTATTGCGA ATGCCAACTC
AAAACTAAAA TTCTTGAGAG CATTGCCGCC CCACCATTTG AACTGTTTGA AACCATGCTG
TGGCATGATG GATGCTACCT CTGGCTTAAT GAACACCTCA ATCGCCTTGC GAACTCAGCC
AAAGCACTTG GCTTTGCATT TGAACGTCAA GCAACATTGC AGCAACTTCT TGCCTTTGAA
GTGGAACTGC AACAGTCCCC AAAAAAACGC TGTAAAGTAA AACTCACCCT TTTTCGCAAT
GGTGAAGTAC AGCTTGATGC CGAAGCCGTT TCGCCTGACT TATCAGGGCG CTTGATGCTT
GTAACGCTTG CAGAAAAGCC TGTTTCGAGC AATGAAGAGG CGTGGCTTCA GCACAAAACA
ACCTTGCGCC ACTCGTATGA CAGCGCATTT GCCGCTGCCC GTGCGGCTGG CTACGACGAG
GTTATTTTTT GCAACCAACG TGGCGAAATT ACGGAAGGCG CAATCAGCTC AATTATGGTG
CGGCACGGCT CCCAACTTCT TACGCCATCA CTTGCTTGTG GGCTGCTCAA CAGCATTAGC
CGTCGCTACC TGCTTGCCAC CCGCCCAAAT TTGCGTGAAG CCACTCTGTA CCCCAATGAC
CTTGTTACTG CCGACATGCT CTATATTGCA AACTCCGTGC GCGGCATTCG CCCAGCAGTA
ATGGAGCAAG AAATGAAACG TATAGAAAAA TAA
 
Protein sequence
MLFHNPHEVL MLNAFDGVED FFKKIEERVA AGFFVAGWLS YEAAYGMDSA LAEMATAQTW 
QAPLAWFGVY KAPQRFTADE VAQLFPPSLT TAITAPHCST TEIDHAEQVA AIREEIAAGK
VYQVNLTARY HFSMAGEAPA LFAALRQQQP ASYTAFLNCG ERTILSFSPE LFFRTDGCAI
ETRPMKGTAP RGSSAEEDAH LRLQLQQCEK NCAENLMIVD LLRNDLGRIC TPATIKATKL
FATESWPTLH QMISTISGEL RNNVSLYELF QALYPCGSIT GAPKISAMQL IQQLEQSPRG
IYTGAIGYIT PPSAQVSAQT MRFSVAIRTL ELQGQHGIYG SGGGIVWDSV AADEYCECQL
KTKILESIAA PPFELFETML WHDGCYLWLN EHLNRLANSA KALGFAFERQ ATLQQLLAFE
VELQQSPKKR CKVKLTLFRN GEVQLDAEAV SPDLSGRLML VTLAEKPVSS NEEAWLQHKT
TLRHSYDSAF AAARAAGYDE VIFCNQRGEI TEGAISSIMV RHGSQLLTPS LACGLLNSIS
RRYLLATRPN LREATLYPND LVTADMLYIA NSVRGIRPAV MEQEMKRIEK