Gene Cphamn1_0490 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_0490 
Symbol 
ID6374154 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp511503 
End bp513326 
Gene Length1824 bp 
Protein Length607 aa 
Translation table11 
GC content50% 
IMG OID642683007 
Productpara-aminobenzoate synthase, subunit I 
Protein accessionYP_001958934 
Protein GI189499464 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase
[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.104803 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAGATA GCTTCGCAGC CAATACGATG TTGGATGAAC CCTTCACGGT TCTGTTTGGA 
GGTTCCTTCA GGAAAGATGA TCCCGGGGGG TATCTTCTGT TCTCAGACCC GGTGGATACT
ATCGCGCTCA CCTCTCCTGA TGACCTTCGC TCATTTTTCG GAAAGCTTGA AGCATTTCTT
GCTGAAGGGT TCAGTCTGGC AGGGTATGTC GGTTATGAGG CCGGCTACGG TTTTGAGCCT
GAATCTTTTT CTTCCGAAAG CGCAGCGAAG GCGGTTGTTC CGCTTGCCTG GTTCGGGGCC
TATCGCTCTG CTGAGAGATT GTCGGGGAAC GGGGCCGAAG GTCTATTTTC CGGGCAGTGC
AGGCCGGGAG CGCTTCAGTT TGACATGACG CAGGGGGAGT ATGCGGAAAA GATTGACGAG
ATAAAGAAAC ATATTGCGGC CGGGGATGTC TATCAGGTTA ACTTTACGGG AAGGTATCGT
TTTGATTTTG GAGGAAGTGC CTCCTCGTTG TTCCGGTATC TCTCTTCCAG GCAGCCTGGA
GTCTACTCAG CGTGGATGAA CCTTGGTGAG CATCAGCTGG CGTCGTTTTC TCCTGAACTG
TTTTTCAGGA TGGAGGGGAA TGGTATTGAA ACCAGACCGA TGAAAGGAAC CGCACCGAGA
GGTGGCAGTG AGCATGAAGA CAGGTTGTTC AGGGAGTGGT TGGGATCCAA TGAGAAAAAC
AGGGCCGAAA ATCTCATGAT TGTGGATCTT CTTCGCAATG ATCTCGGGCG AATATGCAAA
CCGGGTTCAG TTAACGTTCC TGAACTCTTT TCCGTCGAGA CCTACCCGAC ACTGCATCAG
ATGGTATCCT CCGTTCGCGG AGAGGTACAG GACGACATTT CACTCTATGA ACTTTTCCGT
GCGGTATTTC CCTGCGGTTC CGTGACAGGG GCTCCGAAAA TAAGAGCAAT GCAGCTGATT
CAGGAGCTTG AGCGTTCACC AAGAGGGGTC TACACCGGTG CAGCAGGCTA TATGCTTCCT
GACAGGTCAA TGTGTTTCAA TGTGGCAATC AGGACAGCGA TGTTGTGTGG TCATACCGGA
GAATACGGGG CCGGAGGAGG TATTGTATGG GATTCGAATA CCGGGGAGGA GTACAACGAG
TGCAGATTGA AAGCGAAAAT CCTCAAACCA GGCAAGGCTG AAAATTTCGG CATTTTTGAA
ACCATATTGT ATAACGGATC TTTTGTCTGG CTTGACGAGC ATCTTTTCCG TCTCAGCGAG
TCGGCAAGGT GTCTGGGCTT TTCGTGTGAC CTGGAGAGGA TCAGGCGCGA ACTGGAACGT
CTGACTGATG AAGAGCTCAG GGGGAGGGGT AGATATAAGG TTCGTCTTGA ACTTCATCCT
GAAGGTACTT TTCAGATAAC TGTTGATGAC CTTTCTGAGA GCCCCTCATC TGATCCGGTT
TCTGTTTGCA GAGCAGGAGT ATCCCTGCCC TCAGACGGTC ATCTCAGAAT GCATAAAACA
ACAAGGAGGG AGCTCTACGA TAAGTTGTTG CGAAAAGCAA AGAAAAGGGG TTACGATGAA
CTGCTGTTCT GCAATGACAG AGGGGAGGTT GCCGAAGGAG CGATAAGCAA CATCATTATC
TGTTCTGACG GGCACTATGT TACACCGGGC CTTTCTTCCG GACTGCTTAA CGGCATCTAT
CGGCAATATT TTCTTTCAAC CCGCATGAAT GTTCAGGAAG CGATACTCAC TATGCATGAT
ATAGAACAGG CCGATCTCCT GTTTGTCTGT AATTCATTGA GAGGGTTGAG AAGAGCGGTT
CTTTTCGATG AGGTGGTGTC ATGA
 
Protein sequence
MRDSFAANTM LDEPFTVLFG GSFRKDDPGG YLLFSDPVDT IALTSPDDLR SFFGKLEAFL 
AEGFSLAGYV GYEAGYGFEP ESFSSESAAK AVVPLAWFGA YRSAERLSGN GAEGLFSGQC
RPGALQFDMT QGEYAEKIDE IKKHIAAGDV YQVNFTGRYR FDFGGSASSL FRYLSSRQPG
VYSAWMNLGE HQLASFSPEL FFRMEGNGIE TRPMKGTAPR GGSEHEDRLF REWLGSNEKN
RAENLMIVDL LRNDLGRICK PGSVNVPELF SVETYPTLHQ MVSSVRGEVQ DDISLYELFR
AVFPCGSVTG APKIRAMQLI QELERSPRGV YTGAAGYMLP DRSMCFNVAI RTAMLCGHTG
EYGAGGGIVW DSNTGEEYNE CRLKAKILKP GKAENFGIFE TILYNGSFVW LDEHLFRLSE
SARCLGFSCD LERIRRELER LTDEELRGRG RYKVRLELHP EGTFQITVDD LSESPSSDPV
SVCRAGVSLP SDGHLRMHKT TRRELYDKLL RKAKKRGYDE LLFCNDRGEV AEGAISNIII
CSDGHYVTPG LSSGLLNGIY RQYFLSTRMN VQEAILTMHD IEQADLLFVC NSLRGLRRAV
LFDEVVS