Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Clim_1992 |
Symbol | |
ID | 6355496 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium limicola DSM 245 |
Kingdom | Bacteria |
Replicon accession | NC_010803 |
Strand | - |
Start bp | 2210148 |
End bp | 2211983 |
Gene Length | 1836 bp |
Protein Length | 611 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642669590 |
Product | para-aminobenzoate synthase, subunit I |
Protein accession | YP_001944003 |
Protein GI | 189347474 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGCCCC GTTCCTCTCA ATCTCTCTTT TCAAAACCCG GGACACTCTG GTTTGAAACA GCGGGAACCG GCCGAAAGGG AGGAGAGGCG CTGCTTTTTA CTGATCCTGT CGATACCCTT ACCCTTACAT CACGTTCCGG ACTGTACGAT TTTTTTCTGG CAATCGAGAA AAAACGGGAT GCAGGTTTTT TTCTGGCCGG ATGGCTGGGT TACGAGGCCG GTTGCGGATT TGAGCCGTTG GTGACCCGTT CGCCTTCTTC GCCGCCAGGA GCGCCTCTTG CCTGGTTCGG GGTTTATGAG AACCCGCAGT GTTTTGTCGG GTCGGATATC GATGCATTTC TGGCCGGGGC GAGTGAGCCA TGTATGGTCA GCCGGCTTCA GTTTGATTAT GCCGAGGATG ACTATGTGGT GGTTATACGG GCCATAAAAG AGCAGATAGC CGCAGGAAAC GTCTATCAGG TGAACTTTAC CGGCCGCTAC CGGTTCGCTT TCAGCGGCTC GCCGCAAACG CTGTTTTTAA CATTGCGGAG CAGTCAGCCT TCCGCCTATA CGGCTTTTCT CAATACCGGT GACAGGGTTG TGCTTTCGTT GTCGCCGGAG CTGTTTTTCA GGTGCAGCGG AGGAATGATC GAGACGATGC CCATGAAGGG CACCGCCCCG AGAGGGGAGA CTCCCGAAGA GGATGCGCTG ATGAAGCACG GGCTTTCCCG ATGCGAAAAA AACCGGGCAG AGAACCTGAT GATCGTCGAT CTTCTGCGAA ACGATCTTGG CCGAATCTGT CTTCCGGGCT CCGTTCATGC CGGTGAACTG TTCGCTACCG AGACCTATCC GACCCTGCAC CAGATGGTAT CCACGATACG CGGCAGGCTT GCGGAGCATA TTGGGCTTTA CGATATCTTT CGCGCGCTTT TCCCCTCAGG TTCGGTAACC GGCGCACCTA AAATAAGCGC CATGCAGCTT ATCGGCGAAC TTGAGCCGAC GTCAAGAGGA ATCTACACCG GAGCTATCGG TATTGTGAAG CCGGATGGCG ATATGGTTTT CAATGTCGCT ATCCGTACCA TAGAGATTTC CGGTCAAACG GGCACGTACG GTTCTGGCAG CGGGATTGTG TGGGATTCGG ATCCATTGCA GGAGTTTCGG GAGTGCATGC TCAAGGCCAG GATTATCAGT GATGAAGTTC AGGAAATTCC GGAGCTTTTC GAGACACTGC TCTGGGCAGG AAGATATCTC TGGCTTGATG AACATCTCGG GAGGATCCGA ACCTCGGCAG CTGCGCTTGG AGTTTCCTTT CAGGAAAATG AGGCCCGTTA CCGGCTTGAC CGGCTCGATT GCGCACTTGC TGCTTGCGGT GGACGCTTCA AGGTGAGGTT GAGGCTTTCC GGTGAAGGTC GCATTACCGT CGGGCATGAA CCGATCGATG CGACTCCTTC GGAAAAGCCG CTGAAGCTCT GCTCTGCGGC AGAGCGCATT GCCTCGACGG ATTTTCTCCG ATATCACAAA ACCGGTTCGC GGAAACTCTA TGACCGTTTC TACCGCCTGG CGCTCGATCA TGGGTATAAT GAGGTGGTGT TTTTCAATGA ACGGGAAGAG GTTGCTGAAG CGGCAGTCAG CAATATCATA ATCCGCAGTG GAACTCTTTA CTATACACCG CCGGTAACCT CGGGTCTGCT CGATGGTATA TACCGGAGTT ATTTTTTACG CACCCGTTCG GAATGCATCG AAAAAGTGCT TTTCATCGAT GATCTGTTAG CTGCCGACGC CATCTATCTC TGCAATTCGG TCAGGGGAAT GCGCCGGGCG ATATTCGATG GAACGCAACT TACGGGTAAC GGTTGA
|
Protein sequence | MAPRSSQSLF SKPGTLWFET AGTGRKGGEA LLFTDPVDTL TLTSRSGLYD FFLAIEKKRD AGFFLAGWLG YEAGCGFEPL VTRSPSSPPG APLAWFGVYE NPQCFVGSDI DAFLAGASEP CMVSRLQFDY AEDDYVVVIR AIKEQIAAGN VYQVNFTGRY RFAFSGSPQT LFLTLRSSQP SAYTAFLNTG DRVVLSLSPE LFFRCSGGMI ETMPMKGTAP RGETPEEDAL MKHGLSRCEK NRAENLMIVD LLRNDLGRIC LPGSVHAGEL FATETYPTLH QMVSTIRGRL AEHIGLYDIF RALFPSGSVT GAPKISAMQL IGELEPTSRG IYTGAIGIVK PDGDMVFNVA IRTIEISGQT GTYGSGSGIV WDSDPLQEFR ECMLKARIIS DEVQEIPELF ETLLWAGRYL WLDEHLGRIR TSAAALGVSF QENEARYRLD RLDCALAACG GRFKVRLRLS GEGRITVGHE PIDATPSEKP LKLCSAAERI ASTDFLRYHK TGSRKLYDRF YRLALDHGYN EVVFFNEREE VAEAAVSNII IRSGTLYYTP PVTSGLLDGI YRSYFLRTRS ECIEKVLFID DLLAADAIYL CNSVRGMRRA IFDGTQLTGN G
|
| |