Gene Clim_1992 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_1992 
Symbol 
ID6355496 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp2210148 
End bp2211983 
Gene Length1836 bp 
Protein Length611 aa 
Translation table11 
GC content54% 
IMG OID642669590 
Productpara-aminobenzoate synthase, subunit I 
Protein accessionYP_001944003 
Protein GI189347474 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase
[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGCCCC GTTCCTCTCA ATCTCTCTTT TCAAAACCCG GGACACTCTG GTTTGAAACA 
GCGGGAACCG GCCGAAAGGG AGGAGAGGCG CTGCTTTTTA CTGATCCTGT CGATACCCTT
ACCCTTACAT CACGTTCCGG ACTGTACGAT TTTTTTCTGG CAATCGAGAA AAAACGGGAT
GCAGGTTTTT TTCTGGCCGG ATGGCTGGGT TACGAGGCCG GTTGCGGATT TGAGCCGTTG
GTGACCCGTT CGCCTTCTTC GCCGCCAGGA GCGCCTCTTG CCTGGTTCGG GGTTTATGAG
AACCCGCAGT GTTTTGTCGG GTCGGATATC GATGCATTTC TGGCCGGGGC GAGTGAGCCA
TGTATGGTCA GCCGGCTTCA GTTTGATTAT GCCGAGGATG ACTATGTGGT GGTTATACGG
GCCATAAAAG AGCAGATAGC CGCAGGAAAC GTCTATCAGG TGAACTTTAC CGGCCGCTAC
CGGTTCGCTT TCAGCGGCTC GCCGCAAACG CTGTTTTTAA CATTGCGGAG CAGTCAGCCT
TCCGCCTATA CGGCTTTTCT CAATACCGGT GACAGGGTTG TGCTTTCGTT GTCGCCGGAG
CTGTTTTTCA GGTGCAGCGG AGGAATGATC GAGACGATGC CCATGAAGGG CACCGCCCCG
AGAGGGGAGA CTCCCGAAGA GGATGCGCTG ATGAAGCACG GGCTTTCCCG ATGCGAAAAA
AACCGGGCAG AGAACCTGAT GATCGTCGAT CTTCTGCGAA ACGATCTTGG CCGAATCTGT
CTTCCGGGCT CCGTTCATGC CGGTGAACTG TTCGCTACCG AGACCTATCC GACCCTGCAC
CAGATGGTAT CCACGATACG CGGCAGGCTT GCGGAGCATA TTGGGCTTTA CGATATCTTT
CGCGCGCTTT TCCCCTCAGG TTCGGTAACC GGCGCACCTA AAATAAGCGC CATGCAGCTT
ATCGGCGAAC TTGAGCCGAC GTCAAGAGGA ATCTACACCG GAGCTATCGG TATTGTGAAG
CCGGATGGCG ATATGGTTTT CAATGTCGCT ATCCGTACCA TAGAGATTTC CGGTCAAACG
GGCACGTACG GTTCTGGCAG CGGGATTGTG TGGGATTCGG ATCCATTGCA GGAGTTTCGG
GAGTGCATGC TCAAGGCCAG GATTATCAGT GATGAAGTTC AGGAAATTCC GGAGCTTTTC
GAGACACTGC TCTGGGCAGG AAGATATCTC TGGCTTGATG AACATCTCGG GAGGATCCGA
ACCTCGGCAG CTGCGCTTGG AGTTTCCTTT CAGGAAAATG AGGCCCGTTA CCGGCTTGAC
CGGCTCGATT GCGCACTTGC TGCTTGCGGT GGACGCTTCA AGGTGAGGTT GAGGCTTTCC
GGTGAAGGTC GCATTACCGT CGGGCATGAA CCGATCGATG CGACTCCTTC GGAAAAGCCG
CTGAAGCTCT GCTCTGCGGC AGAGCGCATT GCCTCGACGG ATTTTCTCCG ATATCACAAA
ACCGGTTCGC GGAAACTCTA TGACCGTTTC TACCGCCTGG CGCTCGATCA TGGGTATAAT
GAGGTGGTGT TTTTCAATGA ACGGGAAGAG GTTGCTGAAG CGGCAGTCAG CAATATCATA
ATCCGCAGTG GAACTCTTTA CTATACACCG CCGGTAACCT CGGGTCTGCT CGATGGTATA
TACCGGAGTT ATTTTTTACG CACCCGTTCG GAATGCATCG AAAAAGTGCT TTTCATCGAT
GATCTGTTAG CTGCCGACGC CATCTATCTC TGCAATTCGG TCAGGGGAAT GCGCCGGGCG
ATATTCGATG GAACGCAACT TACGGGTAAC GGTTGA
 
Protein sequence
MAPRSSQSLF SKPGTLWFET AGTGRKGGEA LLFTDPVDTL TLTSRSGLYD FFLAIEKKRD 
AGFFLAGWLG YEAGCGFEPL VTRSPSSPPG APLAWFGVYE NPQCFVGSDI DAFLAGASEP
CMVSRLQFDY AEDDYVVVIR AIKEQIAAGN VYQVNFTGRY RFAFSGSPQT LFLTLRSSQP
SAYTAFLNTG DRVVLSLSPE LFFRCSGGMI ETMPMKGTAP RGETPEEDAL MKHGLSRCEK
NRAENLMIVD LLRNDLGRIC LPGSVHAGEL FATETYPTLH QMVSTIRGRL AEHIGLYDIF
RALFPSGSVT GAPKISAMQL IGELEPTSRG IYTGAIGIVK PDGDMVFNVA IRTIEISGQT
GTYGSGSGIV WDSDPLQEFR ECMLKARIIS DEVQEIPELF ETLLWAGRYL WLDEHLGRIR
TSAAALGVSF QENEARYRLD RLDCALAACG GRFKVRLRLS GEGRITVGHE PIDATPSEKP
LKLCSAAERI ASTDFLRYHK TGSRKLYDRF YRLALDHGYN EVVFFNEREE VAEAAVSNII
IRSGTLYYTP PVTSGLLDGI YRSYFLRTRS ECIEKVLFID DLLAADAIYL CNSVRGMRRA
IFDGTQLTGN G