Gene Clim_0631 details

Gene Information

Locus tag: Clim_0631
Symbol: trpD
ID: 6354079
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 709041
End bp: 710096
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 53%
IMG OID: 642668262
Product: anthranilate phosphoribosyltransferase
Protein accession: YP_001942697
Protein GI: 189346168
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0547] Anthranilate phosphoribosyltransferase
TIGRFAM ID: [TIGR01245] anthranilate phosphoribosyltransferase
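
The gene and protein lengths in this record follow from the start and end coordinates. The Python sketch below shows that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Coordinates taken from this record.
length_bp = gene_length(709041, 710096)
print(length_bp)                  # 1056
print(protein_length(length_bp))  # 351
```

Both results agree with the Gene Length and Protein Length fields above.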


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGCCACA AGGATTTTCT TCACAAGCTG CTCTCCGGAG ACCATTTTTC GCAGGAGGAG 
ATGACTCAGT GCATGAACGC CATCATGAAC GGGGTTTTTC CTGATACCGT TATTGCCGCT
CTGCTCGCTC TTCTGGAGCA CAAAGGGGTA ACCTCCACAG AAGTTGCCGG AGCGTATTAC
AGTCTTATCG CAAAGGCCAA CACCATCGAT CTCTCTCCCG ATGCCGTCGA TACCTGCGGT
ACCGGCGGCG ATCATGCCGG CACCTACAAT ATTTCCACTA TCGGCTCAAT CATCGCCAAC
AGCACCGGCG TTTCCATTGC CAAACACGGA AACCGTTCGG TTACAAGCAG CTGCGGCAGC
GCCGACGTGC TCGAGGAGCT TGGATTCCGT ATCGACCTGC CTGTGGAAGC CACCGTGGAA
CTCTATGCCC GGACAGGGTT TTCATTCCTC TTCGCCCCGC TCTTCCACCC GTCGATGAAA
CGGGTTGCCC ATATACGCAA GGAACTTGGC ATAAGAACCA TATTCAACAT GCTCGGCCCT
CTTATCAACC CTGCGCGATC AAAAAGGCAG CTTGTCGGTG TTTACAGCAG CGAGCTCATG
GAACTCTATA CCGAAGTGCT CCTGCAGACC GGTACACGCC ACGCCATGAT TGTGCATGCG
ATGACTGAAG AAGGCGTCTC CCTCGATGAA CCAAGTCTTA ACGGACCGAC CTATATTGTT
GAAATCCAGA ACGGATATGT CTGTCGGCAT ACAGTCTATC CGGAGGATTT CGGTCTCGAC
AGACATCCGC TTTCGGCCAT TCAGGGAGGA GAGCGAAAGC AGAATGCCGC TATCATCAGA
AGCATTCTCG ATGGCAGCGC TTCACCGGCG CAGATCGATG CAGCTCTCTA TACCTCGGCA
ATGGCCTGTT ACGTATCCGG ACATGCAAGG TGCATCGATG ACGGCCTCAC CATATCAAGA
GAATCGCTTG AAAGCGGCGA TACCGACAGA AAATTCAGGG AGATTCTTGA CTTTAACGCA
GAACTTTCTG CCCGTTACAG GGAAGCGGTG AACTAA
 
Protein sequence
MGHKDFLHKL LSGDHFSQEE MTQCMNAIMN GVFPDTVIAA LLALLEHKGV TSTEVAGAYY 
SLIAKANTID LSPDAVDTCG TGGDHAGTYN ISTIGSIIAN STGVSIAKHG NRSVTSSCGS
ADVLEELGFR IDLPVEATVE LYARTGFSFL FAPLFHPSMK RVAHIRKELG IRTIFNMLGP
LINPARSKRQ LVGVYSSELM ELYTEVLLQT GTRHAMIVHA MTEEGVSLDE PSLNGPTYIV
EIQNGYVCRH TVYPEDFGLD RHPLSAIQGG ERKQNAAIIR SILDGSASPA QIDAALYTSA
MACYVSGHAR CIDDGLTISR ESLESGDTDR KFREILDFNA ELSARYREAV N
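
The sequence blocks above can be checked against the Gene Length, GC content and Protein Length fields with a short script. The Python sketch below strips the 10-character block spacing, computes the GC percentage and translates the CDS. It rests on two assumptions: the listed sequence is the coding strand, and translation table 11 assigns the same amino acids as the standard code (the two differ only in which codons may act as start codons). All function names are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(blocks: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(blocks.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First five codons of the gene sequence above, plus its stop codon.
example = clean_sequence("ATGGGCCACA AGGAT TAA")
print(translate(example))  # MGHKD
```

To run the check on this record, paste the full gene sequence into `clean_sequence`, then compare `len(...)`, `gc_percent(...)` and `translate(...)` with the values listed under Gene Information and with the protein sequence. Because the gene starts with ATG, no special handling of the alternative start codons of table 11 is needed here.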