Gene Cphamn1_0603 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_0603 
SymboltrpD 
ID6374267 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp635498 
End bp636535 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content52% 
IMG OID642683116 
Productanthranilate phosphoribosyltransferase 
Protein accessionYP_001959043 
Protein GI189499573 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0547] Anthranilate phosphoribosyltransferase 
TIGRFAM ID[TIGR01245] anthranilate phosphoribosyltransferase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.421903 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0114034 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGTACA AGGAGTTACT TCACAAGCTG CTGACCGGCA CCGATCTTTC AGGCAAAGAG 
ATGGAAGAGT GTTTCTCGGG TATCATGCTG GGAGAGTACC CGGACAGTGT CATAGCGGCA
ATTCTCGCCT TGCTTCAGAA AAAAGGAGTA ACCCCCGAAG AAGTTGCCGG AGCGTATTTC
GCAATTATCT CAAAAGCCCT TCCGGTTCAG CTTGGTGATA ATGCCGTCGA TACGTGCGGT
ACCGGAGGTG ATCAGGCAGG CACCTTCAAC ATCTCAACGG TAGCGGCAAT TATTGCAAAC
GGCGCAGGAG TACCGATAGC CAAACATGGC AACAGGTCTG TGACGAGCCG GTGCGGCAGC
GCCGATGTGC TTGAACAGCT TGGCTACCGA ATTCTTCTTC CTCCTGACAA AACCGAAATG
CTTTTCCGTG AGACCGGATT CGCCTTTCTT TTCGCGCCCC TCTATCATCC GGCGATGAAA
GCTGTCGCGC ATATACGCAG AGAACTCGGC ATAAAAACCA TTTTCAACAT GCTGGGGCCT
CTGGTCAATC CGGCTAAAGT GCACAGGCAG GTGGTAGGCG TGTTCGATAT GCGCGTCATG
GAAATCTACG CTCAGTCACT CATCAGGACA GGATGCAGCC ATGCCCTTGT CGTTCACGGC
AAAACCGAAA ATGGGGACGG ACTTGATGAA GCAAGCATAT GCGGTCCGAC CCGTATTGTA
GAAATTCAGA ACGGAGAAAT CACCTGTCAC GACGTAGAAC CTGAAACCTT CAGCCTGTCA
CGGTGTACCA TAGCCGAACT TCAAGGAGGC GACAGCAGCC GGAATGCAGA CATACTTCTC
AGGATTCTCG ACGGAAGCGC AACAAAAGCC CAGACAGATG CCGCGCTATT CAGTGCGGCT
ATGGCATGTT ACGTATCCGG TAGAGCAACA TGCATTGACG ACGGCCTGAG CAAAGCAAAA
GGCTCTCTGG AAAGCGGAAA CGCCTCGAAA CAATTCTCAC GCATCCTTGC CCTCAATGCA
GAACTTGCCG GCAAATAG
 
Protein sequence
MQYKELLHKL LTGTDLSGKE MEECFSGIML GEYPDSVIAA ILALLQKKGV TPEEVAGAYF 
AIISKALPVQ LGDNAVDTCG TGGDQAGTFN ISTVAAIIAN GAGVPIAKHG NRSVTSRCGS
ADVLEQLGYR ILLPPDKTEM LFRETGFAFL FAPLYHPAMK AVAHIRRELG IKTIFNMLGP
LVNPAKVHRQ VVGVFDMRVM EIYAQSLIRT GCSHALVVHG KTENGDGLDE ASICGPTRIV
EIQNGEITCH DVEPETFSLS RCTIAELQGG DSSRNADILL RILDGSATKA QTDAALFSAA
MACYVSGRAT CIDDGLSKAK GSLESGNASK QFSRILALNA ELAGK