Gene Cag_1741 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1741 
SymboltrpD 
ID3746874 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp2260325 
End bp2261380 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content47% 
IMG OID637774278 
Productanthranilate phosphoribosyltransferase 
Protein accessionYP_380035 
Protein GI78189697 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0547] Anthranilate phosphoribosyltransferase 
TIGRFAM ID[TIGR01245] anthranilate phosphoribosyltransferase 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAATCAA AACAATTGCT TCAAAAGCTG CTTGCAGGCG AGCACTGCTC AAAAGAGGAG 
ATGCAAGACT GCATGAATAG CATTATGGAT GGTGAGTTTT CGGATAGTGT TATTGCGGCT
TTGTTGGCGT TGCTGCAAAA AAAGGGCGTG GTGGCAAATG AGCTTGCGGG AGCACATGCA
AGTTTAATGG CTCATGCAAC CACCGTTGCA TTAAGCACGC ATGCGGTTGA TACATGTGGC
ACAGGTGGCG ACCATGGTGG CACGTACAAT ATTTCAACCA CAGCCTCGCT TATTGCGTGT
AGTGCTGGTG TTCGAGTAGC AAAGCATGGT AACCGTTCGG TAACCAGCAG TTGTGGTAGT
GCCGATGTGC TGGAAGCGCT GGGATTTACG CTTGAGCTTC CACCTGAGGC AACCATTTCG
CTCTTTAAAA AAACAGGTTT TGCCTTCCTT TTTGCCCCCT TGTACCATCC ATCAATGAAG
CGCGTGGCTC ATATTCGTCG CGAACTTGGC ATTCGTACTC TTTTCAACAT GCTTGGACCG
TTGCTTAATC CAGCACAAGT TAAGCGGCAG CTTGTTGGCG TTTTTAGTGA GGAGTTGTCG
GAACTCTACG CAGACGTACT TTTGCAAACA GGGGCACGCC ATGCGCTTAT TGTACATGCA
AGTACCGAAG AGGGTGTTAT ACTTGATGAG CCAAGTTTAA ATGGCACAAC CTTTGTTACA
GAAATTGAGA AAGGCGTTGT GCGCAAACAC ACCCTTCGTC CCGAAGAGTT TGGCATTGCA
CCAGCTCCTC TTGCTGCGCT ACAAGGAGGC GATAAGGAAC ATAATGCCCG AATTATTCAA
AGCATTGCAG ATGGAAGTGC CTCAGCAGCA CAGCGCGATG CAGCACTCTA CTCAAGTGCT
ATGGCATGTT ACGTTGGAGG CAAGTGTGCT TGCTTAAATG ATGGTTTTAT AGTAGCTAAA
GAGGCTTTGG AAAGCGGCAA AACACAAGCT AAACTCAAAG AGATTATTGC CTATAATCAA
GCGTTAGTAA CTGAATACCA TGTGGCAAAA TCTTAA
 
Protein sequence
MESKQLLQKL LAGEHCSKEE MQDCMNSIMD GEFSDSVIAA LLALLQKKGV VANELAGAHA 
SLMAHATTVA LSTHAVDTCG TGGDHGGTYN ISTTASLIAC SAGVRVAKHG NRSVTSSCGS
ADVLEALGFT LELPPEATIS LFKKTGFAFL FAPLYHPSMK RVAHIRRELG IRTLFNMLGP
LLNPAQVKRQ LVGVFSEELS ELYADVLLQT GARHALIVHA STEEGVILDE PSLNGTTFVT
EIEKGVVRKH TLRPEEFGIA PAPLAALQGG DKEHNARIIQ SIADGSASAA QRDAALYSSA
MACYVGGKCA CLNDGFIVAK EALESGKTQA KLKEIIAYNQ ALVTEYHVAK S