Gene Cagg_2768 details

Gene Information

Locus tag: Cagg_2768
Symbol: trpD
ID: 7269838
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 3400585
End bp: 3401607
Gene Length: 1023 bp
Protein Length: 340 aa
Translation table: 11
GC content: 60%
IMG OID: 643567589
Product: anthranilate phosphoribosyltransferase
Protein accession: YP_002464067
Protein GI: 219849634
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0547] Anthranilate phosphoribosyltransferase
TIGRFAM ID: [TIGR01245] anthranilate phosphoribosyltransferase
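
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 3400585
end_bp = 3401607

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is the stop codon,
# which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the results agree with the Gene Length and Protein Length fields.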


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAATATTC GCGAGGCGAT TGCCGCGGTG GTGGCGCGGC GCGATCTCAC GCAGGCCGAA 
GCGGCGAGCG TGATGGAAGA GATTATGAAT GGGACGGCAA CGCCTGCTCA GATCGCGGCT
TTTCTTACGG CGTTGCATAT CAAGGGTGAG ACCGACGCTG AAATCGCCGG TATGGCGGCG
GTGATGCGTG AGAAGGCGAC CCATGTCTAT TTCGACGGGC CGGTTATCGA TACCTGCGGT
ACCGGTGGTG ATGGTGCGCA TACGTTTAAC ATCAGCACTA CCGCTGCATT TGTCGCTGCC
GGTGCCGGTT TGACGGTGGC GAAGCACGGG AATCGGGCGA TGTCGAGTGT GTGTGGGAGT
GCCGATGTCC TGGAAGGCCT GGGCGTTCAG ATTGAACTTG ACGCTGAAGG TGTTGCCCGT
TGTTTACGTG ATGCCGGCAT CGGCTTTATG TTCGCACCCA AGTTTCATCC GGCGATGCGT
TTTGCCGGGC CGGTGCGGCG CGAGATTGGT ATTCGCACCG TGTTTAACAT TCTCGGCCCG
CTGACCAACC CGGCACGAGC ACGTTATCAG GTATTGGGAG TGGCGAGCGC GGCGCTGGCC
GAAAAGCTGG CTAACGCCCT CAGCCGGCTC GACACCGTCC ATGCGCTGGT CGTGCATGGC
GACGGCGGGG TTGATGAATT GACCCTCTCC GGTCCAAACC TGATCTTCGA GGTACGGGCC
GGGCATGCGC TGCGTCAAAT GATAGTTGCT CCTGAAGATG TTGGCCTGGA ACGGGCTCCA
CGTGAAGCAT TGCGGGGGGG TGATGTTGCC TACAATGTGG CATTGGTACG CGCCATCCTT
AGTGGTGAAG ATCGTGGGCC GCGGCGCGAT GTCGTCTTGT TGAATGCGGC TGCTGCTCTC
GTTGCCGGTG ATGTGGCGCC GGATCTGGCG ACCGGGGTTA AGATGGCGCG TGCGAGTATT
GATAGTGGAC GTGCGCTCGA ACGGCTGCAT CGGATGATCG CGGTGAGTCG GGGTGAGGCG
TAA
 
Protein sequence
MNIREAIAAV VARRDLTQAE AASVMEEIMN GTATPAQIAA FLTALHIKGE TDAEIAGMAA 
VMREKATHVY FDGPVIDTCG TGGDGAHTFN ISTTAAFVAA GAGLTVAKHG NRAMSSVCGS
ADVLEGLGVQ IELDAEGVAR CLRDAGIGFM FAPKFHPAMR FAGPVRREIG IRTVFNILGP
LTNPARARYQ VLGVASAALA EKLANALSRL DTVHALVVHG DGGVDELTLS GPNLIFEVRA
GHALRQMIVA PEDVGLERAP REALRGGDVA YNVALVRAIL SGEDRGPRRD VVLLNAAAAL
VAGDVAPDLA TGVKMARASI DSGRALERLH RMIAVSRGEA
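
The relationship between the two sequences above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own method: the helper names `translate` and `gc_fraction` are hypothetical, and the codon table is the standard genetic code, whose codon-to-amino-acid assignments are the same ones used by translation table 11 (table 11 differs only in which codons may act as start codons). Whitespace between the 10-base blocks is ignored.

```python
from itertools import product

# Standard codon table in TCAG order; table 11 uses the same
# codon-to-amino-acid assignments ('*' marks a stop codon).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string."""
    seq = "".join(dna.split()).upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "ATGAATATTC GCGAGGCGAT TGCCGCGGTG"
print(translate(first_blocks))
```

Pasting the full gene sequence into `translate` should reproduce the listed protein sequence, and `gc_fraction` on the same input gives a value that can be compared with the GC content field.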