Gene Arth_2208 details


Gene Information

Locus tag: Arth_2208
Symbol: trpD
ID: 4445269
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 2484310
End bp: 2485368
Gene Length: 1059 bp
Protein Length: 352 aa
Translation table: 11
GC content: 67%
IMG OID: 639690017
Product: anthranilate phosphoribosyltransferase
Protein accession: YP_831688
Protein GI: 116670755
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0547] Anthranilate phosphoribosyltransferase
TIGRFAM ID: [TIGR01245] anthranilate phosphoribosyltransferase
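
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count of an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2484310, 2485368)
print(length_bp)                  # 1059
print(protein_length(length_bp))  # 352
```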


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00486869
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGACTTCAC AGGCATCTGC ACAGGCGGCG GGCAACACCT GGCCGCGGCT CATTTCGGCA 
CTGATCAACG GCGCGGACCT GACGGCCGAC AATACGGAAT GGGCGATGGA CACCATCATG
TCCGGGGAGG CCACCCCTTC GCAGATCGCC GGGTTCCTGG TGGCGCTGCG GTCAAAGGGC
GAGACGGTGG ACGAGATAGC CGGGCTTGTC GAGGCCATGC TGGCGCACGC GAACCCTGTC
GACATCGCCG GCGAGAAGCT GGATATTGTG GGCACCGGCG GTGACCAGCT GAACACCGTC
AACATTTCCA CCATGGCCGC CCTCGTCTGT GCGGGAGCGG GCGCAAAGGT GGTTAAGCAT
GGCAACCGTG CGTCGTCGTC GTCGTCCGGG TCGGCGGACG TGCTGGAAGC GCTCGGCGTC
CGGCTGGACC TGCCCATCGA ACGCGTTGCG CGCAACGCAG AAGAAGCGGG GATCACGTTC
TGCTTCGCCC AGGTGTTCCA CCCCTCCTTC CGGCACACTG CCGTCCCGCG CCGGGAGCTT
GCCATACCCA CGGCGTTCAA CTTCCTCGGC CCGCTGACCA ACCCCGCGCG GGTTCAGGCC
TCGGCGGTTG GTGTCGCCAA CGAAAGGATG GCCCCCTTGG TTGCCGGGGT CCTGGCCAAG
CGCGGAAGCC GCGGCCTGGT TTTCCGGGGG AGTGACGGAC TGGACGAACT GACAACCACC
GGACCGTCCA CTGTCTGGGA GATCCGGAAC GGCGAAGTGA CGGAACAGAC GTTTGATCCC
CAGGCGCTGG GCATCCGTGC CGCAACAGTG GAGGAGCTCC GCGGCGGCGA CGCGACAGCC
AATGCCGCCG TCGTCCGTGA TGTCCTGAGC GGCGTGGCGG GCCCGGCCCG TGACGCCGTC
CTTCTGAATG CCGCCGCGGG CCTGGTGGCA TTCGACGTCG ATGCCGAGGG CACGCTCACG
GACCGGATGG CGGCCGCGTT GAAGCGAGCG GAAGAGTCCA TTGACTCCGG TGCCGCGGCG
GCCGTGCTGG AAAAGTGGGT CGCCCTTACC CGGGGCTAG
 
Protein sequence
MTSQASAQAA GNTWPRLISA LINGADLTAD NTEWAMDTIM SGEATPSQIA GFLVALRSKG 
ETVDEIAGLV EAMLAHANPV DIAGEKLDIV GTGGDQLNTV NISTMAALVC AGAGAKVVKH
GNRASSSSSG SADVLEALGV RLDLPIERVA RNAEEAGITF CFAQVFHPSF RHTAVPRREL
AIPTAFNFLG PLTNPARVQA SAVGVANERM APLVAGVLAK RGSRGLVFRG SDGLDELTTT
GPSTVWEIRN GEVTEQTFDP QALGIRAATV EELRGGDATA NAAVVRDVLS GVAGPARDAV
LLNAAAGLVA FDVDAEGTLT DRMAAALKRA EESIDSGAAA AVLEKWVALT RG
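
The gene and protein sequences can be cross-checked against each other and against the GC content field. The following is a minimal sketch, not the database's own method: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the start-codon set is a simplification that covers only ATG, GTG and TTG. It relies on translation table 11 assigning the same amino acids to codons as the standard code and reading an initiating GTG as methionine, which is why the protein begins with M although the gene begins with GTG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; table 11
# uses the same codon-to-amino-acid assignments).
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Simplified set of initiation codons (assumption: the common bacterial ones).
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(raw: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(raw)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(raw: str) -> str:
    """Translate a CDS, reading an initiating start codon as M."""
    seq = clean(raw)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            aa = "M"
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = ("GTGACTTCAC AGGCATCTGC ACAGGCGGCG "
              "GGCAACACCT GGCCGCGGCT CATTTCGGCA")
print(translate(first_line))  # MTSQASAQAAGNTWPRLISA
print(round(gc_content(first_line), 2))
```

Passing the full gene sequence to `translate` and `gc_content` allows the result to be compared with the protein sequence and the GC content listed in the record.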