Gene Saro_2023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2023 
SymboltrpD 
ID3917344 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2156671 
End bp2157663 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content69% 
IMG OID640444775 
Productanthranilate phosphoribosyltransferase 
Protein accessionYP_497296 
Protein GI87200039 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0547] Anthranilate phosphoribosyltransferase 
TIGRFAM ID[TIGR01245] anthranilate phosphoribosyltransferase 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCTGC TCCCCGATCC CCAGCACCCG CTAGAGGAAG CCGAGGCCGA AGCCGCCTTT 
GCCGCGATTC TCGATGGCGC CGTGGCGGAT GAAGCCATCG CCCGGTTTCT CGTGGGCCTG
TCCGACCGTG GCGAGAACGC CAGCGAGATC GCCGGCGCCG CCCGGGCCAT GCGCGCCCGG
ATGATCCCGA TCAAGGCGCC CGCAAACGCC ATCGACGTCT GCGGCACCGG CGGCGACGGG
CATCACACGC TCAACGTCTC CACCGCCGTC AGCCTCGTCG TCGCCGCCTG CGGCGTGCCC
GTCGCCAAGC ACGGCAACCG CGCCGCCAGT TCCAAGGCCG GCGCCGCCGA TACCCTCGAA
GCCCTGGGCC TCAATCTCGA CCGCGCCGCC GAAACCGCCG AAGAGACGTT GGCCGACCTC
GGCATCTGCT TCCTCTTCGC CGCGCGTCAT CACCCGTCGA TGGGCCGTAT CATGCCCATC
CGCAAGGCGC TCGGCCGCCG CACCATCTTC AACCTGATGG GGCCGCTCGC CAATCCCGCC
AACGTGCGCC GCCAGCTCGT CGGCATCGCG CGTCCGGCCT ATGTCCCGAT CTATGCCGAA
GCCATCCTGC GCCTCGGCAC CGATCACAGC TTCGTCATTT CCGGCGATGA GGGGCTCGAC
GAACTGAGCC TTGCCGGCGG CAACGAACTG GCCGAAGTGC GCGACGGCGA AATCTCCATG
CGCCGCGTAA CGCCTGCGGA CGCCGGCCTG CCCGAAAGCG CGGTCACCGC GATCCGTGGC
GGCGACGCGG CCCATAACGC CCGCGCCCTG CGCGCCCTCC TCGAAGGCGA GCACGGTCCC
TACCGCAACG CCGTGCTCTT CAACGCCGCC GCCGCGCTCA TCATCGCGGG CGAGGCGCAG
GACTGGCACG AAGGCGTCGA GGAAGCAGCC GAAGCCATCG ACAAGGGCCT TGCCAACGCC
CTTCTCAACT GCTGGATCGC CGCTCTCGAA TAG
 
Protein sequence
MTLLPDPQHP LEEAEAEAAF AAILDGAVAD EAIARFLVGL SDRGENASEI AGAARAMRAR 
MIPIKAPANA IDVCGTGGDG HHTLNVSTAV SLVVAACGVP VAKHGNRAAS SKAGAADTLE
ALGLNLDRAA ETAEETLADL GICFLFAARH HPSMGRIMPI RKALGRRTIF NLMGPLANPA
NVRRQLVGIA RPAYVPIYAE AILRLGTDHS FVISGDEGLD ELSLAGGNEL AEVRDGEISM
RRVTPADAGL PESAVTAIRG GDAAHNARAL RALLEGEHGP YRNAVLFNAA AALIIAGEAQ
DWHEGVEEAA EAIDKGLANA LLNCWIAALE