Gene Smed_1408 details

Gene Information

Locus tag: Smed_1408
Symbol: trpD
ID: 5322259
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 1489869
End bp: 1490882
Gene Length: 1014 bp
Protein Length: 337 aa
Translation table: 11
GC content: 65%
IMG OID: 640790350
Product: anthranilate phosphoribosyltransferase
Protein accession: YP_001327089
Protein GI: 150396622
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0547] Anthranilate phosphoribosyltransferase
TIGRFAM ID: [TIGR01245] anthranilate phosphoribosyltransferase
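
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that coordinates are 1-based and inclusive and that the gene length includes the stop codon. The record does not state either convention, but both are consistent with the listed values.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count, assuming the CDS length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(1489869, 1490882)
print(length, protein_length_aa(length))  # prints: 1014 337
```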


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.457707
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGATT TGAAACCGTT CGTCGCCAAA GTCGCAGCGC GCGAGGCACT TAGCCGCGAC 
GATGCACGCG CGGCCTTCGA GATCATCATG TCCGGGGCGG CCACGCCGTC GCAGATCGGC
GGCTTTCTCA TGGCGCTCCG TGTACGCGGC GAAACGGTCG ACGAGATCGT GGGTGCCGTC
GGGGCGATGC GTGCACGCAT GTTGCACGTG AAGGCGCCGG ACGGTTCGAT CGACATTGTC
GGCACCGGCG GCGACGGCGC CGGCACCTAC AATATTTCGA CGTTGGCCGC GCTGATCGTT
GCAGGCGCGG GGGTGCCGGT CGCCAAGCAC GGCAACCGTG CGCTGAGCTC GAAATCAGGA
ACGGCCGATG CGCTCTCCTG CCTGGGCGTC AATCTCGAAA TAGGGCCCGA GGCAATCTCG
CGCTGCATCG GCGAAGCCGG TCTGGGCTTC ATGTTCGCGC AGCAGCACCA TTCGGCTATG
CGCCATGTCG GTCCGACGCG GGTGGAACTC GGAACGAGAA CGATCTTCAA CCTGCTCGGC
CCCCTCGCCA ATCCGGCCGG CGTTCGGCAA CAGCTCGTCG GCGTCTACGC GCCGCAATGG
GTCGATCCGC TGGCAGAGGT GCTCCGCGAT CTCGGCTCCG AGAGTGTCTG GGTCGTCCAT
GGCGAAGGGC TCGACGAGAT CACGACGACC GGAGTGACCA AGGTTGCGGC GCTCAAGGAC
GGCACGATCA CCAACTTCGA ACTGACACCG GCCGATTTCG GGCTCGAGCG CGTTACGCTC
GATGCCTTGA AGGGCGGTGA CGGCGCCCAT AACGCCGCCG CGCTGCAAGC TGTTCTCGAC
GGTGCGGAGA ATGCCTACCG GGACATTTCC CTTGCGAACG CCGCCGCTTC GTTGATGATA
GCGGGGCGCG CAAAGGACCT GATGGAGGGC ATGGACTTGG CCCGGAAATC GCTTTCGAGC
GGCGCCGCAA AGGTCGCCTT GCAGCGATTG ATCACCGTTT CGAACGCGGC ATGA
 
Protein sequence
MSDLKPFVAK VAAREALSRD DARAAFEIIM SGAATPSQIG GFLMALRVRG ETVDEIVGAV 
GAMRARMLHV KAPDGSIDIV GTGGDGAGTY NISTLAALIV AGAGVPVAKH GNRALSSKSG
TADALSCLGV NLEIGPEAIS RCIGEAGLGF MFAQQHHSAM RHVGPTRVEL GTRTIFNLLG
PLANPAGVRQ QLVGVYAPQW VDPLAEVLRD LGSESVWVVH GEGLDEITTT GVTKVAALKD
GTITNFELTP ADFGLERVTL DALKGGDGAH NAAALQAVLD GAENAYRDIS LANAAASLMI
AGRAKDLMEG MDLARKSLSS GAAKVALQRL ITVSNAA
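
The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as plain text with the space-separated 10-base blocks shown above; the function name `gc_percent` is hypothetical, and how the database rounds its reported percentage is not stated in this record.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Usage: pass the full gene sequence text, including spaces and line breaks.
# Only the first two blocks of the sequence are used here, for brevity.
print(gc_percent("ATGAGTGATT TGAAACCGTT"))
```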