Gene Sala_0836 details


Gene Information

Locus tag: Sala_0836
Symbol: trpD
ID: 4080044
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 842002
End bp: 843000
Gene Length: 999 bp
Protein Length: 332 aa
Translation table: 11
GC content: 70%
IMG OID: 638009195
Product: anthranilate phosphoribosyltransferase
Protein accession: YP_615887
Protein GI: 103486326
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0547] Anthranilate phosphoribosyltransferase
TIGRFAM ID: [TIGR01245] anthranilate phosphoribosyltransferase
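
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and do not come from the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(842002, 843000)
print(length, protein_length_aa(length))  # prints: 999 332
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed in the record.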


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0270172
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00485559
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGCCGGT TCGGGCCATT TCCCGACCCT TCGGCGCTGC TCGACCATGA CGAGGCGGCG 
CACGCCTTCG CAACGATGCT CGATGGCGGT GCGCGCGACG AACAGATCGC CGCGTTTCTG
GTCGCGCTCG CCGACCGCGG CGAAACGATG GTCGAAATCG CCGCCGCGGC ACAGGCGATG
CGCGATCGGC TGATCCCCAT CGAGGCGCCG GCGGGCGCGA TCGACGTGTG CGGCACCGGC
GGCGACGGAC ACCACACGCT CAACGTCTCG ACGGCGGTGT CGATCGTCGT CGCGGCGTGC
GACGTGCCGG TCGCAAAGCA CGGCAATCGC GCGGCTTCGT CGAAATCGGG CGCCGCCGAC
ACGCTGGAGG CGCTTGGCCT CGACATGGAG CGCGCCGATC GTCAGGCGCA GGAACAGCTC
GCCGACCTCG GCATCTGTTT CCTCTTCGCC GGGACGCGCC ACCCTGCGAT GAAGCGCATC
ATGCCGATCC GCAAGGCGAT CGGGCGGCGG ACGATCTTCA ACCTGATGGG GCCGCTCGCC
AATCCCGCGC GCGTCACCCG CCAGCTTGTC GGCATCGCGC GCCCCGCCTA TGTGCCCGTC
TATGCCGAGG CGCTGCACCG GCTCGGCACC GATCATTCGC GCGTCATTTC GGGCGACGAG
GGGCTCGACG AACTCTCGCT CGCGGGCGGC AACGAGGTCG CGGTGGTGAC CCCCGACGGC
GTGCGGATGC AGCGCAGCAG CGCCGCCGAC GCCGGGCTGC CGACGCGCTC GCTCGCCGAA
ATCCGCGGCG GCGATGCGGC GTTCAACGCC CGCGCGCTGC GCCGCCTGCT CGAAGGCGAA
ACCGGCGCGT ACCGTGACGC GGTACTCTAC AACGCCGCCG CGGCGCTGAT CGTCGCGGGC
GCGGTCGACA CGCTGACGGA GGGGGTCGAG GAAGCCGCCG AAGCGATCGA CAAGGGCCTC
GCCAACGCGC TGCTCAACTG CTGGATCGCG TATAAATGA
 
Protein sequence
MSRFGPFPDP SALLDHDEAA HAFATMLDGG ARDEQIAAFL VALADRGETM VEIAAAAQAM 
RDRLIPIEAP AGAIDVCGTG GDGHHTLNVS TAVSIVVAAC DVPVAKHGNR AASSKSGAAD
TLEALGLDME RADRQAQEQL ADLGICFLFA GTRHPAMKRI MPIRKAIGRR TIFNLMGPLA
NPARVTRQLV GIARPAYVPV YAEALHRLGT DHSRVISGDE GLDELSLAGG NEVAVVTPDG
VRMQRSSAAD AGLPTRSLAE IRGGDAAFNA RALRRLLEGE TGAYRDAVLY NAAAALIVAG
AVDTLTEGVE EAAEAIDKGL ANALLNCWIA YK
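
The sequences above are printed in blocks of ten characters separated by spaces and line breaks, so they need to be joined before any computation. The sketch below is a hedged example using only the Python standard library: it strips the whitespace, computes the GC fraction, and checks that a nucleotide sequence consists of whole codons ending in a stop codon. The helper names are hypothetical, and the short input string is a toy example rather than the gene shown above.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace (spaces, line breaks) and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Stop codons under NCBI translation table 11 (bacterial, archaeal, plant plastid).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def looks_like_complete_cds(seq: str) -> bool:
    """Simple check: whole codons only, with a stop codon at the end."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


toy = clean_sequence("ATGGCC GCGTGA\n")  # toy input, not the record's sequence
print(len(toy), round(gc_fraction(toy), 3), looks_like_complete_cds(toy))
```

To apply this to the record, paste the full gene sequence block into `clean_sequence` and compare the results with the Gene Length and GC content fields.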