Gene Ssed_1684 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsed_1684 
SymboltrpD 
ID5609927 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sediminis HAW-EB3 
KingdomBacteria 
Replicon accessionNC_009831 
Strand
Start bp2018449 
End bp2019507 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content56% 
IMG OID640932554 
Productanthranilate phosphoribosyltransferase 
Protein accessionYP_001473423 
Protein GI157374823 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0547] Anthranilate phosphoribosyltransferase 
TIGRFAM ID[TIGR01245] anthranilate phosphoribosyltransferase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACA GCACAGATCT ACAACCACTT ATCGACAAAT TGTATCGAGG TGAGAGTGTC 
TCACGCAGCA AAGCCAAGCA ACTGTTCAGC TGCATCATTA ACGGTGAGAT GAGCGAAGCG
GCAATGGCAG GCATGTTAGT CGCCATGAAG ATGCGCGGCG AAACCATAGA TGAGATATCC
GGCGCTGCAG ATGCGCTTAT ATCGGCGGCA AAGGCATTCC CAACTCCCAG TGATGCGACT
CGAAAACAGG GAATCGTCGA TATTGTCGGT ACCGGCGGCG ATGGCCACAA CACCATCAAT
ATCTCCACAA CGGCGGCCTT CGTTGCTGCA GCGTCCGGGG CTAAGGTGGC TAAGCATGGC
AATCGCAGTG TATCGAGCAA ATCAGGCTCA TCGGATCTGC TGGCGCAATT TGGTATCGAC
CTTACCATGG CGCCGGAGAC CGCCCGGGAT TGCTTAGATG AATTGGGGCT CTGTTTCCTG
TTTGCTCCAC ACTATCACGG CGGGGTTCGC CACGCAGTTC CCGTCAGACA GGCGCTCAAG
ACCCGCACCC TGTTCAATGT CCTGGGGCCA CTCATCAACC CCTCTCACCC GGACTACATC
CTGCTCGGCG TTTACAGCGA AGAGTTGGTT CAACCGATAG CTGAAGTACT CAAAGCACTG
GGGATGAAGC GCGCGATGGT CGTTCATGGT AGCGGACTGG ACGAAGTCGC TGTCCATGGC
AATACTTCAG TCTGTGAGCT CACAGACGGC GAGCTCAAAC AATACACCCT AACCCCTGAG
GTGCTGGGCG TACCCAGGGC AAACCTGAAA GAGTTAGAGG GCGGCTCGCC CGAAGAGAAT
GCCGAGTTCA CCCGCGCTAT CTTACAGGGC CAAGGCCGGA CGGCGCATAC CAACGCGGTC
GCGGTTAATG CAGGTTGCGC CCTGTACATT TCAGGCGTGT GTGATAGCGT CGAGTCGGGT
ACAGCACTGG CACTAGAGAC GTTAGCCAGC ACCAAGGCCT ATACACTTCT TGAGCGGCTT
GCCAGTGCAA GCGCTAACCA AGCAAAAGTC GGAGCATAA
 
Protein sequence
MSDSTDLQPL IDKLYRGESV SRSKAKQLFS CIINGEMSEA AMAGMLVAMK MRGETIDEIS 
GAADALISAA KAFPTPSDAT RKQGIVDIVG TGGDGHNTIN ISTTAAFVAA ASGAKVAKHG
NRSVSSKSGS SDLLAQFGID LTMAPETARD CLDELGLCFL FAPHYHGGVR HAVPVRQALK
TRTLFNVLGP LINPSHPDYI LLGVYSEELV QPIAEVLKAL GMKRAMVVHG SGLDEVAVHG
NTSVCELTDG ELKQYTLTPE VLGVPRANLK ELEGGSPEEN AEFTRAILQG QGRTAHTNAV
AVNAGCALYI SGVCDSVESG TALALETLAS TKAYTLLERL ASASANQAKV GA