Gene Sare_3530 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3530 
SymboltrpD 
ID5704598 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4071353 
End bp4072414 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content73% 
IMG OID641272957 
Productanthranilate phosphoribosyltransferase 
Protein accessionYP_001538323 
Protein GI159039070 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0547] Anthranilate phosphoribosyltransferase 
TIGRFAM ID[TIGR01245] anthranilate phosphoribosyltransferase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000658769 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGGCGAAC GGACCTGGCC GCAACTGCTC GCCGCGCTGC TTCGCGGCGA CGAGCTCTCC 
ACCGCTGACA CAGCCTGGGC AATGGGTGAG ATCATGTCCG GCTCGGCTGG CTCGGCGCAG
ATCGCCGGTT TCGCCATCGC GCTACGGGCC AAGGGCGAAA CCCCCGCCGA GGTGTCCGGC
TTGGTGGAGG CGATGCTTCA GCACGCGGTT CGGGTCGAGC TGCCCGAGGA CCTACGCGCG
ACCGCAGTGG ACGTGGTGGG CACCGGCGGC GACCTCGCGC ACACCGTCAA CATCTCCACC
ATGGCCTCCC TGGTGGTGGC CGGTGCCGGC GTACGGGTCG TCAAGCACGG CAACCGGGCC
GCCTCCTCGT CCTGCGGCAC CGCGGACGTG CTGGAGTTTC TCGGCCTGCC GCTGGACCTG
GGTCCGGAGG GGGTGGCGGC CTGCGTCGCC GAGGCAGGTA TCGGCTTCTG CTTCGCGGCC
CGGTTCCACC CCGGTATGCG CCATGCCGGT CCGGTCCGCC GGGAACTGGG CGTACCGACC
GCCTTCAACT TCCTCGGCCC GCTCACCAAC CCGGCCCGTC CGCGGGCCGG CGCGGTCGGC
TGCTTCGACG CGCGGATGGC ACCGGTCATG GCAGCGGTCT TCGCCGCCCG CGGTGACTCG
ACGCTCGTCC TGCGGGGCGA GGACGGGCTG GACGAGTTCA CCACTGCCGC CCCCACCCGG
GTGTGGGCGG CGCAGAACGG CACCGTCCGG GAGGCCCTGC TCGACGCAGC CGACCTCGGG
GTGCCCCGGG CCACCCTCGC CGACCTGCGC GGCGGTGATG TCGCGTGCAA CGCCGACGCG
GTGCGCCGCC TGCTGGCCGG TGAGACCGGG CCGATACGCG ACGCCGTGTT GGTCAACGCC
GCCGCCGCGC TGGCCACCCA GGCACCCCTG GACGGTGACC TGACCGAGGC GCTGCGGACC
GGTCTGTCCC GCGCGGCCGA ATCGATCGAC TCCGGCGCTG CCGCCCGCAC CCTGAACCGG
TGGATCGAGG TCGCCCACGC CGTCCGGCCA GTGCTCGGCT GA
 
Protein sequence
MGERTWPQLL AALLRGDELS TADTAWAMGE IMSGSAGSAQ IAGFAIALRA KGETPAEVSG 
LVEAMLQHAV RVELPEDLRA TAVDVVGTGG DLAHTVNIST MASLVVAGAG VRVVKHGNRA
ASSSCGTADV LEFLGLPLDL GPEGVAACVA EAGIGFCFAA RFHPGMRHAG PVRRELGVPT
AFNFLGPLTN PARPRAGAVG CFDARMAPVM AAVFAARGDS TLVLRGEDGL DEFTTAAPTR
VWAAQNGTVR EALLDAADLG VPRATLADLR GGDVACNADA VRRLLAGETG PIRDAVLVNA
AAALATQAPL DGDLTEALRT GLSRAAESID SGAAARTLNR WIEVAHAVRP VLG