Gene Sama_1687 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSama_1687 
Symbol 
ID4603938 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella amazonensis SB2B 
KingdomBacteria 
Replicon accessionNC_008700 
Strand
Start bp2060661 
End bp2062076 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content53% 
IMG OID639781050 
Productpara-aminobenzoate synthase, component I 
Protein accessionYP_927563 
Protein GI119774823 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.532467 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.427027 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTCCAC CCCTGCCCAC TCGTATAAAT CATGCACAGC TCGCCGTAAA AACCCTCGAT 
TGGACCGCAT CCACCAGCGA CATTTTTACC CCATTGGCCA GTCAGCCATG GTCAATGCTG
CTTGACTCGG CCGATGCACC GCACATGGAC GCCCACTGGG ACATTTTGGT GGCAGCCCCT
GTCGCTACGT TAAAAGTGTA CGAGAGCCAC TCCGAACTCA CCTATGCAGG CAACACACTG
AGACTGGACA CCTCTGAGTG CCCTTTTTCA CAGTTGCAAT CGGTCCAGCA TGCACTATTC
AGCGTTCAAA AAAATACATC ACTCCCCTTC GCCGGTGGCG CATTAGGCAG TTTTAACTAT
GATTTAGGCA GGCGCATTGA GCGACTACCC AGTACGGCTC TGGACGATAT TAATCTGCCA
TTAGCCTGTA TTGGCTTTTA CGACTGGGCG CTTATGCGAA GCTATCAATC TGATTCATGG
CAACTGGTAC ATTATCTCGG TGATGACGCA TTAAACGAGA CACTGGCATG GCTTGAGCAG
CAGCGCGACT TTGCCCAAGC CGGGGCGGAG TCTAACACCA GCTTCTCGCT GCTGACGGAG
TTTACCCCCC AAATCACCCG AGACCAGTAC CAGCAAAAAT TCAATCAGGT GCAATCTTAT
TTGGCGAGTG GTGACTGCTA TCAGATAAAC CTGACCCAAA GGTTCAGCGC TGATTATCAG
GGAAGCGAGT GGCAGGCCTA CCTCAAACTG CGTTCTGCCA ATGTGGCGCC CTTCTCTGCC
TTTGTCCGGC TTGAAGAAGG CGCCATTTTG TCCATCTCGC CGGAACGGTT TATCAAACTT
GACGGCAGAC AGGTGGAAAC CAAGCCTATC AAGGGCACCT TGCCAAGATT GCCCGACCCG
GACGCCGATA AAACCAATGC CATTTTGCTG AAAGCCTCGC CCAAAGACAG GGCCGAAAAC
CTGATGATTG TGGATCTGCT GCGAAATGAT ATTGGCCGGG TAGCAAGCCC GGGGAGTGTT
CGGGTGCCCA AGCTTTTTGA AGTGGAAAGC TTCCCTGCCG TGCATCATTT GGTCAGTACA
GTAACCGCGC AACTTGCCGA AAACAAAGAT GCCTTTGATT TATTGAGAGC AGCCTTCCCG
GGCGGCTCTA TTACCGGCGC CCCCAAAATC CGCGCCATGG AAATTATTGA AGAGCTTGAG
CCATCCCGGC GCAGCATTTA CTGTGGCTCC ATCGGTTATA TCAGCCAGCA CGGTAATATG
GATACCAGCA TCACCATACG CACCCTGGCG GCTGTCGATG GCAAACTGTA CTGCTGGGCC
GGTGGGGGCG TGGTGGCCGA CTCAATTGCC GACAGCGAGT ATCAGGAAAC CTTCGACAAG
ATCAGCCGTA TTCTACCGAT ACTGGAACAG GAATAA
 
Protein sequence
MFPPLPTRIN HAQLAVKTLD WTASTSDIFT PLASQPWSML LDSADAPHMD AHWDILVAAP 
VATLKVYESH SELTYAGNTL RLDTSECPFS QLQSVQHALF SVQKNTSLPF AGGALGSFNY
DLGRRIERLP STALDDINLP LACIGFYDWA LMRSYQSDSW QLVHYLGDDA LNETLAWLEQ
QRDFAQAGAE SNTSFSLLTE FTPQITRDQY QQKFNQVQSY LASGDCYQIN LTQRFSADYQ
GSEWQAYLKL RSANVAPFSA FVRLEEGAIL SISPERFIKL DGRQVETKPI KGTLPRLPDP
DADKTNAILL KASPKDRAEN LMIVDLLRND IGRVASPGSV RVPKLFEVES FPAVHHLVST
VTAQLAENKD AFDLLRAAFP GGSITGAPKI RAMEIIEELE PSRRSIYCGS IGYISQHGNM
DTSITIRTLA AVDGKLYCWA GGGVVADSIA DSEYQETFDK ISRILPILEQ E