Gene Sama_2159 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSama_2159 
Symbol 
ID4604409 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella amazonensis SB2B 
KingdomBacteria 
Replicon accessionNC_008700 
Strand
Start bp2608704 
End bp2609801 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content57% 
IMG OID639781544 
Productchorismate synthase 
Protein accessionYP_928034 
Protein GI119775294 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0082] Chorismate synthase 
TIGRFAM ID[TIGR00033] chorismate synthase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGGGA ACAGCATAGG TCAGAATTTC GTGGTTACTA CCTTTGGCGA GAGCCATGGG 
GTGGCGCTCG GTTGCATTAT CGATGGTTGC CCTCCGGGCC TCGAGCTTAC CGAAGCTGAT
ATGCAGCACG ATCTTGACCG TCGCCGTCCG GGCACGTCCC GCTACACCAC GGCGCGCCGC
GAGCCGGACG AAGTGCGTAT TCTCTCAGGT GTTTTCGAAG GCAAAACAAC AGGAACCTCC
ATTGGTCTGA TTATCGAAAA CACCGATCAG CGCAGCCAGG ACTACAGCAA TATCAAAGAC
CTGTTCCGCC CCGGTCACGC CGATTACACC TATCAACAAA AGTACGGTCT GCGGGATTAT
CGCGGCGGTG GCCGTTCGTC CGCCCGAGAA ACCGCCATGC GGGTTGCCGC CGGCGCTGTG
GCGAAAAAGT ACCTCAAGGC CGTTCACGGG ATCGAAATTT ACGGCTTCCT GTCGCAACTT
GGCCCCATTG AGGCAGAGCA CATCGACCGC GAGCAGATTG AGCAAAACGC CTTTTTCTTC
CCTGATGCCA GCAAGCTTGA AGCGCTGGAT GAATATATGC GCGAGCTGAA AAAATCCGGC
GACTCCATCG GTGCCAAGGT CAGCGTGATT GCCACCAATG TACCAGTGGG CCTGGGCGAG
CCTGTGTTTG ACCGTCTTGA TGCCGACATC GCCCATGCGC TGATGGGCAT CAATGCCGTG
AAGGGAGTGG AAATTGGCGA TGGTTTCGCG GTAGTGACCC AAAAGGGCTC CGAGCATCGT
GATTTGATGT CACCCGAGGG CTTTGCCAGC AACCATGCCG GCGGCGTGCT TGGCGGCATT
TCATCCGGTC AGCCAATTGT GGCCCATATG GCGCTTAAGC CAACCTCCAG TATCAGCATT
CCCGGCGAGA GCATGACAGT GCAGGGCAAT ACTGCGGAAG TGGTTACCAA GGGCCGTCAC
GACCCCTGCG TGGGCATTCG CGCCGTGCCT ATTGCCGAGG CCATGTTGGC GATTGTATTG
ATGGATCATC TGCTCAGACA CCGTGCTCAG AATCAGCACG TGCACAGCGA AACCCCTGTG
CTGGGGATGC GCTCTTAA
 
Protein sequence
MSGNSIGQNF VVTTFGESHG VALGCIIDGC PPGLELTEAD MQHDLDRRRP GTSRYTTARR 
EPDEVRILSG VFEGKTTGTS IGLIIENTDQ RSQDYSNIKD LFRPGHADYT YQQKYGLRDY
RGGGRSSARE TAMRVAAGAV AKKYLKAVHG IEIYGFLSQL GPIEAEHIDR EQIEQNAFFF
PDASKLEALD EYMRELKKSG DSIGAKVSVI ATNVPVGLGE PVFDRLDADI AHALMGINAV
KGVEIGDGFA VVTQKGSEHR DLMSPEGFAS NHAGGVLGGI SSGQPIVAHM ALKPTSSISI
PGESMTVQGN TAEVVTKGRH DPCVGIRAVP IAEAMLAIVL MDHLLRHRAQ NQHVHSETPV
LGMRS