Gene Shew_1952 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShew_1952 
Symbol 
ID4920985 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella loihica PV-4 
KingdomBacteria 
Replicon accessionNC_009092 
Strand
Start bp2257349 
End bp2258629 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content54% 
IMG OID640163521 
Product3-phosphoshikimate 1-carboxyvinyltransferase 
Protein accessionYP_001094077 
Protein GI127512880 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0128] 5-enolpyruvylshikimate-3-phosphate synthase 
TIGRFAM ID[TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000690945 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000426447 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAACAAC TACGACTCAA TCCCATATCT AAGGTTCATG GTACCGTGAA TATTCCCGGT 
TCTAAGAGTA TCTCTAACCG CGCTCTGTTA TTGGCGACCT TAGCTGAAGG GAAAACCCGA
CTGACCAATC TGCTCGATTC TGACGACATT CGTCACATGC TTACTGCCCT CAAGCAGCTC
GGGGTTAATT ATCAATTATC TGACAATAAC CGGGTTTGCG AAGTCGAGGG GCTGAGCGGC
GTGATTAATA GCGACACAGC CCAGACACTG TTTCTGGGTA ATGCCGGTAC GGCGATGCGC
CCCTTATGTG CTGCGCTGAC CTTAGGCAGC GGCGAATTTA CCTTAACGGG CGAGCCGCGA
ATGGAGGAGC GTCCCATAGG TGATCTGGTC GATGCCCTTA ACGCACTTGG CGCCGATATA
CGCTACCTGA AACAGCCTGG CTTTCCGCCA CTGACCATTA ATGCCACCGG ACTCAATGGC
GGCGATGTTG AGATCGCAGG CGACCTTTCC AGCCAGTTTT TAACCGCGCT GCTGATGGTA
ACGCCGCTTG CCAAGGCCCA GGTGAATATC AAGATTAAAG GCGAGCTGGT CTCCAAACCT
TATATCGACA TCACTATCGC GTTGATGGCG CAGTTTGGCG TGACCGTTAT CAATCACGAC
TATCAGCGCT TTGAGATCCC TGCAGGCCAG AAATATGTCT CCCCCGGCAC TGTGCTGGTT
GAAGGCGACG CCTCATCGGC CTCATACTTC CTGGCGGCGG GAGCCATTCA GGGCGGTGAG
GTTAAGGTCA CCGGCGTTGG ACTGAAAAGT ATTCAAGGGG ATGTTAAGTT TGCCGAGGTG
CTCGAAGCCA TGGGCGCACA GATAGAGTGG GGCGACGATT TTATCATCGC CAGAAGTGCG
CCGCTGCATG GGGTGGATCT CGACATGAAC CACATCCCGG ATGCTGCCAT GACCATAGCG
ACAGCGGCGC TGTTTGCCAC AGGCACCACG ACGCTGCGTA ATATCTATAA CTGGCGCATC
AAGGAGACGG ACCGTCTCGC TGCCATGGCC ACCGAACTGC GTAAAGTCGG CGCCGAGGTA
GAAGAGGGCC ATGATTATAT TCGCGTCACG GCGCCGGCTC AGTTAAATAC GGCCGATATC
GATACTTATA ACGATCATCG CATGGCCATG TGTTTCTCGC TGATGGCCTT TGCCGATTGT
GGCATTACCA TCAACGATCC TGATTGTACT TCCAAAACCT TCCCCGACTA CTTCGCCCAG
TTTGCGGCGC TTGCCCAGTA G
 
Protein sequence
MKQLRLNPIS KVHGTVNIPG SKSISNRALL LATLAEGKTR LTNLLDSDDI RHMLTALKQL 
GVNYQLSDNN RVCEVEGLSG VINSDTAQTL FLGNAGTAMR PLCAALTLGS GEFTLTGEPR
MEERPIGDLV DALNALGADI RYLKQPGFPP LTINATGLNG GDVEIAGDLS SQFLTALLMV
TPLAKAQVNI KIKGELVSKP YIDITIALMA QFGVTVINHD YQRFEIPAGQ KYVSPGTVLV
EGDASSASYF LAAGAIQGGE VKVTGVGLKS IQGDVKFAEV LEAMGAQIEW GDDFIIARSA
PLHGVDLDMN HIPDAAMTIA TAALFATGTT TLRNIYNWRI KETDRLAAMA TELRKVGAEV
EEGHDYIRVT APAQLNTADI DTYNDHRMAM CFSLMAFADC GITINDPDCT SKTFPDYFAQ
FAALAQ