Gene Shew_1067 details


Gene Information

Locus tag: Shew_1067
Symbol:
ID: 4920576
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella loihica PV-4
Kingdom: Bacteria
Replicon accession: NC_009092
Strand:
Start bp: 1228034
End bp: 1229125
Gene Length: 1092 bp
Protein Length: 363 aa
Translation table: 11
GC content: 58%
IMG OID: 640162600
Product: phospho-2-dehydro-3-deoxyheptonate aldolase
Protein accession: YP_001093197
Protein GI: 127512000
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase
TIGRFAM ID: [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase
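
The coordinate and length fields above are related by simple arithmetic, and a quick consistency check can be sketched as follows. This is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the record; helper names are hypothetical.
start_bp = 1228034
end_bp = 1229125
gene_length_bp = 1092
protein_length_aa = 363


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons should hold if the record is internally consistent.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under those assumptions, the 1092 bp span corresponds to 364 codons, of which 363 encode amino acids.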


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000238391
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGCAAG ACACGATAAA TAATGTTCAC ATCAGCTCGG AGAAAGTGCT GGTGACCCCG 
GAAGAGTTAA AACAACAACT GCCCCTGTCG ACTGCCGCCT ATCACTATGT GCTTAACGCG
CGCCGCACTG TATCGGATAT CGTCCATAAG CGTGATAACC GCGTGCTGGT GGTCAGCGGT
CCCTGTTCTA TCCACGACAT CGCCTCGGCC AAAGAGTACG CGCTGCGCCT CAAGACGCTG
CACGACGAGC TTAAAGATGA GTTCTACATC CTGATGCGGG TCTACTTCGA GAAGCCACGT
ACTACCGTGG GCTGGAAGGG GATGATCAAC GATCCCGATA TGGATGAGTC TTTCGATGTG
GAGAAGGGGC TGCGTCAGGC GCGCGAGTTG ATGATCTGGC TGGCGGAGCT GGGACTGCCG
GTGGCGACCG AGGCGCTGGA TCCTATCAGC CCGCAATACA TCTCCGAGCT GGTCACCTGG
TCGGCCATCG GTGCGCGCAC CACAGAATCT CAGACCCACA GGGAGATGGC GTCCGGTCTC
TCCATGCCTG TGGGCTTCAA GAATGGTACC GACGGCAAGC TGGGGGTGGC GATCAACGCG
CTGCAATCGG CGGCCAGCAG CCACAGATTC ATGGGGATCA ACCAGGCGGG TCAGGTAGCG
CTGCTACAGA CCGCGGGTAA CCCAGATGGC CACGTGATCC TGCGCGGCGG CAAGACTCCT
AACTATGACG CTTCGAGCGT GGCCGAGTGT GAGCAGCAGC TGCATGCGGC CAAGCTTAAC
GCCCGCTTGG TGGTGGATTG CAGCCACGGC AACTCGAGTA AAGATCACAC CCGCCAGCCG
AGCGTCTGTC AGGATGTGTT CGATCAGATA GCCAATGGCA ACAAGTCGAT CATAGGCGTC
ATGCTGGAGA GTCATCTGAA CGAAGGTAAT CAGAGCAGCG ACAAGCCGAT GAGTGAGTTG
GCCTACGGGG TGTCGGTAAC CGATGCCTGC ATCGACTGGC AGACCACGGA AGATCTTCTA
CGTCAGGGGG CGGCACAATT GGCATCGGTT TTACCAGGCA GATTTGCTAT GCTCAAGGCG
GTCAATGCCT GA
 
Protein sequence
MQQDTINNVH ISSEKVLVTP EELKQQLPLS TAAYHYVLNA RRTVSDIVHK RDNRVLVVSG 
PCSIHDIASA KEYALRLKTL HDELKDEFYI LMRVYFEKPR TTVGWKGMIN DPDMDESFDV
EKGLRQAREL MIWLAELGLP VATEALDPIS PQYISELVTW SAIGARTTES QTHREMASGL
SMPVGFKNGT DGKLGVAINA LQSAASSHRF MGINQAGQVA LLQTAGNPDG HVILRGGKTP
NYDASSVAEC EQQLHAAKLN ARLVVDCSHG NSSKDHTRQP SVCQDVFDQI ANGNKSIIGV
MLESHLNEGN QSSDKPMSEL AYGVSVTDAC IDWQTTEDLL RQGAAQLASV LPGRFAMLKA
VNA