Gene Shewmr4_2767 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_2767 
Symbol 
ID4253338 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp3310810 
End bp3312459 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content49% 
IMG OID638119402 
Productmalate synthase 
Protein accessionYP_734895 
Protein GI113971102 
COG category[C] Energy production and conversion 
COG ID[COG2225] Malate synthase 
TIGRFAM ID[TIGR01344] malate synthase A 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00223378 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGAAC ACACTTTAAG TGAACAGCAA TTGAATTCGA CACAGAATAA GGCCACTGCG 
AATGGCACTC TGGCCTTAGT GGGAAATACC ATCCCGGGGC AGGAGGTGAT TTTTACCGAA
GGCGCGCTGG CGTTGCTTGA GTCACTTTGC CGTGAATTTG CGACAGAAGT GCCCACATTA
CTCGCCAAAC GTAAAGATAG ACAGGCGCGT ATCGATAAAG GGGCTTTGCC TGACTTTTTA
CCCGAGACTC GCGCAATTCG TGACGGGGCA TGGAAGATCC GTGGTATTCC GAATGACTTA
CTCGATCGCC GTGTCGAAAT TACCGGTCCC GTCGAACGTA AGATGGTGAT CAATGCGCTT
AATGCAAATG CCAAAGTGTT TATGGCGGAT TTTGAAGACT CCTTAGCACC GAGCTGGCAG
AAAGTCGTGG AAGGTCAGAT TAACCTACGT GATGCCGTGC GCGGTGAGAT TGAGTACACA
GCGCCTGAAA CCGGTAAACA CTACAAGTTA GGCCAAAATC CTGCGGTATT AATTTGCCGT
GTGCGTGGCC TGCACTTAAA AGAGAAGCAC GTTGAATTTA ACCAGCAGTC CATTCCCGGC
GCGTTATTCG ACTTTGCGAT GTTCTTCTAC CATAACTATC GCCAATTGCT GGCGAAGGGC
AGTGGTCCTT ACTTCTATAT TCCTAAACTT GAGAGCCATA TTGAAGCGCG TTGGTGGGCA
AAAGTGTTTG CTTTTGTTGA AGAAAGATTT TGCCTTCAGC CTGGGACAAT CAAATGTACC
TGTTTGATTG AGACCTTACC CGCGGTGTTT GAAATGGACG AAATTCTCTA TGAATTACGC
TCAAACATTG TCGCGCTGAA CTGTGGTCGT TGGGACTATA TCTTCAGCTA TATCAAAACA
TTAAAGCGTC ATAGCGACCG TGTATTACCG GATCGCCAAG CGGTGACCAT GGATACGCCT
TTCTTAAGCG CCTATTCAAG ACTGCTGATC AAAACCTGCC ATAAACGTGG CGCACTGGCG
ATGGGCGGCA TGGCGGCCTT TATTCCAGCG AAGGATCCCG CTCAAAACGA AACCGTGTTG
CAACGGGTAC GTAAGGACAA AGAGCTTGAG GCTCGCAATG GTCACGATGG GACTTGGGTC
GCTCACCCAG GTCTTGCGGA CACGGCAATG GGGATCTTTA ACGAATACAT AGGCCAAGAT
CATCAAAATC AATTGCATAT CACCCGTGAT GTGGATGCAC CGATCCTTGC CGCAGAGTTA
TTAAAACCCT GTGATGGTGA GCGAACTGAG CAAGGGATGC GCCTGAATAT TCGCATCGCT
CTGCAATACC TTGAGGCGTG GATCAGTGGC AACGGTTGTG TGCCGATTTA CGGATTAATG
GAAGATGCGG CAACGGCGGA AATCTCCCGC GCCTCGATTT GGCAATGGAT CCAACATGGC
AAGTCACTCT CAAACGGCAA ACCCGTCACT AAACAATTGT TTAAGGACAT GCTGGTGGAA
GAGTTAGCAA ATGTGAAAAA AGAAGTGGGC GGTGACAGAT TCACCCACGG CAAATTTACC
CAAGCGGCTG TATTGCTTGA GGATATTACC ACTTCGGATG AATTGGTCGA TTTCTTAACC
TTACCCGGTT ACGAGATGCT AACGGCTTAA
 
Protein sequence
MTEHTLSEQQ LNSTQNKATA NGTLALVGNT IPGQEVIFTE GALALLESLC REFATEVPTL 
LAKRKDRQAR IDKGALPDFL PETRAIRDGA WKIRGIPNDL LDRRVEITGP VERKMVINAL
NANAKVFMAD FEDSLAPSWQ KVVEGQINLR DAVRGEIEYT APETGKHYKL GQNPAVLICR
VRGLHLKEKH VEFNQQSIPG ALFDFAMFFY HNYRQLLAKG SGPYFYIPKL ESHIEARWWA
KVFAFVEERF CLQPGTIKCT CLIETLPAVF EMDEILYELR SNIVALNCGR WDYIFSYIKT
LKRHSDRVLP DRQAVTMDTP FLSAYSRLLI KTCHKRGALA MGGMAAFIPA KDPAQNETVL
QRVRKDKELE ARNGHDGTWV AHPGLADTAM GIFNEYIGQD HQNQLHITRD VDAPILAAEL
LKPCDGERTE QGMRLNIRIA LQYLEAWISG NGCVPIYGLM EDAATAEISR ASIWQWIQHG
KSLSNGKPVT KQLFKDMLVE ELANVKKEVG GDRFTHGKFT QAAVLLEDIT TSDELVDFLT
LPGYEMLTA