Gene Shewmr4_3720 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_3720 
Symbol 
ID4254283 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp4444941 
End bp4447793 
Gene Length2853 bp 
Protein Length950 aa 
Translation table11 
GC content50% 
IMG OID638120365 
Productformate dehydrogenase alpha subunit 
Protein accessionYP_735840 
Protein GI113972047 
COG category[C] Energy production and conversion
[R] General function prediction only 
COG ID[COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing
[COG3383] Uncharacterized anaerobic dehydrogenase 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.933728 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGATTAA CCCGCAAAAC AGACCAAGTC GTCGAGCCAA AAGTGCCCGC CCTCGGTCTC 
AATCGTCGCC AATTCTTAAA ATCTGCAGGT CTTGCCACTG GTGGTATCGC CGCCGCGTCT
ATGCTTGGCA CAGGCATGAT GCGTAAAGCA CAGGCGCAGG AACATATCCC CCATAATGCA
CCGACTGAAG TCAAACGTAC TATTTGCTCT CACTGCGCTG TGGGTTGTGG TATCTATGCT
GAAGTGCAAA ACGGTGTGTG GACAGGTCAA GAACCCGCGT TCGATCATCC ATTTAACCAA
GGCGGCCACT GCGCGAAAGG GGCTGCACTG CGTGAACACG GCCACGGTGA GAAACGCCTG
AAATACCCAA TGAAGTTAGA AGGCGGTAAG TGGAAGAAGA TCTCTTGGGA TCAAGCCATC
AATGAAGTGG GTGATAAAAT GACTGCGATT CGTCAAGAAT CGGGTCCTGA CTCTATCTAC
TTTATGGGTA GCGCTAAGTT CTCTAACGAA CAAGCCTATT TATATCGTAA ACTCGCGGCA
CTGTGGGGCA CAAACAACGT CGACCACTCA GCCCGTATTT GTCACTCTAC CACGGTAGCC
GGTGTTGCTA ACACTTGGGG CTACGGTGCG CAAACCAACT CGTTGAACGA TATCCGCAAC
TCTAAGTGCG TCATGTTTGT GGGTTCAAAC CCAAGTGAAG CGCACCCAGT CGCCATGCAA
CACATTTTGG TGGCAAAAGA GCGCGGCGCT AAGATTATCG TTGTTGATCC ACGTTTCACC
CGTACTGCAG CTAAGTCTGA CGAGTACGTG CATATCCGCC CAGGTACCGA TATCCCCTTC
ATCTATGGTC TGTTATGGCA CATTTTTGAA AACGGCTGGG AAGATAAAGA GTTCATCAAG
CAACGTGTTT ACGGCATGGA ACGTATTCGC GATGAAGTGA AAAAATATAC GCCTGAAGAA
GTCGAAAACG TTGCTGGCGT GCCTAAGGCG CAAATGTACC GTATCGCTAA AATGTTAGCC
GAAACCAAAC CTGGCACTAT CGTATGGTGT ATGGGCGGTA CTCAGCACCA CGTCGGTAAC
GCCAACACCC GTTCATACTG TATTTTACAG TTAGCGCTGG GCAACATGGG CGTATCAGGC
GGCGGTACTA ACATTTTCCG TGGTCATGAT AACGTGCAAG GCGCGACTGA CTTTGGTCTG
TTATTCGACA ACTTACCCGG TTACTACGGT TTAACTTCTG GCGCTTGGGC TCACTGGTCT
AACGTTTGGG ACTTAGATCC AAAATGGGTT GCAGGCCGTT TCGACCAAGG CGAGTACCTA
GGTCAAACAC CTCAAACCTC AACGGGTATT CCCTGCTCTC GCTGGCACGA TGGTGTACTA
GAAGATAAAA CCAAGATCGC GCAAAAGGAC AACATCCGTC TGGCGTTCTT CTGGGGTCAA
TCTGTCAACA CCGAAACCCG TGGCCGCGAA GTACGTGAAG CACTGAACAA GTTAGATACT
GTGGTTGTTG TCGACCCAAT CCCAACCATG GCCGGTGTTA TGCACCAGCG TAAAGATGGG
GTGTATCTGC TCCCTGCGTC GACTCAATTT GAAACCTACG GCTCAGTGTC TGCCACTAAC
CGTTCGATTC AATGGCGCTC TAAAGTGATC GAGCCGCTGT TTGAGTCTCT GCCTGACCAC
GTCATTATGT ACAAACTGGC GAAAAAGCTG GGTATCGAAA AAGAATTCTG TAAGCACATC
CAAGTGAATG GTGAAGAGCC ATTGATTGAA GACGTGACCC GCGAATTCAA CAAAGGCATG
TGGACCGTCG GTTACACAGG CCAGAGCCCA GAACGTCTGA AAATGCACCA AGAAAACTGG
GGCACTTTCG ATGTTAACAG CCTGACCGCA CCGGGCGGCC CAGCTAAAGG TGAAGTCTAC
GGCTTACCTT GGCCATGTTG GGGTACTCCA GAGATGAAAC ACCCTGGTAC CCAAATCCTT
TACGATCAAT CCAAAGAAGT GAAAGACGGT GGTGGTAACT TCCGCGCCCG TTATGGTGTT
GAACACAATG GTGTGAATAT TCTCGCCGAC GGTTCATTCT CCAAAGGCAG TGAAATTCAA
GATGGTTATC CTGAGTTTAC CGCCGACATG CTCAAGCAAT TGGGTTGGTG GGATGATTTA
ACTGAAGAAG AGAAAAAATA CGCCGAAGGC AAAAATTGGA AAACAGATAT TTCTGGTGGT
ATCCAACGTG TTGCCATCAA ACACGGCTGT ATTCCTTTCG GTAACGCAAA AGCGCGTTGT
ATCGTGTGGA CTTTCCCAGA CGATATCCCG CTGCACCGCG AGCCACTCTA CACTCCTCGT
CGTGACTTAG TCGCTAAGTA CCCAACCTAC GAAGACCGTA TGGTTGCCCG CCTACCGACC
CTGTATAAGT CAATTCAGGA TAAGGACTTC ACCCAAGGCT TCCCACTGAC ACTGACCTCT
GGTCGTTTGG TGGAATACGA AGGTGGTGGT GAAGAATCTC GTTCTAACCC TTGGTTGGCC
GAGCTACAAC AGGAAATGTT CATCGAAATG AACCCGGCAG ACGCTGCTGA CCGTGGTATC
CGTGACGGTG ACAATGTCTT TGTTCATGGT CCTGAAGGCG CCAAGATCAC AGTGAAGGCA
ATGGTGACAC CACGGGTTGT TCCGGGTGAA TGTTTTATGC CATACCACTT CGCCGGTATC
TTTGAAGGTG AAAACCTCGC GAAGAATTAC CCTGAAGGTA CAGTACCTTA TGTACAAGGT
GAATCGGCTA ACACCATTCT AACTTACGGC TATGACGTTG TGACTCAGAT GCAAGAAACT
AAGTCCAGCC TTTGCCAAGT TAGCAAAGCC TAA
 
Protein sequence
MRLTRKTDQV VEPKVPALGL NRRQFLKSAG LATGGIAAAS MLGTGMMRKA QAQEHIPHNA 
PTEVKRTICS HCAVGCGIYA EVQNGVWTGQ EPAFDHPFNQ GGHCAKGAAL REHGHGEKRL
KYPMKLEGGK WKKISWDQAI NEVGDKMTAI RQESGPDSIY FMGSAKFSNE QAYLYRKLAA
LWGTNNVDHS ARICHSTTVA GVANTWGYGA QTNSLNDIRN SKCVMFVGSN PSEAHPVAMQ
HILVAKERGA KIIVVDPRFT RTAAKSDEYV HIRPGTDIPF IYGLLWHIFE NGWEDKEFIK
QRVYGMERIR DEVKKYTPEE VENVAGVPKA QMYRIAKMLA ETKPGTIVWC MGGTQHHVGN
ANTRSYCILQ LALGNMGVSG GGTNIFRGHD NVQGATDFGL LFDNLPGYYG LTSGAWAHWS
NVWDLDPKWV AGRFDQGEYL GQTPQTSTGI PCSRWHDGVL EDKTKIAQKD NIRLAFFWGQ
SVNTETRGRE VREALNKLDT VVVVDPIPTM AGVMHQRKDG VYLLPASTQF ETYGSVSATN
RSIQWRSKVI EPLFESLPDH VIMYKLAKKL GIEKEFCKHI QVNGEEPLIE DVTREFNKGM
WTVGYTGQSP ERLKMHQENW GTFDVNSLTA PGGPAKGEVY GLPWPCWGTP EMKHPGTQIL
YDQSKEVKDG GGNFRARYGV EHNGVNILAD GSFSKGSEIQ DGYPEFTADM LKQLGWWDDL
TEEEKKYAEG KNWKTDISGG IQRVAIKHGC IPFGNAKARC IVWTFPDDIP LHREPLYTPR
RDLVAKYPTY EDRMVARLPT LYKSIQDKDF TQGFPLTLTS GRLVEYEGGG EESRSNPWLA
ELQQEMFIEM NPADAADRGI RDGDNVFVHG PEGAKITVKA MVTPRVVPGE CFMPYHFAGI
FEGENLAKNY PEGTVPYVQG ESANTILTYG YDVVTQMQET KSSLCQVSKA