Gene Shewmr4_3724 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_3724 
Symbol 
ID4254287 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp4450091 
End bp4452943 
Gene Length2853 bp 
Protein Length950 aa 
Translation table11 
GC content50% 
IMG OID638120369 
Productformate dehydrogenase alpha subunit 
Protein accessionYP_735844 
Protein GI113972051 
COG category[C] Energy production and conversion
[R] General function prediction only 
COG ID[COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing
[COG3383] Uncharacterized anaerobic dehydrogenase 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTTAA CTCGCAAGTC CGATGTCGCC CAAGTGGCCG ACAAACCGAC GTTAGGCATT 
AGCCGTCGTC AATTTATGAA GCAAGCAGGT ATTACTACCG GTGGTATCGC AGCAGCTTCT
CTGATGGGTA CTGGCATGAT GCGCCGCGCA GAAGCCAAAG ATGTGCCACA CGACGCGCCG
ATCGAAGTTA AACGTACGAT TTGTAGTGCC TGTGCTGTGG GTTGTGGTCT GTATGCCGAA
GTGCAAAATG GTGTGTGGAC GGGTCAAGAA CCTGCATTCG ATCACCCATT CAATGCTGGC
GGTCACTGCG CTAAGGGTGC TGCACTGCGT GAGCATGGCC ACGGTGAAAA ACGCCTGAAA
TACCCAATGA AATTGGTTGA TGGTAAGTGG AAAAAAATCT CTTGGGAAGA TGCGATTAAC
GAAGTGGGCG ACCAAATGCT CAACATTCGT AAAGAATCAG GCCCAGACTC AGTGTACTTC
ATGGGTAGCG CTAAGTTCTC TAACGAAGGC TGCTATATGT ACCGCAAACT GGCGGCCATG
TGGGGCACAA ACAACGTCGA CCACTCAGCT CGTATTTGTC ACTCTACCAC GGTAGCCGGT
GTTGCTAACA CTTGGGGTTA CGGTGCGCAA ACTAACTCTT TCAACGACAT TCAGAATGCC
AATGCCATCT TCCTGATCGG GGCCAACCCA GCAGAAGCGC ATCCAGTATC TATGCAGCAC
ATTCTGATCG CTAAAGAGAA AAACAACGCG AAAATCATCG TTGTTGACCC ACGTTTCTCT
CGCACTGCGG CGCACTCAGA TCTGCACTGC GCGATTCGTC CAGGTACTGA CATTCCCTTT
ATCTACGGTA TGTTATGGCA CATTTTCGAA AACGGTTGGG AAGATAAGAC CTTTATCCAA
CAACGCGTAT TTGAGATGGA AACCATCCGC GAAGAAGCGA AAAAATTCCC ACCTAAAGAA
GTGGCAAATA TCACTGGCGT AAGTGAAGAA GTCATTTATC AAGCCGCGAA ACTGATGGCG
GAAAACCGTC CAGGTACCGT GATTTGGTGT ATGGGTGGTA CTCAACACCA CGTCGGTAAC
GCTAACACCC GTGCTTACTG TATTCTGCAA TTAGCCTTAG GCAACATGGG CGTTTCTGGC
GGCGGTACTA ACATTTTCCG TGGCCACGAC AACGTACAAG GCGCGACCGA CTTAGGTCTG
CTGTTCGATA ACCTACCAGG TTACTACGGT TTAACCTCAG CCGCTTGGAC TCACTGGACC
CATGTGTGGG ATCTAGATAT GGAGTGGGTG AAGAGCCGCT TCGATCAAAA CGCCTATTTA
GGCAAAGATC CAATGACCAC CCCTGGTATT CCTTGTTCTC GCTGGCACGA TGGCGTGTTA
GAAGACAAGA GCAAGCTGGC ACAGAAAGAC AATATCCGTA TGGCCTTCTT CTGGGGTCAA
TCGGTCAACA CTGAAACCCG TCAACGTGAA GTGCGTGATG CTTTAGACAA GATGGACACA
GTAGTGGTTG TCGACCCATT CCCAACCATG GCGGGTGTGA TGCACCGTCG TAAGAATGGC
GTGTATCTGT TACCTGCTGC GACTCAGTTT GAAACTCAAG GTTCAGTGTC TAACTCAGGC
CGTTCTATCC AATGGCGTGA GCAGGTTATC CAACCTTTAT TCGAGTCAAA AACCGACATC
GAAATTATGT ACCGTTTAGC GCAAAAACTC GGTATTGCCG AGCAATACAC TAAACGCATC
GCCAAAGAAA ACGGCTTACC GGTTATCGAA GAGATCACCC GCGAAATCAA CCGCGGCATG
TGGACCATCG GTATGACAGG TCAAAGCCCT GAGCGTATCA AGCTGCACAC CCAAAACTGG
GGCACTTTCA GCAACAAGAC GCTCGAAGCC GTCGGCGGCC CAGCGAAGGG CGAAACCTAC
GGTTTACCTT GGCCATGTTG GGGCACACCA GAAGCTAAAC ACCCTGGTAC CCAAATTCTG
TATAACCAAT CCAAACACGT TAAAGACGGC GGCGGTAACT TCCGCGCTCG TTACGGCGTT
GAATACAATG GCAAAAACCT ACTGGCTGAA GGCTCTTTCT CTAAAGGTGC CGAGATCCAA
GACGGTTACC CAGAATTTAC CGACAAGCTG CTGAAGCAAC TCGGTTGGTG GGATGACCTG
ACTGCGGAAG AAAAAGCCGA AGCTGAAGGC CGCAACTGGA AGACAGACTT GTCAGGCGGC
ATAGTGCGCG TGGCAATCAA GCACGGTTGT ATTCCATTTG GTAACGCTAA AGCCCGTTGT
ATTGTTTGGA CTTTCCCAGA CCAAGTGCCA GTTCACCGCG AGCCGTTATA CACAGCACGC
CGTGACTTAG TGGCTAAATA CCCAACCTAC GACGATATGC AAGTTCATCG TCTGCCAACA
CTGTACAAGT CAATCCAAGA GAAAGACTTC AGCGGCAAGT ACCCACTGGT ACTGACCTCT
GGTCGTTTAG TGGAATACGA AGGTGGTGGT GAAGAATCTC GTTCTAACCC ATGGCTGGCT
GAGCTTCAAC AGGAAATGTT TGTTGAAATC AACCCAGGTG ACGCAGCCGA CCGCGGTATC
CGCAACGGTG AGTTTGTGTG GTTAGAGGGC GCCGAAGGTG GCCGCATTAA AGTACAAGCC
ATGGTAACAC CACGCGTTAA ACCAGGTGTG ACCTTTATGC CATACCACTT TGCGGGTGTG
ATGCACGGTG AAAGCTTAGC GCCTAACTAT CCTGAGGGCA CTGTGCCTTA CGTTATCGGT
GAATCCGCTA ACACGGCACT GACCTATGGT TATGACCCTG TGACTCAAAT GCAGGAAACC
AAAGCGTCGC TCTGTCAGAT CGTTAAAGCG TAA
 
Protein sequence
MKLTRKSDVA QVADKPTLGI SRRQFMKQAG ITTGGIAAAS LMGTGMMRRA EAKDVPHDAP 
IEVKRTICSA CAVGCGLYAE VQNGVWTGQE PAFDHPFNAG GHCAKGAALR EHGHGEKRLK
YPMKLVDGKW KKISWEDAIN EVGDQMLNIR KESGPDSVYF MGSAKFSNEG CYMYRKLAAM
WGTNNVDHSA RICHSTTVAG VANTWGYGAQ TNSFNDIQNA NAIFLIGANP AEAHPVSMQH
ILIAKEKNNA KIIVVDPRFS RTAAHSDLHC AIRPGTDIPF IYGMLWHIFE NGWEDKTFIQ
QRVFEMETIR EEAKKFPPKE VANITGVSEE VIYQAAKLMA ENRPGTVIWC MGGTQHHVGN
ANTRAYCILQ LALGNMGVSG GGTNIFRGHD NVQGATDLGL LFDNLPGYYG LTSAAWTHWT
HVWDLDMEWV KSRFDQNAYL GKDPMTTPGI PCSRWHDGVL EDKSKLAQKD NIRMAFFWGQ
SVNTETRQRE VRDALDKMDT VVVVDPFPTM AGVMHRRKNG VYLLPAATQF ETQGSVSNSG
RSIQWREQVI QPLFESKTDI EIMYRLAQKL GIAEQYTKRI AKENGLPVIE EITREINRGM
WTIGMTGQSP ERIKLHTQNW GTFSNKTLEA VGGPAKGETY GLPWPCWGTP EAKHPGTQIL
YNQSKHVKDG GGNFRARYGV EYNGKNLLAE GSFSKGAEIQ DGYPEFTDKL LKQLGWWDDL
TAEEKAEAEG RNWKTDLSGG IVRVAIKHGC IPFGNAKARC IVWTFPDQVP VHREPLYTAR
RDLVAKYPTY DDMQVHRLPT LYKSIQEKDF SGKYPLVLTS GRLVEYEGGG EESRSNPWLA
ELQQEMFVEI NPGDAADRGI RNGEFVWLEG AEGGRIKVQA MVTPRVKPGV TFMPYHFAGV
MHGESLAPNY PEGTVPYVIG ESANTALTYG YDPVTQMQET KASLCQIVKA