Gene Sbal195_4228 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal195_4228 
Symbol 
ID5756059 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS195 
KingdomBacteria 
Replicon accessionNC_009997 
Strand
Start bp5002940 
End bp5005792 
Gene Length2853 bp 
Protein Length950 aa 
Translation table11 
GC content50% 
IMG OID641290584 
Productmolybdopterin oxidoreductase 
Protein accessionYP_001556646 
Protein GI160877330 
COG category[C] Energy production and conversion
[R] General function prediction only 
COG ID[COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing
[COG3383] Uncharacterized anaerobic dehydrogenase 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00413313 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAGTTAA CTCGCAAGTC TAGTGTCGCC CCAGTGGCAG ACAAACCGGC GTTAGGCATT 
AGCCGTCGTC AATTTATGAA GCACGCAGGT ATTGCCACCG GAGGTATTGC GGCGGCATCA
CTAATGGGCA CAGGCATGAT GCGTCGCGTC GAAGCCAAAG AAGTGCCACA TGATGCCCCT
ATCGAAGTCA AACGTACTAT TTGTAGTGCC TGCGCCGTCG GTTGTGGTCT GTATGCTGAA
GTACAAAATG GCGTGTGGAC GGGTCAAGAA CCTGCATTCG ATCATCCCTT CAACGCCGGT
GGTCATTGCG CTAAAGGTGC GGCACTGCGT GAGCATGGCC ATGGCGAAAA ACGCCTGAAA
TACCCAATGA AATTGGTTGA TGGCAAGTGG AAAAAAATCT CTTGGGAAGA TGCGATTAAC
GAAGTGGGCG ACCAAATGCT CAACATTCGT AAAGAATCCG GCCCAGACTC AGTGTACTTC
ATGGGTAGCG CTAAATTCTC AAACGAAGGT TGCTACATGT ACCGCAAACT GGCGGCCATG
TGGGGCACAA ACAACGTCGA CCATTCAGCT CGTATTTGTC ACTCTACCAC GGTAGCCGGT
GTTGCTAACA CTTGGGGCTA CGGTGCGCAA ACTAACTCCT TTAACGATAT CCAAAACGCG
AATGCGATTT TCTTTATCGG TGCTAACCCA GCAGAAGCCC ACCCAGTGGC GATGCAACAT
ATTCTGATCG CTAAAGAAAG AAACAACGCT AAGTTGATCG TAGTTGATCC ACGCTTCTCT
CGTACCGCGG CGCATTCCGA TCTGCACTGT GCATTACGTC CAGGTACTGA TATTCCTTTT
ATCTACGGTA TGTTATGGCA CATTTTTGAA AACGGTTGGG AAGATAAAAC CTTTATTCAA
CAACGTGTTT TCGAGATGGA CACCATTCGC GCCGAGGCGA AAAAATTCCC ACCTAAAGAA
GTAGAAAACA TCACAGGCGT GAGCGAAGCC GAGGTTTATC AAGCCGCGAA ACTCATGGCT
GATAACCGCC CAGGTACTGT GATCTGGTGT ATGGGTGGTA CTCAACACCA CGTCGGTAAT
GCCAATACCC GCGCTTACTG TATTCTGCAA TTAGCGCTAG GTAACATGGG CGTGTCAGGC
GGCGGAACGA ACATTTTCCG TGGTCACGAT AACGTACAAG GCGCGACTGA CTTAGGTTTG
TTATTCGATA ACTTACCGGG TTACTACGGC TTAACAACAG GTGCTTGGAC TCACTGGACC
CACGTGTGGG ATCTAGACAT GGAATGGATG AAAGGCCGTT TCGATCAAGG CACTTATCTT
GGCCGCGAAC CTATGACTAC GCCGGGTATC CCTTGTTCTC GCTGGCACGA TGGTGTACTA
GAAGAAAAAG GCAAACTCGC GCAGCAAGAT AACATTCGCA TGGCGTTTTT CTGGGGACAA
TCGGTGAACA CCGAAACCCG CCAACGTGAT GTGCGTGATG CCTTAGACAA GATGGACACT
GTGGTGGTGG TCGATCCATT CCCAACTATG GCAGGTGTGA TGCATCGCCG TAAGAATGGT
GTTTACTTGC TACCAGCAGC GACACAGTTT GAAACTGAAG GCTCAGTTTC AAACTCTGGC
CGCAGTCAGC AATGGCGCGA AAAAGTCATA GAGCCATTGT TTGAATCTAA AACCGATATC
GAAATCATGT ATCGCTTAGC GCAAAAACTT GGTTTTGCTG AGCAGTACAC TAAACGCATC
GCTAAAGAAA ATGATTTGCC TGTGATTGAA GACATCACCC GCGAAATCAA CCGCGGCATG
TGGACTATCG GTATGACGGG CCAAAGCCCT GAACGTCTGA AACTGCATAC CCAAAACTGG
GGCACCTTCA GCAATAAGAC GCTCGAAGCC GCAGGCGGCC CAGCAAAAGG CGAAACCTAC
GGTTTACCTT GGCCATGTTG GGGCACGCCA GAAGCGAAAC ACCCCGGCAC CCAAATTCTT
TACGATCAAT CCAAACATGT TAAAGACGGT GGCGGTAACT TCCGTGCCCG TTACGGCGTT
GAATACAATG GTAAAAACCT ATTGGCTGAA GGCTCTTTCT CTAAAGGTGC TGAAATCCAA
GACGGTTATC CAGAGTTCTC CGACAAGCTG CTCAAGCAAC TCGGTTGGTG GGATGATCTG
ACGGCGGCAG AAAAAGCCGA AGCCGAAGGC AGAAACTGGA AGACAGACTT GTCAGGCGGC
ATAGTGCGCG TAGCAATCAA GCATGGCTGT ATTCCTTTCG GTAACGCTAA AGCCCGTTGT
ATTGTTTGGA CTTTCCCAGA CCAAGCGCCA GTTCACCGCG AACCGTTATA CACGCCACGC
CGTGACTTAG TGGCCAAATA CCCAACCTAT GCCGACATGC AAGTGCACCG TCTGCCTACG
CTGTATAAGT CAATCCAAGA CAAAGACATG AGCGGCAAGT ACCCACTGGT ACTGACCTCA
GGCCGTCTGG TTGAATACGA GGGTGGTGGT GAAGAATCTC GTTGTAACCC TTGGCTAGCT
GAACTTCAAC AGGAAATGTT TGTTGAGATA AGCCCTGCCG ATGCAGCTGA TCGCGGCATT
CGTAACGGTG AAGACGTCTG GTTAGAAGGC GCCGAAGGTG GCCGCATTAA GGTACAAGCT
ATGGTGACGC CACGCGTTAA ACCTGGCGTG ACCTTTATGC CATATCACTT TGCGGGTGTG
ATGCACGGTG AAAGTTTAGC GCCTAATTAT CCAGAGGGCA CAGTACCTTA CGTTATCGGT
GAATCCGCTA ACACTGCGCT GACCTATGGT TATGATCCTG TGACGCAAAT GCAGGAAACC
AAGGCATCAC TGTGTCAGAT CGTGAAAGCG TAA
 
Protein sequence
MKLTRKSSVA PVADKPALGI SRRQFMKHAG IATGGIAAAS LMGTGMMRRV EAKEVPHDAP 
IEVKRTICSA CAVGCGLYAE VQNGVWTGQE PAFDHPFNAG GHCAKGAALR EHGHGEKRLK
YPMKLVDGKW KKISWEDAIN EVGDQMLNIR KESGPDSVYF MGSAKFSNEG CYMYRKLAAM
WGTNNVDHSA RICHSTTVAG VANTWGYGAQ TNSFNDIQNA NAIFFIGANP AEAHPVAMQH
ILIAKERNNA KLIVVDPRFS RTAAHSDLHC ALRPGTDIPF IYGMLWHIFE NGWEDKTFIQ
QRVFEMDTIR AEAKKFPPKE VENITGVSEA EVYQAAKLMA DNRPGTVIWC MGGTQHHVGN
ANTRAYCILQ LALGNMGVSG GGTNIFRGHD NVQGATDLGL LFDNLPGYYG LTTGAWTHWT
HVWDLDMEWM KGRFDQGTYL GREPMTTPGI PCSRWHDGVL EEKGKLAQQD NIRMAFFWGQ
SVNTETRQRD VRDALDKMDT VVVVDPFPTM AGVMHRRKNG VYLLPAATQF ETEGSVSNSG
RSQQWREKVI EPLFESKTDI EIMYRLAQKL GFAEQYTKRI AKENDLPVIE DITREINRGM
WTIGMTGQSP ERLKLHTQNW GTFSNKTLEA AGGPAKGETY GLPWPCWGTP EAKHPGTQIL
YDQSKHVKDG GGNFRARYGV EYNGKNLLAE GSFSKGAEIQ DGYPEFSDKL LKQLGWWDDL
TAAEKAEAEG RNWKTDLSGG IVRVAIKHGC IPFGNAKARC IVWTFPDQAP VHREPLYTPR
RDLVAKYPTY ADMQVHRLPT LYKSIQDKDM SGKYPLVLTS GRLVEYEGGG EESRCNPWLA
ELQQEMFVEI SPADAADRGI RNGEDVWLEG AEGGRIKVQA MVTPRVKPGV TFMPYHFAGV
MHGESLAPNY PEGTVPYVIG ESANTALTYG YDPVTQMQET KASLCQIVKA