Gene Sama_3609 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSama_3609 
Symbol 
ID4605856 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella amazonensis SB2B 
KingdomBacteria 
Replicon accessionNC_008700 
Strand
Start bp4246748 
End bp4249600 
Gene Length2853 bp 
Protein Length950 aa 
Translation table11 
GC content58% 
IMG OID639783030 
Productformate dehydrogenase, alpha subunit 
Protein accessionYP_929481 
Protein GI119776741 
COG category[C] Energy production and conversion
[R] General function prediction only 
COG ID[COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing
[COG3383] Uncharacterized anaerobic dehydrogenase 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTTA CCCGTACCTC AGACGTGGCC GCCAAGGTCG AGGCCAAGCC ACTGGGCATG 
AGCCGCCGTC AATTTATGAA AACGGCCGGT ATTGCCACCG GCGGCATTGC TGCAGCCTCC
ATGCTTGGCA CTGGTATGAT GCGCCGCGCC GAGGCCAAAG ACGTACCCCA TGATGCGCCG
ATTGAAATCA AGCGCACCAT TTGCAGTGCC TGCGCCGTGG GTTGTGGCCT GTATGCCGAA
GTGCAAAATG GCGTCTGGAC CGGCCAGGAG CCGGCATTCG ATCACCCCTT CAACGCCGGT
GGTCACTGTG CCAAGGGTGC TGCGCTGCGT GAGCACGGCC ATGGTGAAAA ACGCCTCAAG
TATCCGATGA AGCTTGAAGG CGGTAAGTGG AAGCGTATCA GCTGGGAGCA GGCGATTAAC
GAAGTGGGCG ACAAGATGCT CAACATTCGT CAGGAGTCAG GCCCGGATTC TGTGTACTTC
ATGGGCTCGG CCAAGTTCTC CAACGAAGGC TGCTATATGT ACCGCAAGCT CGCGGCCATG
TGGGGTACCA ACAACGTCGA CCACTCGGCC CGTATTTGTC ACTCTACCAC TGTAGCCGGT
GTGGCCAACA CCTGGGGCTA TGGTGCGCAA ACCAACTCTT TCAACGACAT TCAGAATGCC
CGCGCCATCT TTTTTATCGG CGCCAACCCT GCCGAGGCAC ACCCTGTGTC CATGCAGCAC
ATTCTGACTG CCAAAGAGCG CAACAACGCC AAGATAATCG TGGTCGACCC GCGCTTCTCC
CGCACAGCGG CCCATGCCGA CCTGCACGTG GGCATTCGCC CCGGCACTGA TATTCCGTTT
ATTTACGGCA TGTTGTGGCA CATTTTTGAA AACGGCTGGG AAGACAAAAC CTTTATCGAC
CAGCGCGTAT TTGGTATGGA CAAGATTCGC GAAGAAGCCA AAAAATTTCC GCCAAAAGAA
GTGGCCGATA TCACGGGTGT CTCTGAAGAG GCCATTTATC AGGCTGCCAA ACTGATGGCC
GATAACCGCC CCGGCACCGT GGTGTGGTGT ATGGGCGGTA CCCAGCACCA TGTGGGTAAC
GCAAATACCC GTGCCTACAG TATTCTGCAG CTGGCGCTGG GTAATATGGG CGTATCCGGT
GGTGGTACCA ACATCTTCCG AGGTCACGAC AACGTACAGG GCGCCACCGA CCTGGGCCTG
CTGTTTGACA ACCTGCCCGG CTACTACGGC CTGACCAGCG CCGCCTGGCA GCATTGGACT
CACGTGTGGG ATCTGGATCT GGAATGGGTC AAGGGGCGTT TCGACCACGG CACCTACCTG
GGCCGCGAGC CCATGACCAC CCCCGGTATT CCTTGCTCCC GCTGGCACGA TGGTGTGCTG
GAAGACAAGG CCAAGCTGGC GCAGAAAGAC AATATCCGTC TGGCGTTTTT CTGGGGGCAG
TCGGTTAACA CCGAAACCCG TCAGCGCGAA GTCCGTGATG CACTGGACAA GATGGACACA
GTGGTCGTGG TTGACCCCTT CCCAACCATG GCCGGTGTTA TGCACCGCCG TAAAGACGGC
GTTTACCTGC TGCCTGCAGC TACCCAGTTT GAGACCCGTG GCTCCATCTC CAACTCGGGC
CGCTCTATCC AGTGGCGTGA ACAGGTTATC GAGCCGCTGT TTGAGTCCAA GACCGACATC
GAAATCATGT ACCGTCTGGC GGAAAAGCTG GGTATTGCCG AGCAATACAC CAAGCGCATC
AGCAAAGAAA ATGGTGTGCC GCTGATTGAA GACATCACCC GCGAAATCAA CCGCGGCATG
TGGACCATCG GCATGACAGG TCAGAGCCCT GAGCGTATCA AGGCCCACAC CATGAACTGG
GGCACCTTCT CGCAGAAGAG CCTCGAAGCC GAAGGTGGCC CATGTAAGGG CGAAACCTAC
GGTCTGCCAT GGCCATGTTG GGGCACGCCT GAGATGAAAC ACCCGGGCAC CCAGATTCTC
TATAACACAA GCAAACATGT GAAAGACGGC GGCGGCAACT TCCGTGCCCG TTATGGCGTT
GAGTATCAGG GCCAAAATCT GCTGGCCGAG GGCTCCTTCT CCAAGGGCGC CGAGATTGAA
GACGGCTATC CCGAATTTAC CGCCGATATG CTCAAGCAAT TGGGCTGGTG GGACGACCTG
ACCGCGGCCG AAAAAGCCGA GGCCGAAGGC AAAAACTGGA AGACGGATCT GTCCGGTGGC
ATCGTGCGCG TTGCCATCAA GCACGGCTGT ATTCCTTTCG GTAACGCCCG TGCCCGCTGC
CTGGTGTGGA CCTTCCCCGA TGCTGTGCCT GTGCACCGTG AACCGCTGTA TACAGCCCGT
CGCGATCTGG TGGCCAAGTA CCCCACCTAC GACGACATGC AGGTGCACCG TTTGCCTACC
CTGTACAAAT CGATTCAGGA AAAAGACTTC AGCGGCAGCT ACCCGCTGGT GCTCACCTCC
GGTCGTTTGG TGGAGTACGA GGGTGGCGGC GAAGAGTCCC GCTCCAACCC CTGGCTGGCC
GAGCTGCAGC AGGAGATGTT TGTGGAAATC AATCCGGGCG ACGCTGCCGA CCGGGGTATT
CGCAACAATG ACAATGTCTG GCTGGAAGGC CCAGAGGGTG GCCGCATCCT CATCAAGGCA
CTGGTGACAC CCAGGGTAAA ACCCGGCGTG ACCTTTATGC CTTACCACTT CGCCGGTGTG
ATGCATGGCG AGAGTCTGGC CCCCAACTAT CCGGAAGGCA CCGTGCCCTA CGTCATCGGT
GAATCCGCCA ACACGGCGCT GACCTATGGC TACGACCCTG TGACCCAAAT GCAGGAAACC
AAGTCGAGCC TGTGTCAGAT AGTAAAAGCC TGA
 
Protein sequence
MKLTRTSDVA AKVEAKPLGM SRRQFMKTAG IATGGIAAAS MLGTGMMRRA EAKDVPHDAP 
IEIKRTICSA CAVGCGLYAE VQNGVWTGQE PAFDHPFNAG GHCAKGAALR EHGHGEKRLK
YPMKLEGGKW KRISWEQAIN EVGDKMLNIR QESGPDSVYF MGSAKFSNEG CYMYRKLAAM
WGTNNVDHSA RICHSTTVAG VANTWGYGAQ TNSFNDIQNA RAIFFIGANP AEAHPVSMQH
ILTAKERNNA KIIVVDPRFS RTAAHADLHV GIRPGTDIPF IYGMLWHIFE NGWEDKTFID
QRVFGMDKIR EEAKKFPPKE VADITGVSEE AIYQAAKLMA DNRPGTVVWC MGGTQHHVGN
ANTRAYSILQ LALGNMGVSG GGTNIFRGHD NVQGATDLGL LFDNLPGYYG LTSAAWQHWT
HVWDLDLEWV KGRFDHGTYL GREPMTTPGI PCSRWHDGVL EDKAKLAQKD NIRLAFFWGQ
SVNTETRQRE VRDALDKMDT VVVVDPFPTM AGVMHRRKDG VYLLPAATQF ETRGSISNSG
RSIQWREQVI EPLFESKTDI EIMYRLAEKL GIAEQYTKRI SKENGVPLIE DITREINRGM
WTIGMTGQSP ERIKAHTMNW GTFSQKSLEA EGGPCKGETY GLPWPCWGTP EMKHPGTQIL
YNTSKHVKDG GGNFRARYGV EYQGQNLLAE GSFSKGAEIE DGYPEFTADM LKQLGWWDDL
TAAEKAEAEG KNWKTDLSGG IVRVAIKHGC IPFGNARARC LVWTFPDAVP VHREPLYTAR
RDLVAKYPTY DDMQVHRLPT LYKSIQEKDF SGSYPLVLTS GRLVEYEGGG EESRSNPWLA
ELQQEMFVEI NPGDAADRGI RNNDNVWLEG PEGGRILIKA LVTPRVKPGV TFMPYHFAGV
MHGESLAPNY PEGTVPYVIG ESANTALTYG YDPVTQMQET KSSLCQIVKA