Gene Shewana3_2089 details

Gene Information

Locus tag: Shewana3_2089
Symbol:
ID: 4479497
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. ANA-3
Kingdom: Bacteria
Replicon accession: NC_008577
Strand:
Start bp: 2506781
End bp: 2509798
Gene Length: 3018 bp
Protein Length: 1005 aa
Translation table: 11
GC content: 50%
IMG OID: 639726674
Product: formate dehydrogenase alpha subunit
Protein accession: YP_869725
Protein GI: 117920533
COG category: [C] Energy production and conversion
COG ID: [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
            [TIGR01553] formate dehydrogenase, alpha subunit, proteobacterial-type
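
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a single terminal stop codon that is not counted in the protein length. Neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the gene record.
start_bp = 2506781
end_bp = 2509798
gene_length_bp = 3018
protein_length_aa = 1005

# Assumption: coordinates are 1-based and inclusive at both ends.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not
# counted in the reported protein length.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, codons, expected_protein_aa)  # 3018 1006 1005

assert span_bp == gene_length_bp
assert span_bp % 3 == 0
assert expected_protein_aa == protein_length_aa
```

Under these assumptions the span, the reported gene length, and the reported protein length all agree.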


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000081067
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAATAGAC GACAGTTTTT TAAACTCTGT GCTGTAGGAG CCGCGACTTC TACTATTTCT 
GCACTGGGGT TAATGTCCGA AAAGGCATTT GCATCCGTCA GAGGTTTTAA ATTGCTACGC
GCAAAAGAGA CGCGTAACAA TTGTTGCTAC TGCTCTGTGG GTTGTGGTTT GTTGATGTAC
AGCCAAAGCA GTAATGGGAA AAATGCGGAG CAGAGCATTT TTCATATCGA GGGCGATGCC
GATAATCCGA TTAACCGCGG CGCCCTGTGT CCAAAAGGGG CAGGACTGGT TGACTATGTG
AACAGTCCGC ACCGTTTAAA ATACCCCGAA GTGCGTTTAC CCGGTTCGGA TAAATGGCAG
CGCATTAGCT GGGATGAGGC CTTTAAGCGT ATCGCAAGGC TTATCAAAGA CGAGCGCGAT
GCCAATCTGG TTGAGAAAAA TGCTCAGGGT CAAACCGTTA ACCGCTTAGT CAGCCTAGGG
ATGATGACGT CATCGGCCCA GGCAAACGAA GGCTGCTACA TTACCCATAA ATTTGGCCGT
GCCATTGGTA TGTTAGGCAT AGATAACATC GCCCGTGTTT GCCACGCCCC AACACCGGCC
GCGATGGCGC CCACCTTTGG CCGTGGTGCT ATGACTAACC ACTGGGCGGA TATGAAAAAT
ACCGATCTGG CCATTGTGAT GGGCGGTAAC GCTGCCGAAG CGCATCCCGT CGGCTTTGGT
TGGGTGACAG AAGCGATGGA GCACAACAAC GCCAAGTTGA TTGTGGTCGA TCCGCGCTTT
AACCGAAGTG CTGCCGTCGC CGATTTATAT GCGCCTATTC GTTCGGGCAC CGATATTGCC
TTCCTGCTCG GCATGATCCG CTACCTGCTC GAAACCCAGC AAATCAACCT TAATTATGTC
AAAGCCTATA CCAACGCCAC GTTTATCGTG CGGGAAGACT TTGAATTTAA TGACGGTTTA
TTCAGTGGCT ACGATGAGGC TAACCATAAA TACGACCAAT CCACTTGGTT CTACGAGCTA
GATGAAGAGG GTTACGCAAA AGTTGACCCT AGCTTAAGCC ATCCTCGCTG CGTGATTAAC
CTGCTGAAAA AACACGTCGA TCGCTACGAT CCCTACACGG TTTCTAGCAT CACAGGTACG
CCTAAAGAAG CCTATCTTGA AGTGTGTCAG CAAATTGGGG CGACCCATGT TGACCATAAA
GCTGCCACCT TCCTGTATGC CCTCGGTTGG ACACAGCACA GCGTTGGCGC GCAAAACATC
CGTACCATGG CGATGATCCA ATTGCTGCTT GGCAATATGG GAATCATGGG CGGCGGCGTG
AATGCGCTGC GCGGTCACTC AAACGTACAG GGCGCGACGG ACTTAGGTTT ATTGTGCCAA
GGATTACCTG GCTACCTTAA ACTGCCACAG GATAGAGATG TTGATTTACA AAACTATTTA
GCCCATTACA CCCCTAAAGC CTTAAGACCA AACCAAACCA ACTATTGGCA CAATTACCCA
GCCTTTACCG TGTCGTTGTT AAAAGCCTTC TTCGGTGAAC ATGCAACGGC AGAGAATGAT
TATGGTTATA ACTGGCTGCC AAAATGGGAC CAACAGTACG ATATCAACAA GCAAATCGAC
ATGATGGTTC ACGGTGAGGT CAACGGATAC TTTATCCAGG GCATCAACGC GCTTAACTCC
CAGCCCGATA AGCAAAAAGT GTCTAAGGGC TTATCGAATC TTAAGTTCTT AGTAGTGCTC
GATGCGCTTG CGAACGAAAC CTCGAGTTTC TGGCGCAATG CGGGTCAATT TAACGATGTC
GATACCGCCA GTATTCAAAC CGAAGTCTTC CGCTTACCAA CAACCTGTTT TGCTGAGGAA
AGTGGTTCCA TTGCTAACTC GAGCCGCTGG TTACAATGGC ACTTTAAGGG CGCTAATCCT
CCCGGTGAAG CCTTATCTGA TCCTGCGATC CTTTCTGGCA TCATGCTGGA ATTAAAACGT
TTATACCGTG AAGAGGGCGG CCGTTTACCT GCGCCTATCG AAGCCATTAA ATGGGACTAT
GCGATTGAGC ATGAACCCAG TTCAGAGGAA ATCGCGCGGG AGATGAACGG TTATGATCTC
ACGACCGGTA AGCTGCTCAA TGGTTTCTCC GAATTAAAAT CCGATGGCTC AACCTCATGC
GGTATTTGGG TTTACTCAGG CATGTGGACT GAAGCGGGCA ACTTGATGGC GCGCCGCGAT
AATAGCGATC CATCCGGCAA AGGTATTACC CCGAATTGGT CATTTGCATG GCCTGCAAAC
CGCCGCATCT TGTACAACCG CGCATCCTGC GACGTGCAAG GTAAGCCGCG CGATCCAAGC
CGTGTGCTAC TCGAGTATAA GGACAACAAG TGGCAGGGTA TTGATGTGCC AGACTTTAAT
GCCAAATTGA ATGCCGAAGA ATCGGCCCAT CCTTTCATCA TGCAAGCTGA TGGCGTTGGC
CACTTATTTG CGCTGCGTGA CTTAAAAGAT GGCCCATTCC CAGAGCATTA CGAGCCGTTT
GAATCACCAC TGGCGAGTAA CCCGCTGCAT CCTAAGGTCA CCAATAACCC TGTGGCACGG
ATGTTCAAAG GCTTACGTGA AAGCTTTGGT ACCAATGAAG AATTCCCCTA TGTTGGCACC
ACTTACTCAA TGACGGAACA CTTCAACAAC TGGACCACGC ATTGCCACCT TGCTGCGATT
ACCCAGCCAC AGCACTTTAT CGAAATCGAT GAAACCTTGG CGGCGGAAAA GGGCATCAAT
AACGGTGATT GGGTCAAGGT GAGCTCTAAG CGCTCGCATA TTGTCACTAA GGCCTATGTC
ACTAAACGAC TCCAACCCAT GATGGTTCAG GGCAAAAAAG TTCACACCAT TGGTATTCCA
CGCCATGGCA GTTATGAGGC CTTGACGCAG AAGAGTTATA TCGTCAACGA GCTGACTTCA
TCTGTGGGCG ATGCCAATAC CCAAACCCCT GAATATAAAG CATTCCTTGT GAATATTGCC
AAAGCGGAGG GCTTCTAA
 
Protein sequence
MNRRQFFKLC AVGAATSTIS ALGLMSEKAF ASVRGFKLLR AKETRNNCCY CSVGCGLLMY 
SQSSNGKNAE QSIFHIEGDA DNPINRGALC PKGAGLVDYV NSPHRLKYPE VRLPGSDKWQ
RISWDEAFKR IARLIKDERD ANLVEKNAQG QTVNRLVSLG MMTSSAQANE GCYITHKFGR
AIGMLGIDNI ARVCHAPTPA AMAPTFGRGA MTNHWADMKN TDLAIVMGGN AAEAHPVGFG
WVTEAMEHNN AKLIVVDPRF NRSAAVADLY APIRSGTDIA FLLGMIRYLL ETQQINLNYV
KAYTNATFIV REDFEFNDGL FSGYDEANHK YDQSTWFYEL DEEGYAKVDP SLSHPRCVIN
LLKKHVDRYD PYTVSSITGT PKEAYLEVCQ QIGATHVDHK AATFLYALGW TQHSVGAQNI
RTMAMIQLLL GNMGIMGGGV NALRGHSNVQ GATDLGLLCQ GLPGYLKLPQ DRDVDLQNYL
AHYTPKALRP NQTNYWHNYP AFTVSLLKAF FGEHATAEND YGYNWLPKWD QQYDINKQID
MMVHGEVNGY FIQGINALNS QPDKQKVSKG LSNLKFLVVL DALANETSSF WRNAGQFNDV
DTASIQTEVF RLPTTCFAEE SGSIANSSRW LQWHFKGANP PGEALSDPAI LSGIMLELKR
LYREEGGRLP APIEAIKWDY AIEHEPSSEE IAREMNGYDL TTGKLLNGFS ELKSDGSTSC
GIWVYSGMWT EAGNLMARRD NSDPSGKGIT PNWSFAWPAN RRILYNRASC DVQGKPRDPS
RVLLEYKDNK WQGIDVPDFN AKLNAEESAH PFIMQADGVG HLFALRDLKD GPFPEHYEPF
ESPLASNPLH PKVTNNPVAR MFKGLRESFG TNEEFPYVGT TYSMTEHFNN WTTHCHLAAI
TQPQHFIEID ETLAAEKGIN NGDWVKVSSK RSHIVTKAYV TKRLQPMMVQ GKKVHTIGIP
RHGSYEALTQ KSYIVNELTS SVGDANTQTP EYKAFLVNIA KAEGF
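
Both sequences above are printed in space-separated blocks of 10 residues, so the spacing has to be stripped before any analysis. The following is a minimal sketch, not the database's own tooling, of how the blocks could be joined, the GC fraction computed, and the DNA translated. The helper names (`strip_blocks`, `gc_fraction`, `translate`) are hypothetical. The codon table used is the standard one, which as far as codon-to-amino-acid assignments go is the same as translation table 11 (the two differ in permitted start codons). The sketch does not model selenocysteine recoding of TGA.

```python
from itertools import product

# Standard codon table in TCAG order; table 11 uses the same
# codon-to-amino-acid assignments ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def strip_blocks(text: str) -> str:
    """Remove the block spacing and line breaks from a sequence dump."""
    return "".join(text.split()).upper()

def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a DNA sequence."""
    seq = strip_blocks(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)

def translate(seq: str) -> str:
    """Translate complete codons in frame 1; stops appear as '*'."""
    seq = strip_blocks(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))

# First and last rows of the gene sequence, copied from the record.
first_row = "ATGAATAGAC GACAGTTTTT TAAACTCTGT GCTGTAGGAG CCGCGACTTC TACTATTTCT"
last_row = "AAAGCGGAGG GCTTCTAA"

print(translate(first_row))  # MNRRQFFKLCAVGAATSTIS
print(translate(last_row))   # KAEGF*
```

The two translated rows match the first 20 and the last 5 residues of the protein sequence listed above. The `last_row` translation stays in frame only because every preceding row holds 60 bases, a multiple of three.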