Gene Shewana3_3920 details


Gene Information

Locus tag: Shewana3_3920
Symbol:
ID: 4480134
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. ANA-3
Kingdom: Bacteria
Replicon accession: NC_008577
Strand:
Start bp: 4709123
End bp: 4711975
Gene Length: 2853 bp
Protein Length: 950 aa
Translation table: 11
GC content: 51%
IMG OID: 639728536
Product: formate dehydrogenase alpha subunit
Protein accession: YP_871544
Protein GI: 117922352
COG category: [C] Energy production and conversion
              [R] General function prediction only
COG ID: [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing
        [COG3383] Uncharacterized anaerobic dehydrogenase
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
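
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length; neither assumption is stated by the record itself. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information fields of this record.
length = gene_length_bp(4709123, 4711975)
print(length)                     # 2853
print(protein_length_aa(length))  # 950
```

Under these assumptions the computed values agree with the Gene Length (2853 bp) and Protein Length (950 aa) fields.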


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.922429
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGTTAA CTCGCAAGTC CGATGTCGCC CAAGTGGCCG ACAAACCGAC GTTAGGCATT 
AGCCGTCGTC AATTTATGAA GCAAGCAGGT ATTACTACTG GTGGTATCGC AGCCGCTTCT
CTGATGGGTA CAGGCATGAT GCGCCGCGCA GAAGCCAAAG ATGTGCCACA CGACGCGCCG
ATCGAAGTTA AACGTACGAT TTGTAGTGCC TGTGCTGTGG GTTGTGGTCT GTATGCCGAA
GTGCAAAATG GCGTGTGGAC AGGTCAAGAA CCCGCGTTCG ATCACCCATT CAATGCCGGC
GGTCACTGCG CGAAAGGTGC TGCACTGCGT GAGCATGGCC ACGGTGAAAA ACGCCTGAAA
TACCCAATGA AATTGGTTGA TGGCAAGTGG AAAAAAATCT CTTGGGAAGA TGCGATTAAC
GAAGTGGGCG ACCAAATGCT CAACATTCGT AAAGAATCTG GCCCAGACTC TGTGTACTTC
ATGGGTAGCG CTAAGTTCTC TAACGAAGGC TGCTATATGT ACCGCAAACT GGCGGCCATG
TGGGGCACAA ACAACGTCGA CCACTCTGCT CGTATTTGTC ACTCTACCAC GGTAGCCGGT
GTTGCTAACA CTTGGGGTTA CGGTGCGCAA ACTAACTCTT TCAACGACAT TCAGAATGCC
AATGCCATCT TCCTGATCGG GGCAAACCCA GCAGAAGCGC ATCCGGTTTC AATGCAACAC
ATTCTGATCG CCAAAGAGAA AAACAACGCG AAAATCATCG TTGTTGACCC ACGTTTCTCT
CGCACAGCGG CGCACTCAGA TCTGCACTGT GCAATTCGTC CTGGTACTGA TATTCCGTTT
ATCTACGGTA TGTTATGGCA CATTTTCGAA AACGGTTGGG AAGATAAAAC CTTTATCCAA
CAACGCGTAT TCGAGATGGA AACCATCCGC GAAGAAGTGA AAAAGTTCCC ACCTAAAGAA
GTGGCGGATA TCACTGGCGT GAGCGAAGAA GTGGTTTATC AAGCCGCAAA ACTGATGGCA
GAAAACCGTC CGGGTACTGT GATTTGGTGT ATGGGTGGTA CTCAGCACCA CGTCGGTAAC
GCTAACACTC GTGCTTACTG TATTCTGCAA TTAGCCTTAG GCAACATGGG CGTTTCTGGC
GGCGGCACTA ACATTTTCCG TGGCCACGAC AACGTACAGG GCGCGACTGA CTTAGGTCTG
CTGTTCGACA ACCTGCCTGG CTACTACGGT TTAACCTCAG CCGCTTGGAC TCACTGGACC
CATGTGTGGG ATCTGGATAT GGAGTGGGTG AAGAGCCGCT TCGATCAAAA CGCCTACTTA
GGCAAAGATC CAATGACCAC CCCAGGTATT CCTTGTTCTC GCTGGCACGA TGGCGTGTTA
GAAGATAAGA GCAAGCTGGC ACAGAAAGAC AATATCCGTA TGGCCTTCTT CTGGGGTCAA
TCGGTCAACA CTGAAACCCG TCAACGTGAA GTGCGTGATG CCTTAGACAA GATGGACACA
GTCGTGGTTG TCGACCCATT CCCAACCATG GCGGGTGTTA TGCACCGTCG TAAGAATGGC
GTGTATCTGT TACCTGCTGC GACTCAGTTT GAAACTCAAG GTTCAGTGTC TAACTCAGGC
CGTTCTATCC AATGGCGTGA GCAGGTTATC CAACCTTTAT TCGAGTCAAA AACCGACATC
GAAATCATGT ACCGTTTAGC GCAAAAACTC GGTATCGCCG AGCAATACAC TAAACGCATC
GCCAAAGAAA ACGGCTTACC GGTTATCGAA GAAATCACCC GCGAAATCAA CCGCGGCATG
TGGACCATCG GTATGACAGG TCAAAGCCCT GAGCGTATCA AGCTGCACAC CCAAAACTGG
GGCACTTTCA GCAACAAGAC GCTCGAAGCC GCCGGCGGCC CAGCTAAAGG CGAAACCTAC
GGTTTACCTT GGCCATGTTG GGGCACACCA GAAGCTAAAC ACCCTGGTAC CCAAATTCTG
TATAACCAAT CTAAACACGT TAAAGACGGC GGCGGTAACT TCCGTGCTCG TTACGGCGTT
GAATACAATG GTAAAAACCT GCTGGCTGAA GGCTCTTTCT CTAAAGGTGC CGAGATCCAA
GACGGTTACC CAGAATTTAC CGATAAGCTG CTGAAGCAAC TCGGTTGGTG GGATGACCTG
ACTGCGGAAG AAAAAGCCGA AGCCGAAGGT CGTAACTGGA AGACCGACTT ATCAGGCGGT
ATCGTGCGCG TGGCAATCAA GCATGGTTGT ATTCCATTTG GTAACGCTAA AGCCCGTTGT
ATTGTTTGGA CTTTCCCAGA CCAAGTGCCA GTTCACCGCG AGCCGTTATA CACAGCACGC
CGTGACTTAG TGGCTAAATA CCCAACCTAC GACGATATGC AAGTTCATCG TCTGCCAACA
CTGTACAAGT CAATCCAAGA GAAAGACTTC AGTGGCAAGT ACCCACTGGT ACTGACCTCT
GGTCGTTTAG TGGAATACGA AGGTGGTGGT GAAGAATCTC GTTCTAACCC ATGGCTGGCT
GAGCTTCAAC AGGAAATGTT TGTTGAAATC AACCCAGGCG ACGCAGCCGA CCGCGGTATC
CGCAACGGTG AGTTTGTGTG GTTAGAGGGC GCCGAAGGTG GCCGCATTAA AGTACAAGCT
CTGGTAACGC CACGCGTTAA ACCAGGTGTG ACCTTTATGC CATACCACTT TGCGGGTGTG
ATGCACGGTG AAAGCTTAGC GCCTAACTAC CCTGAGGGCA CTGTGCCTTA CGTTATCGGT
GAATCCGCTA ACACGGCACT GACCTATGGT TATGACCCTG TGACTCAAAT GCAGGAAACC
AAAGCGTCGC TCTGTCAGAT CGTTAAAGCG TAA
 
Protein sequence
MKLTRKSDVA QVADKPTLGI SRRQFMKQAG ITTGGIAAAS LMGTGMMRRA EAKDVPHDAP 
IEVKRTICSA CAVGCGLYAE VQNGVWTGQE PAFDHPFNAG GHCAKGAALR EHGHGEKRLK
YPMKLVDGKW KKISWEDAIN EVGDQMLNIR KESGPDSVYF MGSAKFSNEG CYMYRKLAAM
WGTNNVDHSA RICHSTTVAG VANTWGYGAQ TNSFNDIQNA NAIFLIGANP AEAHPVSMQH
ILIAKEKNNA KIIVVDPRFS RTAAHSDLHC AIRPGTDIPF IYGMLWHIFE NGWEDKTFIQ
QRVFEMETIR EEVKKFPPKE VADITGVSEE VVYQAAKLMA ENRPGTVIWC MGGTQHHVGN
ANTRAYCILQ LALGNMGVSG GGTNIFRGHD NVQGATDLGL LFDNLPGYYG LTSAAWTHWT
HVWDLDMEWV KSRFDQNAYL GKDPMTTPGI PCSRWHDGVL EDKSKLAQKD NIRMAFFWGQ
SVNTETRQRE VRDALDKMDT VVVVDPFPTM AGVMHRRKNG VYLLPAATQF ETQGSVSNSG
RSIQWREQVI QPLFESKTDI EIMYRLAQKL GIAEQYTKRI AKENGLPVIE EITREINRGM
WTIGMTGQSP ERIKLHTQNW GTFSNKTLEA AGGPAKGETY GLPWPCWGTP EAKHPGTQIL
YNQSKHVKDG GGNFRARYGV EYNGKNLLAE GSFSKGAEIQ DGYPEFTDKL LKQLGWWDDL
TAEEKAEAEG RNWKTDLSGG IVRVAIKHGC IPFGNAKARC IVWTFPDQVP VHREPLYTAR
RDLVAKYPTY DDMQVHRLPT LYKSIQEKDF SGKYPLVLTS GRLVEYEGGG EESRSNPWLA
ELQQEMFVEI NPGDAADRGI RNGEFVWLEG AEGGRIKVQA LVTPRVKPGV TFMPYHFAGV
MHGESLAPNY PEGTVPYVIG ESANTALTYG YDPVTQMQET KASLCQIVKA
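
The protein sequence corresponds to the gene sequence translated with translation table 11. The following is a minimal sketch of that translation, not the database's own pipeline. It assumes the sequence is read in frame from its first base, skips the 10-base grouping spaces, and stops at the first stop codon. Table 11 assigns the same amino acids to codons as the standard code (it differs in the permitted start codons), so the standard amino acid string is used. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in NCBI order (T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop grouping spaces and newlines
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 21 bases and the final line of the gene sequence above.
print(translate("ATGAAGTTAA CTCGCAAGTC C"))              # MKLTRKS
print(translate("AAAGCGTCGC TCTGTCAGAT CGTTAAAGCG TAA"))  # KASLCQIVKA
```

Both fragments reproduce the matching start and end of the protein sequence listed above.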