Gene Shewana3_3916 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewana3_3916 
Symbol 
ID4480130 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. ANA-3 
KingdomBacteria 
Replicon accessionNC_008577 
Strand
Start bp4703971 
End bp4706823 
Gene Length2853 bp 
Protein Length950 aa 
Translation table11 
GC content50% 
IMG OID639728532 
Productformate dehydrogenase alpha subunit 
Protein accessionYP_871540 
Protein GI117922348 
COG category[C] Energy production and conversion
[R] General function prediction only 
COG ID[COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing
[COG3383] Uncharacterized anaerobic dehydrogenase 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.733885 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGATTAA CCCGCAAAAC AGACCAAGTC GTCGAGCCAA AAGTGCCCGC CCTCGGTCTT 
AATCGTCGCC AATTCTTAAA ATCTGCAGGT CTTGCCACTG GTGGTATCGC CGCCGCGTCT
ATGCTTGGCA CAGGTATGAT GCGTAAAGCA CAAGCGCAGG AACATATCCC CCATAATGCA
CCAACTGAAG TCAAACGTAC CATTTGCTCT CACTGCGCAG TGGGTTGTGG TATCTATGCT
GAAGTGCAAA ACGGTGTGTG GACAGGTCAA GAACCCGCGT TCGATCATCC ATTTAACCAA
GGCGGCCACT GCGCGAAAGG GGCTGCACTG CGTGAACACG GCCACGGTGA AAAACGCCTG
AAATACCCAA TGAAGTTAGA AGGCGGTAAG TGGAAGAAGA TCTCTTGGGA ACAAGCGATC
AATGAAGTAG GCGATAAAAT GACGGCGATT CGTCAAGAAT CGGGTCCAGA CTCTATCTAC
TTTATGGGTA GTGCTAAGTT CTCTAACGAA CAGGCTTATT TATATCGCAA ACTCGCGGCA
CTGTGGGGCA CAAACAACGT CGACCACTCA GCCCGTATTT GTCACTCTAC CACGGTAGCC
GGTGTTGCTA ACACTTGGGG CTACGGTGCG CAAACCAACT CGTTAAACGA TATCCGCAAC
TCTAAGTGCA TCATGTTCGT GGGTTCAAAC CCAAGTGAAG CACACCCTGT CGCCATGCAA
CACATTCTGG TGGCAAAAGA GCGCGGCGCT AAGATTATCG TTGTTGACCC ACGTTTCACC
CGTACTGCAG CTAAGTCTGA CGAGTACGTG CATATCCGCC CAGGTACCGA TATCCCCTTC
ATCTATGGTC TGTTATGGCA CATTTTTGAA AACGGCTGGG AAGATAAAGA GTTCATCAAT
CAACGTGTTT ACGGCATGGA ACGTATTCGC GATGAAGTGA AAAAATATAC CCCCGAAGAA
GTCGAAAACG TTGCAGGCGT GCCTAAGGCG CAAATGTACC GTATCGCTAA AATGTTAGCC
GAGACCAAAC CAGGCACTAT CGTATGGTGT ATGGGCGGTA CTCAGCACCA CGTCGGTAAT
GCCAACACCC GTTCATACTG TATTTTACAG TTAGCGCTGG GCAACATGGG CGTATCTGGC
GGCGGTACCA ACATTTTCCG TGGTCACGAT AACGTGCAAG GTGCAACAGA CTTTGGTCTG
TTATTCGACA ACTTACCCGG TTACTACGGC TTAACATCAG GCGCTTGGGC TCACTGGTCT
GGCGTTTGGG ACTTAGATCC AAAATGGGTT GCAGGCCGTT TCGACCAAGG TGAATACTTA
GGTCAAACGC CACAAACCTC AACGGGTATC CCCTGCTCTC GCTGGCACGA CGGTGTGTTA
GAAGATAAAA CCAAGATCGC GCAGAAGGAC AATATCCGTC TGGCGTTCTT CTGGGGTCAA
TCAGTTAACA CCGAAACCCG TGGCCGTGAA GTGCGTGAAG CGCTGAACAA GTTAGATACT
GTGGTCGTTG TCGACCCAAT CCCAACCATG GCAGGTGTGA TGCACCAGCG TAAAGATGGG
GTATATCTGC TCCCAGCGGC GACCCAATTT GAAACCTACG GCTCAGTGTC TGCCACTAAC
CGTTCGATTC AATGGCGCTC TAAAGTCATC GAGCCACTGT TTGAGTCTCT GCCTGACCAC
GTGATCATGT ACAAACTGGC GAAAAAGCTG GGTATCGAAA AAGAATTCTG TAAGCACATC
CAAGTGAATG GCGAAGAGCC ATTGATTGAA GATGTGACTC GCGAGTTCAA CAAAGGTATG
TGGACCGTCG GTTACACAGG TCAAAGCCCA GAACGTTTGA AAATGCACCA AGAAAACTGG
GGCACCTTCG ATGTAAACAG CCTGACCGCA CCGGGCGGCC CAGCTAAAGG TGAAGTTTAC
GGCTTACCTT GGCCATGTTG GGGTACGCCA GAAATGAAAC ACCCTGGTAC CCAAATTCTT
TACGATCAAT CTAAAGAAGT GAAAGACGGC GGCGGTACTT TCCGTGCCCG TTACGGTGTT
GAACACAATG GCGTTAGCAT TCTTGCCGAC GGTTCATTCT CTAAAGGCAG TGAAATTCAA
GATGGTTATC CTGAGTTTAC CGCCGACATG CTCAAGCAAT TGGGTTGGTG GGATGACTTA
ACCGAAGACG AGAAGAAACT GGCCGAAGGC AAAAACTGGA AGACAGACCT GTCTGGCGGT
ATCCAACGTG TTGCTATCAA GCACGGCTGT ATTCCTTTCG GTAACGCGAA AGCACGTTGT
ATCGTGTGGA CTTTCCCAGA TGATATCCCA CTGCACCGCG AGCCACTCTA CACTCCTCGT
CGTGACTTAG TCGCTAAATA CCCAACCTAC GAAGACCGTA TGGTTGCGCG TCTACCGACT
CTGTATAAGT CAATTCAGGA TAAGGACTTT ACCCAAGGCT TCCCACTGAC ACTGACCTCT
GGTCGTTTAG TGGAATACGA AGGTGGTGGT GAAGAATCTC GTTCTAACCC TTGGTTGGCC
GAGCTACAAC AAGAAATGTT TATCGAAATG AACCCGGCAG ACGCTGCTGA CCGTGGTATC
CGTGACGGTG ACAATGTCTT TGTTCATAGT CCTGAAGGCG CCAAGATCAC AGTGAAGGCA
ATGGTGACAC CACGGGTTGT TCCGGGTGAA TGTTTTATGC CATACCACTT CGCCGGTATC
TTCGAAGGTG AAAACCTCGC GAAGAATTAC CCAGAAGGTA CAGTACCTTA TGTACAAGGT
GAATCGGCAA ACACCATTTT AACTTACGGC TATGACGTTG TGACTCAGAT GCAAGAAACT
AAGTCCAGCC TTTGCCAAGT TAGCAAAGCC TAA
 
Protein sequence
MRLTRKTDQV VEPKVPALGL NRRQFLKSAG LATGGIAAAS MLGTGMMRKA QAQEHIPHNA 
PTEVKRTICS HCAVGCGIYA EVQNGVWTGQ EPAFDHPFNQ GGHCAKGAAL REHGHGEKRL
KYPMKLEGGK WKKISWEQAI NEVGDKMTAI RQESGPDSIY FMGSAKFSNE QAYLYRKLAA
LWGTNNVDHS ARICHSTTVA GVANTWGYGA QTNSLNDIRN SKCIMFVGSN PSEAHPVAMQ
HILVAKERGA KIIVVDPRFT RTAAKSDEYV HIRPGTDIPF IYGLLWHIFE NGWEDKEFIN
QRVYGMERIR DEVKKYTPEE VENVAGVPKA QMYRIAKMLA ETKPGTIVWC MGGTQHHVGN
ANTRSYCILQ LALGNMGVSG GGTNIFRGHD NVQGATDFGL LFDNLPGYYG LTSGAWAHWS
GVWDLDPKWV AGRFDQGEYL GQTPQTSTGI PCSRWHDGVL EDKTKIAQKD NIRLAFFWGQ
SVNTETRGRE VREALNKLDT VVVVDPIPTM AGVMHQRKDG VYLLPAATQF ETYGSVSATN
RSIQWRSKVI EPLFESLPDH VIMYKLAKKL GIEKEFCKHI QVNGEEPLIE DVTREFNKGM
WTVGYTGQSP ERLKMHQENW GTFDVNSLTA PGGPAKGEVY GLPWPCWGTP EMKHPGTQIL
YDQSKEVKDG GGTFRARYGV EHNGVSILAD GSFSKGSEIQ DGYPEFTADM LKQLGWWDDL
TEDEKKLAEG KNWKTDLSGG IQRVAIKHGC IPFGNAKARC IVWTFPDDIP LHREPLYTPR
RDLVAKYPTY EDRMVARLPT LYKSIQDKDF TQGFPLTLTS GRLVEYEGGG EESRSNPWLA
ELQQEMFIEM NPADAADRGI RDGDNVFVHS PEGAKITVKA MVTPRVVPGE CFMPYHFAGI
FEGENLAKNY PEGTVPYVQG ESANTILTYG YDVVTQMQET KSSLCQVSKA