Gene Shewana3_3379 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewana3_3379 
Symbol 
ID4478512 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. ANA-3 
KingdomBacteria 
Replicon accessionNC_008577 
Strand
Start bp4050318 
End bp4051358 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content51% 
IMG OID639727988 
Productaldo/keto reductase 
Protein accessionYP_871008 
Protein GI117921816 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAATACA GACGCATACC GCATTCTAAT CTCGAGGTGA GCAAAATCTG TTTAGGCACT 
ATGACTTGGG GTGAACAAAA TACCCAAGCC GAAGCCTTCG CACAGCTAGA CTACGCCATC
GGCAGTGGCA TCAACTTTAT TGACACTGCG GAAATGTACC CTGTGCCGCC AAAGCCAGAA
ACCCAAGGGG AAACCGAGCG CATTTTGGGC CAATACATTA AGGCCCGTGG CAACCGTGAT
GACCTAGTGA TCGCCACTAA GATTGCCGCA CCCGGCGGCA AGAGTGACTA TATTCGCAAA
AATATGGCGC TGGACTGGAA CAATATCCAT CAAGCGGTCG ATGCTTCACT CGAACGCCTG
CAAATCGATA CTATCGATCT CTACCAAGTG CATTGGCCAG ACCGCAATAC CAACTTCTTC
GGGGAATTAT TTTACGACGA ACAAGAGATT GAGCAGCAAA CGCCAATCCT CGAGACCCTC
GAAGCCCTCG CCGAAGTGAT TCGCCAAGGT AAAGTGCGCT ATATCGGCGT ATCGAACGAA
ACCCCTTGGG GACTAATGAA GTATCTGCAA CTGGCGGAAA AACACGGCCT GCCGCGCATT
GTGACTGTGC AAAACCCCTA TAACCTGCTC AACCGCAGCT TTGAAGTGGG CATGAGTGAA
ATCAGCCATC GCGAAGAGTT GCCACTGCTG GCTTACTCGC CCTTGGCCTT TGGTGCCTTA
AGCGGTAAAT ATTGCAATAA CCAATGGCCA GAAGGCGCGC GCTTAACCCT GTTTAAACGC
TTCGCCCGTT ACACGGGTTC GCAAATGGCG CTCGATGCCA CCGCAGCTTA TGTAGACTTA
GCCCGCGAGT TTAATCTCTC CCCCGCGCAA ATGGCGTTAG CCTTTGTTAA CTCACGTAAA
TTTGTTGGCT CAAATATCAT TGGCGCCACG GACTTATACC AGCTGAAAGA GAATATCGAC
AGCTTAAAGG TCAGCCTCTC CCCCGAGTTA CTCAGCCGTC TCAATGCACT CTCAGATCAA
TTTAGATTGC CCTGCCCTTA G
 
Protein sequence
MEYRRIPHSN LEVSKICLGT MTWGEQNTQA EAFAQLDYAI GSGINFIDTA EMYPVPPKPE 
TQGETERILG QYIKARGNRD DLVIATKIAA PGGKSDYIRK NMALDWNNIH QAVDASLERL
QIDTIDLYQV HWPDRNTNFF GELFYDEQEI EQQTPILETL EALAEVIRQG KVRYIGVSNE
TPWGLMKYLQ LAEKHGLPRI VTVQNPYNLL NRSFEVGMSE ISHREELPLL AYSPLAFGAL
SGKYCNNQWP EGARLTLFKR FARYTGSQMA LDATAAYVDL AREFNLSPAQ MALAFVNSRK
FVGSNIIGAT DLYQLKENID SLKVSLSPEL LSRLNALSDQ FRLPCP