Gene Shewana3_2070 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewana3_2070 
Symbol 
ID4476316 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. ANA-3 
KingdomBacteria 
Replicon accessionNC_008577 
Strand
Start bp2477453 
End bp2479213 
Gene Length1761 bp 
Protein Length586 aa 
Translation table11 
GC content50% 
IMG OID639726655 
Productdihydroxy-acid dehydratase 
Protein accessionYP_869706 
Protein GI117920514 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000109007 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000050577 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTGCCTTG GAAGACGAAG GTGTCATATG AATAATAAAA AACCGAAAAC ACTTCGTTCG 
GCTAGTTGGT TTGGTAGTGA TGACAAAAAT GGCTTTATGT ATCGCAGTTG GATGAAAAAC
CAAGGCATAC CCGAGCATCA CTTTCAAAAT AAGCCTGTGA TTGGTATTTG CAATACTTGG
TCAGAATTGA CGCCCTGTAA TGGTCATCTA CGGGAATTGG CGCAAAGAGT AAAGAATGGC
ATTCGGGAAG CGGGTGGCAT TCCGGTGGAG TTTCCGGTGT TTTCGAATGG TGAGTCCAAC
TTGCGTCCAA GTGCCATGCT GACCCGTAAC CTTGCGGCCA TGGACACGGA AGAAGCCATT
CGTGGCAACC CCATCGACGG TGTTGTGCTG TTAGTAGGCT GCGATAAAAC GACTCCGGCT
TTATTAATGG GCGCGGCCAG TTGTGATTTA CCGACAATCG TTGTTACTGG TGGTCCCATG
CTCAATGGTA AGCATAAGGG TAAGGATGTT GGTTCGGGCA CACTCGTGTG GGAACTGCAT
CAAGAATATA AAGCGGGCAA TATCAGCCTT GCCGCATTTA TGAATGCCGA AGCGGATATG
TCACGCTCAA CGGGCACCTG TAACACTATG GGGACGGCAT CGACCATGGC TTGTATGGTG
GAAACCCTTG GGGTGAGTTT GCCACACAAT GCAACCATTC CTGCGGTGGA TTCTCGCCGC
CAAGTGCTGG CCCATATGTC GGGAATGCGA ATTGTGGACA TGGTCAAAGA GGATTTGACC
TTAAGTAAAA TTTTAAGCCG TGATGCTTTT ATCAATGCCA TCAAAGTGAA TGCTGCCATT
GGGGGGTCAA CCAACGCCGT AATCCATTTA AAGGCGATCG CCGGCAGGAT AGGGGTTGAG
CTGTCACTCG ATGACTGGCG CCATGGTTAC ACAGTACCGA CCATAGTGAA TCTTAAACCT
TCGGGTCAGT ACTTAATGGA AGACTTTTAC TACGCAGGTG GCCTGCCAGC AGTACTAAGG
CAACTGTTTG AGCATGATTT ACTGAGCAAA AATACGCTTA CAGTCAATGC CGCTAGCCTC
TGGGACAATG TCAAAGAGGC GCCGTGTTAT AACCAAGAGG TGATCATGTC ACTTGAAAAT
CCCTTGGTTG AAAATGGCGG CATTCGCGTA CTGCGCGGCA ATCTCGCGCC TAGAGGCGCT
GTGATCAAGC CTTCAGCCGC CAGCGCACAC CTGATGCAAC ACCGCGGTAA AGCCGTGGTG
TTTGAAAGCT TCGACGATTA CAACGCTCGC ATCGGCGATC CTGAATTGGA TATCGATGAA
AACAGCATTA TGGTGCTTAA AAACTGTGGC CCAAAGGGAT ATCCGGGCAT GGCAGAGGTC
GGCAATATGG GACTGCCACC GAAGTTGTTG AAAAAAGGAA TTAAGGACAT GGTGAGGATT
TCCGATGCAC GCATGAGTGG CACCGCCTTT GGCACAGTTG TGCTGCATGT TGCCCCAGAA
GCACAAGCCC TTGGGCCACT GGCCGCCGTC CAAAATGGTG ACATGATAGC GCTAGATACC
TATGCCGGAA CGTTACAGCT GGAGATCAGT GACCAAGAGT TACAAGCCCG TCTTGCCAAA
CTGGCGACAG TGAAATCCAT TCCTGTGAAT GGTGGCTATC TCTCGCTCTT TAAGGAGCAT
GTTCTCCAGG CGGATGAGGG CTGTGATTTT GATTTTCTCG TGGGATGTCG AGGTGCAGAG
ATACCAGCAC ATTCCCATTA A
 
Protein sequence
MCLGRRRCHM NNKKPKTLRS ASWFGSDDKN GFMYRSWMKN QGIPEHHFQN KPVIGICNTW 
SELTPCNGHL RELAQRVKNG IREAGGIPVE FPVFSNGESN LRPSAMLTRN LAAMDTEEAI
RGNPIDGVVL LVGCDKTTPA LLMGAASCDL PTIVVTGGPM LNGKHKGKDV GSGTLVWELH
QEYKAGNISL AAFMNAEADM SRSTGTCNTM GTASTMACMV ETLGVSLPHN ATIPAVDSRR
QVLAHMSGMR IVDMVKEDLT LSKILSRDAF INAIKVNAAI GGSTNAVIHL KAIAGRIGVE
LSLDDWRHGY TVPTIVNLKP SGQYLMEDFY YAGGLPAVLR QLFEHDLLSK NTLTVNAASL
WDNVKEAPCY NQEVIMSLEN PLVENGGIRV LRGNLAPRGA VIKPSAASAH LMQHRGKAVV
FESFDDYNAR IGDPELDIDE NSIMVLKNCG PKGYPGMAEV GNMGLPPKLL KKGIKDMVRI
SDARMSGTAF GTVVLHVAPE AQALGPLAAV QNGDMIALDT YAGTLQLEIS DQELQARLAK
LATVKSIPVN GGYLSLFKEH VLQADEGCDF DFLVGCRGAE IPAHSH