Gene Shewmr4_1979 details


Gene Information

Locus tag: Shewmr4_1979
Symbol:
ID: 4252552
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. MR-4
Kingdom: Bacteria
Replicon accession: NC_008321
Strand:
Start bp: 2353647
End bp: 2355149
Gene Length: 1503 bp
Protein Length: 500 aa
Translation table: 11
GC content: 49%
IMG OID: 638118592
Product: L-arabinose isomerase
Protein accession: YP_734109
Protein GI: 113970316
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2160] L-arabinose isomerase
TIGRFAM ID:
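
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state this) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2353647
END_BP = 2355149
GENE_LENGTH_BP = 1503
PROTEIN_LENGTH_AA = 500


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # drop the terminal stop codon


print(span_length(START_BP, END_BP))   # 1503
print(protein_length(GENE_LENGTH_BP))  # 500
```

Under these assumptions both results agree with the reported Gene Length and Protein Length.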


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000173872
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.0652997
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGCCT TCAAACAAAA ACAAGTGTGG TTTATCACGG GTTCGCAGGA TTTATACGGC 
CCAAAAGTAT TAGAGCAAGT CGCTAAAAAC AGTGAGCAAA TTGTTTATGG CTTTAATGAA
TCTTCCGCCA TTTCCATTGA AGTGGTGTAT AAGCCAACCG TAAAATCCCC ACGTGAAATT
CACGCCGTAT GTCAAGCGGC CAACAGCGAT GAAAACTGTG TTGGCGTTAT TCTGTGGATG
CACACTTTCT CTCCTGCCAA GATGTGGATT GCTGGCCTTA ATGAATTAAG CAAGCCATTC
ATGCACTTAC ACACTCAGTT CAATGCTGAG CTCCCTTGGA GCGAAATCAA TATGAACTAC
ATGAACACCC ACCAAAGTGC TCACGGTTGC CGCGAATTTG GTTTTATCGG CACTCGTATG
CGTAAAGAGC GCAAAGTGGT TGTGGGTCAC TGGCAATCGA GCGATGTACA GGCTCAAATC
GATGATTGGT GCCGCGCAGC GGCGGGTTGG CACGAGAGCC AAAACCTGCG TATTGCCCGC
TTTGGCGACA ACATGCGTCA AGTGGCCGTA ACCGAAGGCG ACAAAGTTGC CGCACAAATT
CAATTCGGTT ACGAAGTGCA CGCCTACAGC TTAGGTGAAC TCAATGAAGC GATTGCAGCC
ATTGCCGAAG GCGATGTAAC CGCACAACTC GACCGTTACG CCAGCGAATA CCAAGTGGGT
AACGAGCTAT TTGGCGATGA ATACCAATTA GACCGTTTAA GAAAAGAAGC CAAGATTGAA
CTCGGCTTAA CCCAATTCTT AACCCAAGGT GGATTTGGTG CCTTTACCAA CTGCTTCGAA
AACCTCACCG GCATGACAGG ATTACCAGGA CTGGCTACCC AACGTCTGAT GGCGAACGGT
TTCGGTTACG GCGGTGAAGG TGACTGGAAA ACGGCTGCCA TGGTGCGCAT CATGAAGGTG
ATGGGCCAAG GCCGCGCCGG TGGTACTTCA TTTATGGAAG ACTACACCTA TAACTTTGGG
GCGACTGACC AAGTTCTTGG CGCCCACATG TTAGAAGTGT GCCCATCGAT TGCTGCTGCA
AAACCGCGTT TAGAAGTTCA CCGCCACACC ATTGGTGTGC GTTGTGACGT GCCACGTCTG
TTATTCACAG GTAAAGCGGG CCCAGCAATC AACGTATCGA CTATCGATTT AGGCAACCGT
TTCCGTATCA TTCTCAATGA ATTAGATACA GTGACACCAC CACAGGATCT GCCAAATCTG
CCTGTCGCAT CTGCGCTGTG GGAGCCTCGT CCGAATTTAG CGGTTGCCGC CGCAGCTTGG
ATCCACGCCG GTGGTGCTCA CCACTCAGCT TACAGCCAAG CTATCACGAC GGATCAGATT
GTCGACTTTG CTGAAATGGC CGGTGCTGAA CTGGTTATCA TCGATGCCGA TACTAAGATC
CGCGAGTTTA AGAATGAGCT TCGCCAAAAT TCCGTTTATT ACGGTTTAGC AAGAGGTTTA
TAA
 
Protein sequence
MKAFKQKQVW FITGSQDLYG PKVLEQVAKN SEQIVYGFNE SSAISIEVVY KPTVKSPREI 
HAVCQAANSD ENCVGVILWM HTFSPAKMWI AGLNELSKPF MHLHTQFNAE LPWSEINMNY
MNTHQSAHGC REFGFIGTRM RKERKVVVGH WQSSDVQAQI DDWCRAAAGW HESQNLRIAR
FGDNMRQVAV TEGDKVAAQI QFGYEVHAYS LGELNEAIAA IAEGDVTAQL DRYASEYQVG
NELFGDEYQL DRLRKEAKIE LGLTQFLTQG GFGAFTNCFE NLTGMTGLPG LATQRLMANG
FGYGGEGDWK TAAMVRIMKV MGQGRAGGTS FMEDYTYNFG ATDQVLGAHM LEVCPSIAAA
KPRLEVHRHT IGVRCDVPRL LFTGKAGPAI NVSTIDLGNR FRIILNELDT VTPPQDLPNL
PVASALWEPR PNLAVAAAAW IHAGGAHHSA YSQAITTDQI VDFAEMAGAE LVIIDADTKI
REFKNELRQN SVYYGLARGL
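
As a quick consistency check between the two sequences, the first 30 nucleotides of the gene sequence can be translated and compared with the first 10 residues of the protein sequence. This is a minimal sketch: the codon lookup below contains only the seven codons that occur in this prefix (their assignments are the same in translation table 11 and the standard code), and the `translate` helper is a hypothetical name, not a library function. A complete check would need the full 64-codon table.

```python
# Partial codon table: only the codons present in the 30-nt prefix below.
CODONS = {
    "ATG": "M", "AAA": "K", "GCC": "A", "TTC": "F",
    "CAA": "Q", "GTG": "V", "TGG": "W",
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace and any trailing partial codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODONS[dna[i:i + 3]] for i in range(0, usable, 3))


# First three 10-nt groups of the gene sequence, as printed in this record.
prefix = "ATGAAAGCCT TCAAACAAAA ACAAGTGTGG"
print(translate(prefix))  # MKAFKQKQVW
```

The output matches the first block of the protein sequence, MKAFKQKQVW.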