Gene Shewana3_2066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewana3_2066 
Symbol 
ID4476312 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. ANA-3 
KingdomBacteria 
Replicon accessionNC_008577 
Strand
Start bp2473107 
End bp2474609 
Gene Length1503 bp 
Protein Length500 aa 
Translation table11 
GC content50% 
IMG OID639726651 
ProductL-arabinose isomerase 
Protein accessionYP_869702 
Protein GI117920510 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2160] L-arabinose isomerase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000205701 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000020018 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAGCCT TCAAACAAAA ACAAGTGTGG TTTATCACGG GTTCGCAGGA TTTATACGGC 
CCAAAAGTAT TAGAGCAAGT CGCTAAAAAC AGTGAGCAAA TTGTTCATGG CTTTAATGAA
TCATCCGCCA TTTCCATCGA AGTGGTGTAT AAGCCAACCG TAAAATCCCC ACGGGAAATT
CACGCCGTAT GCCAAGCGGC CAACAGCGAT GAAAACTGTG TTGGCGTTAT TCTGTGGATG
CACACTTTCT CTCCAGCCAA GATGTGGATT GCTGGCCTTA ATGAATTAAG CAAGCCATTC
ATGCACTTAC ACACGCAGTT CAATGCTGAG CTCCCTTGGA GCGAAATCAA CATGAACTAC
ATGAACACCC ACCAAAGTGC TCACGGTTGC CGCGAATTTG GTTTTATCGG CACTCGTATG
CGTAAAGAGC GCAAAGTGGT TGTGGGTCAC TGGCAATCGA GCGATGTACA GGCTCAAATC
GATGATTGGT GCCGCGCAGC GGCAGGTTGG CACGAGAGCC AAAACCTGCG TATCGCCCGC
TTTGGCGACA ACATGCGTCA AGTGGCCGTA ACCGAAGGTG ACAAAGTTGC CGCACAAATT
CAATTCGGTT ATGAAGTGCA CGCCTACAGC TTAGGTGAAC TCAATGAGGC GATTGCAGAC
ATTGCCGAAG GCGATGTAAC CGCACAACTC GACCGTTACG CCAGCGAATA CCAAGTAGGT
AACGAGCTAT TTGGCGATGA ATACCAATTA GACCGTTTAA GAAAAGAAGC CAAGATTGAA
CTCGGCTTAA CCCAATTCTT AACCCAAGGT GGATTTGGTG CCTTTACCAA CTGCTTCGAA
AACCTCACTG GCATGACAGG ATTACCCGGA CTGGCTACTC AACGTCTGAT GGCGAACGGT
TTCGGTTACG GCGGCGAAGG TGACTGGAAA ACGGCTGCCA TGGTGCGCAT CATGAAAGTG
ATGGGCCAAG GCCGTGCCGG TGGTACTTCA TTTATGGAAG ACTACACCTA TAACTTTGGC
GCGACTGACC AAGTTCTTGG CGCCCACATG CTAGAAGTGT GCCCATCGAT TGCTGCTGCA
AAACCGCGTT TAGAAGTTCA CCGCCACACC ATTGGTGTGC GTTGTGACGT GCCACGTCTG
TTATTCACTG GTAAAGCGGG CCCAGCAATC AACGTATCGA CTATCGATTT AGGCAACCGT
TTCCGTATCA TTCTTAATGA ATTAGATACA GTGACACCAC CACAGGATCT GCCAAATCTG
CCTGTCGCGT CTGCGCTGTG GGAGCCTCGT CCGAATTTAG CGGTTGCCGC CGCAGCTTGG
ATCCACGCCG GTGGTGCTCA CCACTCAGCT TACAGCCAAG CTATCACGAC GGATCAGATT
GTCGACTTTG CTGAAATGGC CGGTGCTGAA CTGGTTATCA TCGATGCCGA CACTAAGATC
CGCGAGTTTA AGAATGAGCT TCGCCAAAAT TCCGTTTATT ACGGTTTAGC AAGAGGTTTA
TAA
 
Protein sequence
MKAFKQKQVW FITGSQDLYG PKVLEQVAKN SEQIVHGFNE SSAISIEVVY KPTVKSPREI 
HAVCQAANSD ENCVGVILWM HTFSPAKMWI AGLNELSKPF MHLHTQFNAE LPWSEINMNY
MNTHQSAHGC REFGFIGTRM RKERKVVVGH WQSSDVQAQI DDWCRAAAGW HESQNLRIAR
FGDNMRQVAV TEGDKVAAQI QFGYEVHAYS LGELNEAIAD IAEGDVTAQL DRYASEYQVG
NELFGDEYQL DRLRKEAKIE LGLTQFLTQG GFGAFTNCFE NLTGMTGLPG LATQRLMANG
FGYGGEGDWK TAAMVRIMKV MGQGRAGGTS FMEDYTYNFG ATDQVLGAHM LEVCPSIAAA
KPRLEVHRHT IGVRCDVPRL LFTGKAGPAI NVSTIDLGNR FRIILNELDT VTPPQDLPNL
PVASALWEPR PNLAVAAAAW IHAGGAHHSA YSQAITTDQI VDFAEMAGAE LVIIDADTKI
REFKNELRQN SVYYGLARGL