Gene Shewmr4_1039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_1039 
Symbol 
ID4251112 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp1212682 
End bp1213896 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content50% 
IMG OID638117612 
Productphosphopentomutase 
Protein accessionYP_733176 
Protein GI113969383 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1015] Phosphopentomutase 
TIGRFAM ID[TIGR01696] phosphopentomutase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.536446 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000672539 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAACGTA CAGTTATAAT GATGTTGGAT TCCTTTGGCG TGGGCGCCGC TGGCGATGCC 
GCCAAGTTTG GGGATCTAGG TTCTGATACT TTTGGCCATA TCGCTAAAGC GTGTGCCGAA
GGTAAAGCCG ATATCGGCCG TGAAGGCCCG TTAACGCTGC CAAACTTGGC GCGTTTAGGT
TTAGCCCATG CGGCGATGGA AAGCACTGGG GCGTTTGCTC CAGGCTTTGC GGACGATGTT
GAGCTGATTG GTGCCTATGG CCACGCTCAG GAATTAAGTT CGGGTAAAGA TACTCCGAGC
GGTCACTGGG AAATGGCGGG TGTGCCCGTA TTATTCGACT GGGGCTATTT CAGCGAGCAC
CAAAACTCGT TCCCTAAAGA GCTGACAGAT AAGATTCTCG CCCGTGCAGG ACTCGATGGC
TTTTTAGGTA ACTGCCATGC TTCTGGTACC ACGATTCTGG AAGAATTAGG CGAAGAGCAC
ATGCGTTCTG GCAAGCCGAT TTTTTACACT TCGGCGGATT CGGTATTCCA GATTGCCTGC
CATGAAGGCA CATTTGGTTT AGAAAATTTA TATCGTCTTT GCGAAATCGC CCGCGAAGAG
TTAGAGCCTT ACAACATTGG CCGCGTGATT GCGCGTCCAT TCGATGGCAC TGGCCCAAGC
GATTTTGCTC GTACTGGTAA CCGTAAGGAT TACTCCCTCG AGCCGCCAGC GAAGACGGTA
TTAGATAAGT TAAAAGCCGC CGGTGGTGAA GTGGTGAGTG TGGGCAAGAT TGCCGATATT
TACGCTTACT GTGGTATCAC CAAAAAGGTG AAGGCAAACG GTTTAGAAGC GCTATTTGAT
GCGACTTTAG ACGAAGTGAA ATCAGCGGGT GAAAATACTA TTGTATTCAC TAACTTTGTT
GATTTTGACT CCCACTATGG TCACCGCCGT GATGTGGCAG GTTATGCGAA AGGGCTGGAG
TATTTCGACT CGCGTTTACC TGAAATGCTC GCGCTGCTGG ATGAGGACGA TCTATTAATC
CTCACCGCTG ATCATGGTTG CGACCCAACA TGGCAAGGTA CGGATCATAC CCGTGAATAT
GTGCCTGTAT TGGCCTATGG CGCAGGGCTA AAAGCCGGTT CACTCGGTCG CCGTAACAGT
TTCGCCGATA TCGGCCAATC TATCGCAAGC TACTTCAAGC TTGAGCCGAT GGAATACGGT
GAGTCGTTTA TCTAA
 
Protein sequence
MKRTVIMMLD SFGVGAAGDA AKFGDLGSDT FGHIAKACAE GKADIGREGP LTLPNLARLG 
LAHAAMESTG AFAPGFADDV ELIGAYGHAQ ELSSGKDTPS GHWEMAGVPV LFDWGYFSEH
QNSFPKELTD KILARAGLDG FLGNCHASGT TILEELGEEH MRSGKPIFYT SADSVFQIAC
HEGTFGLENL YRLCEIAREE LEPYNIGRVI ARPFDGTGPS DFARTGNRKD YSLEPPAKTV
LDKLKAAGGE VVSVGKIADI YAYCGITKKV KANGLEALFD ATLDEVKSAG ENTIVFTNFV
DFDSHYGHRR DVAGYAKGLE YFDSRLPEML ALLDEDDLLI LTADHGCDPT WQGTDHTREY
VPVLAYGAGL KAGSLGRRNS FADIGQSIAS YFKLEPMEYG ESFI