Gene Shewana3_1043 details


Gene Information

Locus tag: Shewana3_1043
Symbol:
ID: 4479388
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. ANA-3
Kingdom: Bacteria
Replicon accession: NC_008577
Strand:
Start bp: 1220286
End bp: 1221500
Gene Length: 1215 bp
Protein Length: 404 aa
Translation table: 11
GC content: 50%
IMG OID: 639725586
Product: phosphopentomutase
Protein accession: YP_868684
Protein GI: 117919492
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1015] Phosphopentomutase
TIGRFAM ID: [TIGR01696] phosphopentomutase
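
The start, end, and length values in this record agree with one another, and a few lines of arithmetic show how. The sketch below is illustrative only: the variable names are not part of the record, and it assumes that the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length.

```python
# Values copied from the gene record; variable names are illustrative.
start_bp = 1220286
end_bp = 1221500

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the coding sequence ends with one stop codon that is
# not part of the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, codons, protein_length)  # 1215 405 404
```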


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000600335
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAACGTA CAGTTATAAT GATGTTGGAT TCCTTTGGCG TGGGCGCTGC TGGTGACGCC 
GCCAAGTTTG GGGATCTAGG TTCTGACACA TTTGGCCATA TCGCTAAAGC GTGTGCCGAA
GGTGAAGCCG ATATCGGCCG TGAAGGTCCG TTAACGCTGC CAAACTTGGC CCGTTTAGGG
TTAGCCCATG CGGCGATGGA AAGCACGGGG GCGTTTGCTC CAGGCTTTGC GGACAATGTT
GAGTTGATTG GTGCCTATGG CCATGCTCAG GAATTAAGTT CGGGTAAAGA TACCCCGAGC
GGTCACTGGG AAATGGCGGG AGTACCCGTA TTATTCGAAT GGGGCTATTT TAGCGAGCAC
CAAAACTCAT TCCCTAAAGA GCTGACCGAT AAGATTTTGG CCCGTGCGGG ACTCGATGGC
TTTTTAGGTA ACTGCCATGC TTCTGGTACT ACTATTCTGG AAGAATTAGG CGAAGAGCAC
ATGCGCTCTG GTAAGCCAAT TTTTTACACC TCGGCAGATT CTGTATTCCA GATTGCATGC
CATGAAGGCA CATTTGGCTT AGAAAATTTA TATCGTCTTT GCGAAATCGC CCGTGAAGAG
TTAGAACCTT ATAACATTGG CCGCGTGATT GCGCGTCCAT TCGATGGTAC TGGCCCAAGC
GACTTTGCTC GTACTGGTAA CCGTAAGGAT TACTCCCTCG AGCCGCCAGC GAAGACAGTG
TTAGATAAGT TAAAAGCCGC CGGTGGTGAA GTGGTGAGTG TGGGCAAGAT TGCCGACATT
TACGCTTACT GTGGCATCAC CAAAAAGGTG AAGGCAAACG GTTTAGAAGC GTTATTTGAT
GCAACGTTAG CCGAAGTGAA ATCAGCGGGT GAAAATACTA TCGTATTCAC TAACTTTGTT
GATTTTGACT CCCACTATGG CCACCGCCGT GATGTAGCAG GTTATGCGAA AGGGCTGGAG
TATTTCGACT CGCGTTTACC TGAAATGCTG GCGCTGCTGG ATGAGGACGA TCTGTTAATC
CTCACCGCTG ACCATGGTTG CGACCCAACA TGGCAAGGTA CTGACCATAC TCGTGAATAT
GTGCCTGTAT TGGCCTATGG CGCAGGGCTG AAAGCGGGGT CACTCGGTCG CCGTAACAGT
TTCGCCGATA TCGGTCAATC TATCGCAAGC TACTTCAAGC TTGAGCCGAT GGAATACGGT
GAGTCGTTTA TCTAA
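
The gene sequence is printed in blocks of 10 bases, so the spacing has to be removed before the sequence can be measured. The following is a minimal sketch of that clean-up and of a GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first printed line of the sequence is used, so the result describes those 60 bases rather than the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First printed line of the gene sequence only (60 bases).
first_line = "ATGAAACGTA CAGTTATAAT GATGTTGGAT TCCTTTGGCG TGGGCGCTGC TGGTGACGCC"
seq = clean_sequence(first_line)

print(len(seq), gc_fraction(seq))  # 60 0.5
```

Passing all lines of the gene sequence to the same helpers would give the length and GC fraction of the full gene.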
 
Protein sequence
MKRTVIMMLD SFGVGAAGDA AKFGDLGSDT FGHIAKACAE GEADIGREGP LTLPNLARLG 
LAHAAMESTG AFAPGFADNV ELIGAYGHAQ ELSSGKDTPS GHWEMAGVPV LFEWGYFSEH
QNSFPKELTD KILARAGLDG FLGNCHASGT TILEELGEEH MRSGKPIFYT SADSVFQIAC
HEGTFGLENL YRLCEIAREE LEPYNIGRVI ARPFDGTGPS DFARTGNRKD YSLEPPAKTV
LDKLKAAGGE VVSVGKIADI YAYCGITKKV KANGLEALFD ATLAEVKSAG ENTIVFTNFV
DFDSHYGHRR DVAGYAKGLE YFDSRLPEML ALLDEDDLLI LTADHGCDPT WQGTDHTREY
VPVLAYGAGL KAGSLGRRNS FADIGQSIAS YFKLEPMEYG ESFI
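
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below illustrates this on the first and last printed lines of the gene sequence. It is not the method used to produce this record: the `translate` helper is hypothetical, and the codon table is built by hand on the assumption that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as start codons, which does not matter here because the gene begins with ATG).

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna: str) -> str:
    """Translate a DNA string (spaces and line breaks allowed) in frame 1."""
    seq = "".join(blocked_dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First and last printed lines of the gene sequence.
first_line = "ATGAAACGTA CAGTTATAAT GATGTTGGAT TCCTTTGGCG TGGGCGCTGC TGGTGACGCC"
last_line = "GAGTCGTTTA TCTAA"

print(translate(first_line))  # MKRTVIMMLDSFGVGAAGDA
print(translate(last_line))   # ESFI*
```

The first output matches the first 20 residues of the protein sequence, and the second matches its last four residues followed by the stop codon.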