Gene SbBS512_E1631 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E1631 
Symbol 
ID6269196 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp1483961 
End bp1485253 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content50% 
IMG OID641725720 
ProductPAP2 family protein 
Protein accessionYP_001880219 
Protein GI187732315 
COG category[T] Signal transduction mechanisms 
COG ID[COG2453] Predicted protein-tyrosine phosphatase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value0.832741 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCTACAAG GCGCTGGCTG GTTATTGTTG CTGGCCCCGT TTTTTTTCTT CACCTATGGA 
TCTCTTAATC AGTTCACCGC GGTTCAGGAC CTTAACAGCC ATGATATCCC CAGTCAGGTA
TTCGGTTGGG AAACGGCGAT CCCTTTTCTT CCCTGGACTA TTGTTCCTTA CTGGAGTCTG
GATCTTTTAT ATGGATTTTC GCTGTTCGTT TGTAGCACGA CATTCGAACA GCGCCGACTT
GTCCACCGGC TTATTCTGGC AACGGTAATG GCCTGCTGCG GTTTTTTGCT CTATCCGCTG
AAGTTTAGTT TTATCCGTCC TGAAGTGAGT GGGGTGACGG GATGGCTATT TTCGCAACTT
GAACTGTTTG ATCTGCCTTA TAACCAGTCT CCTTCGCTGC ATATTATTCT CTGCTGGCTA
CTTTGGCGTC ACTTTCGTCA GCATCTGGCT GAGAGGTGGC GTAAAGTCTG TGGCGGATGG
TTTTTACTCA TCGCTATTTC TACTCTGACG ACCTGGCAGC ATCATTTTAT TGATGTCATC
ACAGGGCTGG CGGTAGGTAT GTTGATTGAC TGGATGGTGC CCGTCGATCG TCGTTGGAAT
TATCAGAAAC CTGATCAACG TCGAATCAAA ATAGCACTGC CATATGTCGT AGGCGCGGGC
TCGTGCATTG TGTTGATGGA GCTAATGGTG ATGATTCAGT TATGGTGGTC AGTCTGGTTA
TGTTGGCCAG TATTATCGCT ACTCATTATT GGCCGTGGGT ACGGTGGGCT TGGCGCGATA
ACAACAGGGA AAGATAGTCA GGGGAAACTC CCGCCCGCCG TTTACTGGCT GACATTGCCC
TGGCGCATCG GGATGTGGCT GTCTATGCGT TGGTTTTGTC GTCGCCTGGA GCCGGTGAGC
AAAATTACTG CTGGTGTTTA TTTAGGGGAG TTTCCACGAC ATATTCCGGC ACAGAATGCG
GTTCTGGACG TCACCTTTGA ATTCCCTCGG GGACGAGCGA CAAAAGATCG ACTCTATTTC
TGTGTACCGA TGCTGGATCT GGTGGTTCCG GAAGAGGGGG AGCTCCGACA GGCCGTGGCG
ATGCTGGAAA CATTACGCGA AGAGCAAGGC AGCGTTCTGG TCCATTGCGC GTTGGGATTA
TCGCGCAGTG CGCTGGTAGT GGCGGCATGG TTGTTATGTT ACGGACACTG TAAAACCGTT
GATGAAGCGA TTAGTTATAT TCGAGCCAGA CGTTCGCATA TTGTGCTTAA GGAAGATCAC
AAAGCGATGC TGAAATTATG GGAAAACAGG TAA
 
Protein sequence
MLQGAGWLLL LAPFFFFTYG SLNQFTAVQD LNSHDIPSQV FGWETAIPFL PWTIVPYWSL 
DLLYGFSLFV CSTTFEQRRL VHRLILATVM ACCGFLLYPL KFSFIRPEVS GVTGWLFSQL
ELFDLPYNQS PSLHIILCWL LWRHFRQHLA ERWRKVCGGW FLLIAISTLT TWQHHFIDVI
TGLAVGMLID WMVPVDRRWN YQKPDQRRIK IALPYVVGAG SCIVLMELMV MIQLWWSVWL
CWPVLSLLII GRGYGGLGAI TTGKDSQGKL PPAVYWLTLP WRIGMWLSMR WFCRRLEPVS
KITAGVYLGE FPRHIPAQNA VLDVTFEFPR GRATKDRLYF CVPMLDLVVP EEGELRQAVA
MLETLREEQG SVLVHCALGL SRSALVVAAW LLCYGHCKTV DEAISYIRAR RSHIVLKEDH
KAMLKLWENR