Gene Rpic12D_4473 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpic12D_4473 
Symbol 
ID8022161 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRalstonia pickettii 12D 
KingdomBacteria 
Replicon accessionNC_012857 
Strand
Start bp1109538 
End bp1110884 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content57% 
IMG OID644833247 
ProductZonular occludens toxin 
Protein accessionYP_002984387 
Protein GI241666028 
COG category[R] General function prediction only 
COG ID[COG4128] Zonula occludens toxin 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.272014 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000467157 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTTCATTT TCCACGAAGG TTTGCCGCGT TCGGGTAAGA GCTATGAGGC GATGGTGAAG 
CGCATTATTC CTGCGCTTCA GAAGGGCCGT AAGGTGTTTG CTCGGTTGAA TGGCATGGAT
TACGAGAAGA TTGCCGAGGT CGCTGAGATG CCGTTGGAGC GTGTGCGTGA GCTGCTGCAC
GAGATCCCGA AGGACAAGGT CAAGGAGTGG TTCAAGGTCG TTGAGAACGA CTCGCTTGTG
ATCTTGGATG AGTTGCAGAA TTTTTGGCCC GATAGTGGTC GGCGTTTGCC TCTCGATCAG
ATCGAAGCGA TTTCAGAGCA CGGGCATCGT GGTCTGGACA TTGTTGGCAT GGGGCAGCTG
CTTAAGGGTA AGGGTGGCTG CGATGCGAAC TGGGTGAACC GTTGCGATCA GAAGATTGTT
TTCGAGAAGC AGAATGCTCG TGGCGCAGAC GATAAGTACC GATGGACCGC CTATAAGGGC
AAGTTGGTCG ACGGGAAGAT TCAATTCCTG AAGGTCAATA CCGGTGTTGT GAGCTACGAC
AAGAAGTATT TCGGCACATA TGCGACGCGT GTCGACGGTT CGGACAATGC CGAGACGTAT
CAGGATGCGC GCACCAACGT GTGGAATAAC CCAGTGCTGC GCAAGTTCGC GCCGGCCATG
CTGATCTGTG CGGGCGTTGC TGTTTGGTAT CTGTGGCATG CCTTCAAGGG CGGTGGCTTG
GAGAAGAGCC TCGGTGGCCA CAAGGTCGAA AAATCCGTTA CGGTAACGAG TACTCCGCCG
GCGGTTACTT CGGCATCAGC TGTGCAGGTT GGTGCAGCTC AAGCTCAGCC TGCAGCTGCC
AAGAGTGCCC CCGTTAGCGA GTCGCAGAAG CAGGATGCGA TGGCGGACGA CTATGTGGCG
ACGATTTCGC AGAAATGGCG TCCAAGGCTT TCTGGCCTGG TTTGGGGCGC GAAGGGCGCG
CGCCTGGTGG TGGAGTGGTA TGACGAGAGT TTTCGCTTGA AGGAGCGGTT CAGCGCGGCC
CAGCTGGAGG AGTTCGGGTG GGGCGTTGCC AGGTCGGCCT ATGGTGAGCA CATCATTCTC
AGCAAGGGCG GTGTGCATAT CGCTGTCACC AGCTGGCCGA TGGAGTCCTT CGGCAAGGTC
AGTGAGGCCG ATAGCAGGGC GATTGCGAGC CAGTCGAGTG GCGGGGCACC AGGCTTCGGC
CCGTCCGATT GGCGGCGTGA TGAGGGTGTT TCTGGTGGTG GTGGATCTGT GGTCAGCAGG
GACAGCGGTT CAGATTGGCC TGGCTATGGT GCCGATGGCT TGGTCAAGCA TGCCAAGCCT
GTGCGGTCGA TCTTGACATC CGGTTGA
 
Protein sequence
MFIFHEGLPR SGKSYEAMVK RIIPALQKGR KVFARLNGMD YEKIAEVAEM PLERVRELLH 
EIPKDKVKEW FKVVENDSLV ILDELQNFWP DSGRRLPLDQ IEAISEHGHR GLDIVGMGQL
LKGKGGCDAN WVNRCDQKIV FEKQNARGAD DKYRWTAYKG KLVDGKIQFL KVNTGVVSYD
KKYFGTYATR VDGSDNAETY QDARTNVWNN PVLRKFAPAM LICAGVAVWY LWHAFKGGGL
EKSLGGHKVE KSVTVTSTPP AVTSASAVQV GAAQAQPAAA KSAPVSESQK QDAMADDYVA
TISQKWRPRL SGLVWGAKGA RLVVEWYDES FRLKERFSAA QLEEFGWGVA RSAYGEHIIL
SKGGVHIAVT SWPMESFGKV SEADSRAIAS QSSGGAPGFG PSDWRRDEGV SGGGGSVVSR
DSGSDWPGYG ADGLVKHAKP VRSILTSG