Gene EcolC_2953 details


Gene Information

Locus tag: EcolC_2953
Symbol:
ID: 6065650
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 3220648
End bp: 3222081
Gene Length: 1434 bp
Protein Length: 477 aa
Translation table: 11
GC content: 54%
IMG OID: 641602364
Product: RHS protein
Protein accession: YP_001725906
Protein GI: 170020952
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3209] Rhs family protein
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.944321
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.880791
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTGGCCGG ATAACCGTAT CGCCCGTGAC GCGCACTATC TTTACCGGTA TGACCGTCAC 
GGCAGGCTGA CGGAGAAAAC CGACCTCATC CCGGAAGGGG TTATCCGCAC GGATGATGAG
CGCACCCACC GGTACCATTA CGACAGTCAG CACCGGCTGG TGCACTACAC GCGGACACAA
TATGCAGAGC CGCTGGTCGA AAGCCGCTAT CTTTACGACC CGCTGGGCCG CAGGGTGGCA
AAACGGGTAT GGCGACGTGA ACGGGACCTG ACGGGCTGGA TGTCGCTGTC ACGGAAACCG
CAAGTGACCT GGTACGGCTG GGACGGCGAC CGGCTGACCA CGATACAGAA CGACAGGAGC
CGCATCCAGA CGATTTATCA GCCGGGGAGC TTCACACCAC TCATCAGAGT TGAAACTGCC
ACCGGTGAAC TGGCCAGAAC GCAGCGCCGC AGCCTGGCGG ATGCGCTTCA GCAGTCCGGC
GGCGAAGACG GTGGCAGTGT GGTGTTCCCG CCGGTGCTGG TGCAGATGCT CGACCGGCTG
GAAAGTGAAA TCCTGGCTGA CCGGGTGAGT GAGGAAAGCC GCCGCTGGCT GGCATCGTGC
GGCCTGACCG TGGAGCAGAT GCAAAACCAG ATGGACCCGG TGTACACGCC GGCGCGAAAA
ATCCACCTGT ACCACTGCGA CCATCGCGGC CTGCCGCTGG CGCTTGTCAG CACGGAAGGG
GCAACAGAAT GGTGCGCAGA ATACGATGAA TGGGGCAACC TGCTGAATGA AGAGAACCCG
CATCAGCTGC AGCAGCTTAT CCGCCTGCCG GGGCAGCAGT ATGATGAGGA GTCCGGCCTG
TATTACAACC GCCACCGCTA TTATGACCCG CTGCAGGGGA GGTATATCAC TCAGGATCCG
ATTGGGCTGA AGGGGGGATG GAATTTTTAT CAGTATCCGC TGAATCCGGT TCAGTATATA
GATTCAATGG GACTGGCATC AAAATATGGA CACTTAAATA ATGGCGGATA TGGAGCGAGA
CCCAACAAAC CGCCTACGCC CGATCCAAGT AAATTGCCGG ACATAGCGAA ACAATTAAGA
CTGCCATATC CTATTGACCA GGCCAGTAGT GCGCCTAATG TTTTCAAAAC ATTCTTCAGA
GCATTAAGCC CTTACGACTA CACACTGTAT TGCAGGAAGT GGGTAAAACC AAATCTGACT
TGTACGCCAC AGGATGATTC CCAGTATCCA GGGATGGATA CAAAGACAGC AAGTGATTAC
CTGCCACAGA CAAATTGGCC AACAACTCAA TTACCACCAG GATATACTTG TGCAGAACCC
TATTTATTCC CAGACATTAA TAAACCCGAT GGGCCAGCAA CAGCAGGGAT AGATGATTTG
GGTGAAATTT TAGCTAAGAT GAAACAGAGA ACATCGAGAG GAATAAGAAA ATGA
 
Protein sequence
MWPDNRIARD AHYLYRYDRH GRLTEKTDLI PEGVIRTDDE RTHRYHYDSQ HRLVHYTRTQ 
YAEPLVESRY LYDPLGRRVA KRVWRRERDL TGWMSLSRKP QVTWYGWDGD RLTTIQNDRS
RIQTIYQPGS FTPLIRVETA TGELARTQRR SLADALQQSG GEDGGSVVFP PVLVQMLDRL
ESEILADRVS EESRRWLASC GLTVEQMQNQ MDPVYTPARK IHLYHCDHRG LPLALVSTEG
ATEWCAEYDE WGNLLNEENP HQLQQLIRLP GQQYDEESGL YYNRHRYYDP LQGRYITQDP
IGLKGGWNFY QYPLNPVQYI DSMGLASKYG HLNNGGYGAR PNKPPTPDPS KLPDIAKQLR
LPYPIDQASS APNVFKTFFR ALSPYDYTLY CRKWVKPNLT CTPQDDSQYP GMDTKTASDY
LPQTNWPTTQ LPPGYTCAEP YLFPDINKPD GPATAGIDDL GEILAKMKQR TSRGIRK
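
Consistency check (sketch)

The coordinates, lengths, and sequences in this record can be cross-checked with a few lines of Python. The following is a minimal sketch, not part of the database: the function names (clean, span_length, gc_fraction, translate) are hypothetical and chosen only for illustration. It assumes the sequence is pasted in the grouped layout shown above (blocks of ten residues separated by spaces) and that coordinates are 1-based and inclusive. The codon-to-amino-acid assignments used are those of the standard genetic code, which translation table 11 shares; table 11 differs only in the set of permitted start codons.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: table 11 uses the same codon-to-amino-acid mapping as the
# standard code and differs only in its allowed start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq_block):
    """Remove the spaces and line breaks that group residues in tens."""
    return "".join(seq_block.split()).upper()


def span_length(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_fraction(dna):
    """Fraction of G and C bases; returns 0.0 for an empty sequence."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Coordinates from the record: 3222081 - 3220648 + 1 = 1434 bp.
gene_length = span_length(3220648, 3222081)
print(gene_length)            # 1434
# 1434 / 3 = 478 codons; one is the stop codon, leaving 477 residues.
print(gene_length // 3 - 1)   # 477

# First three blocks of the gene sequence shown above.
first_blocks = "ATGTGGCCGG ATAACCGTAT CGCCCGTGAC"
print(translate(clean(first_blocks)))   # MWPDNRIARD
```

Passing the full gene sequence through clean and translate should reproduce the protein sequence listed above, and gc_fraction on the cleaned gene sequence can be compared against the GC content field.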