Gene EcHS_A0749 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A0749 
Symbol 
ID5592505 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp760300 
End bp761733 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content55% 
IMG OID640919926 
ProductRHS repeat-containing protein 
Protein accessionYP_001457500 
Protein GI157160182 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3209] Rhs family protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.00294679 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTGGCCGG ATAACCGTAT CGCCCGTGAC GCGCACTATC TTTACCGGTA TGACCGTCAC 
GGCAGGCTGA CGGAGAAAAC CGACCTCATC CCGGAAGGGG TTATCCGCAC GGATGATGAG
CGCACCCACC GGTACCATTA CGACAGTCAG CACCGGCTGG TGCACTACAC GCGGACACAA
TATGCAGAGC CGCTGGTCGA AAGCCGCTAT CTTTACGACC CGCTGGGCCG CAGGGTGGCA
AAACGGGTGT GGCGACGTGA ACGGGACCTG ACGGGCTGGA TGTCGCTGTC ACGGAAACCG
CAAGTGACCT GGTACGGCTG GGACGGCGAC CGCCTGACCA CAATACAGAA CGACAGAACC
CGCATCCAGA CGATTTATCA GCCGGGGAGC TTCACGCCAC TCATCAGGGT TGAAACCGCC
ACCGGTGAGC AGGCGAAAAC GCAGCGCCGC AGCCTGGCGG ATACCCTTCA GCAGTCCGGC
GGCGAAGACG GTGGCAGTGT GGTGTTCCCG CCGGTGCTGG TGCAGATGCT CGACCGGCTG
GAAAGTGAAA TCCTGGCTGA CCGGGTGAGT GAGGAAAGCC GCCGCTGGCT GGCATCGTGC
GGCCTGACGG TGGAGCAGAT GCAAAACCAG ATGGACCCGG TGTACACGCC GGCGCGAAAA
ATCCACCTGT ACCACTGCGA CCATCGCGGC CTGCCGCTGG CGCTTGTCAG CACGGAAGGG
GCAACAGAAT GGTGCGCAGA ATACGATGAA TGGGGCAACC TGCTGAATGA AGAGAACCCG
CATCAGCTGC AGCAGCTTAT CCGCCTGCCG GGGCAGCAGT ATGATGAGGA GTCCGGCCTG
TATTACAACC GCCACCGCTA TTATGACCCG CTGCAGGGGA GGTATATCAC TCAGGATCCG
ATTGGGCTGA AGGGGGGATG GAATTTTTAT CAGTATCCGC TGAATCCGGT TCAGTATATA
GATTCAATGG GACTGGCATC AAAATATGGA CACTTAAATA ATGGCGGATA TGGAGCGAGA
CCCAACAAAC CGCCTACGCC CGATCCAAGT AAATTGCCGG ACATAGCGAA ACAATTAAGA
CTGCCATATC CTATTGACCA GGCCAGTAGT GCGCCTAATC TTTTCAAAAC ATTCTTCAGA
GCATTAAGCC CTTACGACTA CACACTGTAT TGCAGGAAGT GGGTAAAACC AAATCTGACT
TGTACGCCAC AGGATGATTC CCAGTATCCA GGGATGGATA CAAAGACAGC AAGTGATTAC
CTGCCACAGA CAAATTGGCC AACAACTCAA TTACCACCAG GATATACTTG TGCAGAACCC
TATTTATTCC CAGACATTAA TAAACCCGAT GGGCCAGCAA CAGCAGGGAT AGATGATTTG
GGTGAAATTT TAGCTAAGAT GAAACAGAGA ACATCGAGAG GAATAAGAAA ATGA
 
Protein sequence
MWPDNRIARD AHYLYRYDRH GRLTEKTDLI PEGVIRTDDE RTHRYHYDSQ HRLVHYTRTQ 
YAEPLVESRY LYDPLGRRVA KRVWRRERDL TGWMSLSRKP QVTWYGWDGD RLTTIQNDRT
RIQTIYQPGS FTPLIRVETA TGEQAKTQRR SLADTLQQSG GEDGGSVVFP PVLVQMLDRL
ESEILADRVS EESRRWLASC GLTVEQMQNQ MDPVYTPARK IHLYHCDHRG LPLALVSTEG
ATEWCAEYDE WGNLLNEENP HQLQQLIRLP GQQYDEESGL YYNRHRYYDP LQGRYITQDP
IGLKGGWNFY QYPLNPVQYI DSMGLASKYG HLNNGGYGAR PNKPPTPDPS KLPDIAKQLR
LPYPIDQASS APNLFKTFFR ALSPYDYTLY CRKWVKPNLT CTPQDDSQYP GMDTKTASDY
LPQTNWPTTQ LPPGYTCAEP YLFPDINKPD GPATAGIDDL GEILAKMKQR TSRGIRK