Gene ECH74115_0721 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0721 
SymbolrlpA 
ID6969382 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp748735 
End bp749823 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content57% 
IMG OID643384756 
Productrare lipoprotein A 
Protein accessionYP_002269269 
Protein GI209399263 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0797] Lipoproteins 
TIGRFAM ID[TIGR00413] rare lipoprotein A 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000475341 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value0.474734 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTAAGC AGTGGCTCGG GATCTGCATC GCGGCAGGAA TGCTCGCGGC ATGTACAAGC 
GATGATGGTC AGCAACAGAC AGTAAGTGTA CCGCAGCCTG CGGTATGTAA CGGCCCCATA
GTTGAAATTA GCGGGGCGGA CCCGCGTTTC GAACCACTGA ACGCGACGGC AAATCAGGAT
TACCAGCGCG ACGGTAAAAG CTACAAAATC GTGCAGGATC CGTCTCGATT TATCCAGGCG
GGACTGGCGG CAATCTATGA TGCCGAACCA GGCAGTAACC TGACGGCCTC TGGCGAAGCT
TTCGATCCGA CACAGCTGAC GGCGGCCCAT CCAACGCTTC CGATCCCCAG CTACGCCAGA
ATCACTAACC TGGCTAACGG GCGAATGATC GTGGTGCGCA TTAATGATCG CGGTCCTTAC
GGCAACGACC GCGTTATTTC GCTTTCTCGC GCAGCAGCTG ACCGTCTTAA CACGTCAAAC
AACACCAAAG TTCGTATCGA TCCGATTATT GTCGCCCAGG ATGGTTCGCT TTCTGGTCCT
GGTATGGCTT GTACCACAGT CGCCAAACAG ACTTACGCCC TGCCTGCACC TCCCGATTTA
AGCGGTGGCG CGGGAACAAG TTCAGTGTCT GGCCCGCAGG GTGACATTCT TCCGGTCAGT
AATTCGACGC TAAAAAGCGA AGATCCGACC GGCGCGCCGG TAACCAGCAG CGGTTTCCTC
GGCGCACCAA CGACCTTAGC GCCAGGTGTA CTGGAAGGCA GCGAACCGAC GCCTGCTCCA
CAGCCCGTTG TTACAGCTCC GTCGACAACG CCTGCAACCT CGCCTGCAAT GGTGACACCG
CAAGCCGCCT CGCAAAGCGC CAGCGGCAAC TTTATGGTGC AGGTCGGGGC CGTAAGCGAT
CAGGCTCGTG CGCAACAGTA CCAACAGCAA CTGGGACAGA AGTTCGGCGT CCCCGGTCGC
GTAACTCAAA ATGGCGCGGT CTGGCGGATC CAGCTTGGCC CATTCGCCAA CAAAGCCGAA
GCCAGTACCT TGCAGCAACG TTTGCAAACC GAAGCCCAAT TACAGTCATT TATTACCACC
GCGCAGTAG
 
Protein sequence
MRKQWLGICI AAGMLAACTS DDGQQQTVSV PQPAVCNGPI VEISGADPRF EPLNATANQD 
YQRDGKSYKI VQDPSRFIQA GLAAIYDAEP GSNLTASGEA FDPTQLTAAH PTLPIPSYAR
ITNLANGRMI VVRINDRGPY GNDRVISLSR AAADRLNTSN NTKVRIDPII VAQDGSLSGP
GMACTTVAKQ TYALPAPPDL SGGAGTSSVS GPQGDILPVS NSTLKSEDPT GAPVTSSGFL
GAPTTLAPGV LEGSEPTPAP QPVVTAPSTT PATSPAMVTP QAASQSASGN FMVQVGAVSD
QARAQQYQQQ LGQKFGVPGR VTQNGAVWRI QLGPFANKAE ASTLQQRLQT EAQLQSFITT
AQ