Gene B21_00591 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_00591 
SymbolrlpA 
ID8113409 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp624266 
End bp625354 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content57% 
IMG OID644846867 
Producthypothetical protein 
Protein accessionYP_002998440 
Protein GI251784136 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0797] Lipoproteins 
TIGRFAM ID[TIGR00413] rare lipoprotein A 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0164207 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTAAGC AGTGGCTCGG GATCTGCATC GCGGCAGGAA TGCTCGCGGC ATGTACAAGC 
GATGATGGTC AGCAACAGAC GGTAAGTGTA CCGCAGCCTG CGGTATGTAA CGGCCCCATA
GTTGAAATTA GCGGGGCGGA CCCGCGTTTC GAACCACTGA ACGCGACGGC AAATCAGGAT
TACCAGCGCG ACGGTAAAAG CTACAAAATC GTGCAGGATC CGTCTCGATT TAGCCAGGCG
GGACTGGCGG CAATCTATGA CGCCGAACCT GGCAGTAACC TGACGGCCTC TGGCGAAGCT
TTCGATCCGA CACAGTTGAC GGCGGCCCAT CCAACGCTTC CGATCCCCAG CTACGCCAGA
ATCACTAACC TGGCTAACGG GCGAATGATC GTGGTGCGCA TTAATGATCG CGGTCCTTAC
GGCAACGACC GCGTTATTTC GCTTTCTCGC GCAGCAGCTG ACCGTCTTAA CACATCAAAC
AACACCAAAG TTCGTATCGA TCCGATTATT GTCGCCCAGG ATGGTTCGCT TTCTGGTCCT
GGTATGGCTT GTACCACTGT CGCCAAACAG ACTTACGCCC TGCCTGCACC TCCCGATTTA
AGCGGTGGCG CGGGAACAAG TTCAGTATCT GGCCCGCAGG GTGACATTCT TCCGGTCAGT
AATTCGACGC TAAAAAGCGA AGATCCGACC GGCGCGCCGG TAACCAGCAG CGGTTTCCTC
GGCGCACCAA CGACCTTAGC GCCTGGTGTA CTGGAAGGCA GCGAACCGAC ACCTGCACCA
CAGCCCGTTG TTACAGCTCC GTCGACAACG CCTGCAACCT CGCCTGCAAT GGTGACACCG
CAAGCCGCCT CGCAAAGCGC CAGCGGCAAC TTTATGGTGC AAGTCGGGGC CGTTAGCGAT
CAGGCTCGTG CGCAACAGTA CCAACAGCAA CTGGGACAGA AGTTCGGCGT CCCCGGTCGC
GTAACTCAAA ATGGCGCGGT CTGGCGGATC CAGCTTGGCC CATTCGCCAG CAAAGCCGAA
GCCAGTACCT TGCAGCAACG TTTGCAAACC GAAGCCCAAT TACAGTCATT TATTACCACC
GCGCAGTAG
 
Protein sequence
MRKQWLGICI AAGMLAACTS DDGQQQTVSV PQPAVCNGPI VEISGADPRF EPLNATANQD 
YQRDGKSYKI VQDPSRFSQA GLAAIYDAEP GSNLTASGEA FDPTQLTAAH PTLPIPSYAR
ITNLANGRMI VVRINDRGPY GNDRVISLSR AAADRLNTSN NTKVRIDPII VAQDGSLSGP
GMACTTVAKQ TYALPAPPDL SGGAGTSSVS GPQGDILPVS NSTLKSEDPT GAPVTSSGFL
GAPTTLAPGV LEGSEPTPAP QPVVTAPSTT PATSPAMVTP QAASQSASGN FMVQVGAVSD
QARAQQYQQQ LGQKFGVPGR VTQNGAVWRI QLGPFASKAE ASTLQQRLQT EAQLQSFITT
AQ