Gene EcSMS35_1354 details


Gene Information

Locus tag: EcSMS35_1354
Symbol:
ID: 6143516
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1342552
End bp: 1343790
Gene Length: 1239 bp
Protein Length: 412 aa
Translation table: 11
GC content: 51%
IMG OID: 641616232
Product: PqiA family integral membrane protein
Protein accession: YP_001743412
Protein GI: 170683557
COG category: [S] Function unknown
COG ID: [COG2995] Uncharacterized paraquat-inducible protein A
TIGRFAM ID: [TIGR00155] integral membrane protein, PqiA family
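
The coordinate and length fields above are mutually consistent, which a short check can confirm. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1342552, 1343790)
print(length, protein_length_aa(length))  # prints: 1239 412
```

Both values agree with the Gene Length and Protein Length fields listed in the record.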


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000343828
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value: 0.147943
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGGGCTA TCGGCGAGGA ACTGCCGCGT GGTGATTACC AACGTTGCCC GCAATGTGAC 
ATGCTGTTTA GCCTGCCCGA GATAAATTCT CATCAAAGTG CCTATTGTCC GCGCTGTCAG
GCAAAAATTC GTGACGGGCG CGACTGGTCG CTAACGCGCC TGGCGGCAAT GGCGTTCGCT
ATGCTGTTGT TGATGCCGTT TGCCTGGGGC GAACCGCTGT TGCATATCTG GCTGTTAGGC
ATCCGTATTG ACGCCAACGT TATGCAAGGC ATCTGGCAAA TGACCAAACA GGGCGATGCG
ATAACGGGGT CGATGGTCTT TTTTTGCGTT ATTGGTGCCC CCCTTATTCT GGTGACCTCC
ATAGCTTATT TATGGTTTGG TAACCGACTG GGAATGAATT TACGTCCGGT ACTGCTGATG
CTTGAGCGAC TGAAAGAGTG GGTAATGCTC GATATCTACC TGGTCGGCAT TGGCGTTGCT
TCTATAAAGG TACAGGATTA TGCCCATATC CAGGCGGGTG TCGGCTTGTT CTCTTTTGTG
GCGTTGGTGA TTTTAACGAC GGTGACGTTG TCACATCTTA ATGTCGAAGA ACTGTGGGAG
CGATTTTATC CGCAGTGCCC CGCTACGCGA AGGGACGAGA AACTCCGTGT CTGTCTTGGG
TGTCATTTTA CCGGCTATCC TGATCAGCGT GGTCGCTGCC CGCGTTGCCA TATCCCGCTA
CGCCTGCGTC GCCGTCATAG TCTGCAAAAA TGCTGGGCGG CGCTGTTAGC GTCAATCGTT
TTGTTGTTAC CTGCCAACCT GTTGCCTATC TCTATCATTT ATCTGAATGG TGGTCGGCAG
GAAGATACGA TTCTTTCCGG AATTATGTCG CTGGCAAGTA GCAACATTGC GGTTGCGGGA
ATCGTGTTTA TCGCCAGTAT TCTGGTACCG TTTACTAAAG TGATCGTCAT GTTCACTTTA
CTGTTGAGTA TTCATTTTAA ATGCCAGCAA GGTTTACGCA CACGCATTCT GTTACTGCGG
ATGGTGACCT GGATTGGTCG CTGGTCGATG CTCGACCTGT TTGTCATATC TTTAACCATG
TCGCTGATTA ATCGCGATCA GATCCTCGCT TTTACTATGG GACCGGCTGC GTTTTATTTC
GGCGCAGCGG TAATTTTGAC TATTCTTGCT GTGGAATGGC TGGACAGCCG CTTACTTTGG
GATGCACATG AGTCAGGAAA CGCCCGCTTC GACGACTGA
 
Protein sequence
MRAIGEELPR GDYQRCPQCD MLFSLPEINS HQSAYCPRCQ AKIRDGRDWS LTRLAAMAFA 
MLLLMPFAWG EPLLHIWLLG IRIDANVMQG IWQMTKQGDA ITGSMVFFCV IGAPLILVTS
IAYLWFGNRL GMNLRPVLLM LERLKEWVML DIYLVGIGVA SIKVQDYAHI QAGVGLFSFV
ALVILTTVTL SHLNVEELWE RFYPQCPATR RDEKLRVCLG CHFTGYPDQR GRCPRCHIPL
RLRRRHSLQK CWAALLASIV LLLPANLLPI SIIYLNGGRQ EDTILSGIMS LASSNIAVAG
IVFIASILVP FTKVIVMFTL LLSIHFKCQQ GLRTRILLLR MVTWIGRWSM LDLFVISLTM
SLINRDQILA FTMGPAAFYF GAAVILTILA VEWLDSRLLW DAHESGNARF DD
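
The protein sequence corresponds to the gene sequence read under translation table 11, in which the GTG start codon is read as methionine. The sketch below illustrates that relationship on the first and last lines of the gene sequence. It is a simplified, hypothetical helper: it builds the standard codon assignments (which table 11 shares for amino acids) and treats only ATG, GTG and TTG as initiators, a subset of the start codons table 11 allows.

```python
from itertools import product

BASES = "TCAG"
# Standard amino-acid assignments in TCAG codon order (shared by table 11).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Assumption: a simplified subset of the table 11 initiator codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "GTGAGGGCTA TCGGCGAGGA ACTGCCGCGT GGTGATTACC AACGTTGCCC GCAATGTGAC"
last_line = "GATGCACATG AGTCAGGAAA CGCCCGCTTC GACGACTGA"
print(translate(first_line))  # MRAIGEELPRGDYQRCPQCD
print(translate(last_line))   # DAHESGNARFDD
```

The two outputs match the first 20 and last 12 residues of the protein sequence listed above.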