Gene Shewmr4_1734 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_1734 
Symbol 
ID4252308 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp2061841 
End bp2063091 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content51% 
IMG OID638118345 
ProductLolC/E family lipoprotein releasing system, transmembrane protein 
Protein accessionYP_733865 
Protein GI113970072 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG4591] ABC-type transport system, involved in lipoprotein release, permease component 
TIGRFAM ID[TIGR02212] lipoprotein releasing system, transmembrane protein, LolC/E family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00172788 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.754695 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGGAC CATTAGCCCT CTCCATTGGT TGGCGTTTTT ATCGTGCGCG CCAATCCAAT 
AGTTTTATTA GTTTTATCTC CTTTGCATCA ACCGCGGGCA TTGCGCTAGG GGTTGCAGTA
CTGATTGTGG TGCTCTCAGC AATGAATGGC TTTGAGCGTG AGTTAGAGCA GCGCTTGTTA
GGTGTGATCT CCCAAGCCGA TGTGGTTGGC GTGAATGAGC CGATTGCCGA CTGGCGCGCA
GTTGAGCAAA CCGCCATGCA GATTGAAGGC ATTACGGCGG CGGCACCTTT TATTCGGATG
CAAGGATTAG TACAAAAGCC CGGTGGTTTT CAGGGGCTTG CTGTTGTGGG AATCGACCCT
GAGCAAGAGG CAAAAGTCTC GACTCTCTCG CAATTTATGT CGAAAGAGAC TTGGCAAGGC
TTAGGCGAGG ATGACAATCA CATCGTCCTC GGTGAGAGCT TGCTGAAAAA GTTAGGCCTC
GAAGTTGGCG ATACCCTCGC TCTGTATGTG CAAGATCTTG ATCCTGAACA TGCCGGCAGT
TTACGGGCGG CCAAGAGCCA TCGCTTTGTG GTGTCGGGTG TGTACCGTTT AGGTGGCGAG
CTTGAGTTAA CCACGGCGTA TATTCCGATG CGCTATGCGG CGAATATCCT GAATTTACAT
CAAGGTGTCA CTGGGGTGCG GATCAGTGTG GCGCAGGTGT TTGATGCGCC AGCGAAAATT
CGTGAGTTGG GTTATGCCTT AAACCAGTCC GTTTATATCA GTGATTGGAC GCGTACCCAA
GGGCATTTAT ATCAAGATAT TCAATTGGTT CGCACCATTA TGTATCTCGT TTTGGTGTTA
GTGATTGGCG TGGCCTGTTT CAATATTGTC TCAACGCTAG TCATGGCGGT GCGGGATAAA
GCCAGTGAAA TCGCCATTCT GATGACCATG GGGTTAAGCC GTCTCTCAGT GATGGGGATT
TTTATGGTGC AAGGCGCGTT AAATGGCCTT GTAGGTTGTG CCCTCGGCGG TGTGATAGGT
ATTGCGACCG CGGTGAATCT CAGTGGTATT GCCCGTGGTA TTGAGCAGCT GCTCGGAATT
CAACTCCTGT CGGCCGATGT GTATTTTGTG GATTTTCTGC CGTCAGAGCT ACATATGACA
GATGCTGGTT TAGTGATTGC CACGGCGTTT GTGATGAGTC TTATCGCAAC CCTGTATCCC
GCGTGGAAGG CGAGCCAGAT TGGCCCTGCG CAGGCGTTGG CGGGTAGGTA G
 
Protein sequence
MKGPLALSIG WRFYRARQSN SFISFISFAS TAGIALGVAV LIVVLSAMNG FERELEQRLL 
GVISQADVVG VNEPIADWRA VEQTAMQIEG ITAAAPFIRM QGLVQKPGGF QGLAVVGIDP
EQEAKVSTLS QFMSKETWQG LGEDDNHIVL GESLLKKLGL EVGDTLALYV QDLDPEHAGS
LRAAKSHRFV VSGVYRLGGE LELTTAYIPM RYAANILNLH QGVTGVRISV AQVFDAPAKI
RELGYALNQS VYISDWTRTQ GHLYQDIQLV RTIMYLVLVL VIGVACFNIV STLVMAVRDK
ASEIAILMTM GLSRLSVMGI FMVQGALNGL VGCALGGVIG IATAVNLSGI ARGIEQLLGI
QLLSADVYFV DFLPSELHMT DAGLVIATAF VMSLIATLYP AWKASQIGPA QALAGR