Gene SeD_A2888 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A2888 
Symbol 
ID6874412 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp2780724 
End bp2781902 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content55% 
IMG OID642785933 
Productouter membrane protein assembly complex subunit YfgL 
Protein accessionYP_002216583 
Protein GI198243095 
COG category[S] Function unknown 
COG ID[COG1520] FOG: WD40-like repeat 
TIGRFAM ID[TIGR03300] outer membrane assembly lipoprotein YfgL 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.597931 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones85 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAATTGC GTAAATTACT TCTGCCAGGG TTGCTTTCCG TTACCCTGCT CAGCGGCTGT 
TCACTGTTTA GCGGCGAAGA AGATGTCGTC AAGATGTCTC CATTACCGCA GGTTGAAAAC
CAGTTTACCC CGACCACTGT CTGGAGCACC TCTGTCGGTA ACGGAATTGG CGAATTTTAT
TCTAACCTTC ATCCGGTAAT GGCAGATAAC GTGGTCTATG CAGCCGATCG CGCGGGCGTG
GTAAAAGCGC TCAATGCGGA TGATGGCAAA GAGATCTGGT CTGTGAATCT GGGCGAGAAA
GACGGCTGGT TCTCTCGTTC GTCCGCGTTG TTGTCCGGCG GCGTCACTGT CGCGGGCGGT
CACGTCTATA TCGGCAGTGA AAAGGCCGAG GTTTATGCGC TGAATACCAG CGACGGCACC
ACAGCATGGC AGACGAAGGT CGCAGGCGAA GCGTTGTCTC GCCCGGTTGT CAGTGATGGA
ATCGTCCTTA TCCATACCAG CAACGATCAG TTGCAGGCGC TAAATCAGGC GGATGGCGCC
ATTAAATGGA CGGTGAACCT GGATATGCCT TCGTTGTCGC TGCGCGGCGA ATCTGCGCCG
GCAACCGCGT TCGGCGCGGC TATTGTTGGT GGTGATAATG GCCGCGTCAG CGCCGTGTTG
ATGCAACAAG GCCAGATGAT TTGGCAACAG CGTATCTCCC AGGCCACTGG CCCAACGGAA
ATTGACCGTC TGAGCGACGT TGATACCACG CCGGTTGTGG TAAACGGCGT TGTTTACGCG
TTGGCGTATA ATGGTAACTT AACGGCGCTG GATCTGCGCA GTGGTCAGAT TATGTGGAAA
CGCGAGCTGG GTTCGGTCAA TGACTTTATC GTCGACGGCG ACCGTATCTA CCTGGTCGAT
CAGAACGATC GCGTGCTGGC GCTGACCACT GGCGGCGGCG TAACGCTGTG GACGCAAAGC
GATCTGCTGC ACCGTTTGCT GACCTCCCCG GTGCTGTATA ATGGTGATTT AGTCGTCGGC
GATAGCGAAG GTTATCTGCA CTGGATTAAT GTTGATGACG GGCGTTTTGT GGCCCAACAA
AAAGTAGACA GTTCCGGTTT CCTGACCGAA CCGACGGTGG CGGATGGTAA ACTGCTCATC
CAGGCCAAAG ACGGTACGGT CTACGCGATT ACGCGTTAA
 
Protein sequence
MQLRKLLLPG LLSVTLLSGC SLFSGEEDVV KMSPLPQVEN QFTPTTVWST SVGNGIGEFY 
SNLHPVMADN VVYAADRAGV VKALNADDGK EIWSVNLGEK DGWFSRSSAL LSGGVTVAGG
HVYIGSEKAE VYALNTSDGT TAWQTKVAGE ALSRPVVSDG IVLIHTSNDQ LQALNQADGA
IKWTVNLDMP SLSLRGESAP ATAFGAAIVG GDNGRVSAVL MQQGQMIWQQ RISQATGPTE
IDRLSDVDTT PVVVNGVVYA LAYNGNLTAL DLRSGQIMWK RELGSVNDFI VDGDRIYLVD
QNDRVLALTT GGGVTLWTQS DLLHRLLTSP VLYNGDLVVG DSEGYLHWIN VDDGRFVAQQ
KVDSSGFLTE PTVADGKLLI QAKDGTVYAI TR