Gene SNSL254_A3122 details


Gene Information

Locus tag: SNSL254_A3122
Symbol:
ID: 6485726
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 3037696
End bp: 3038958
Gene Length: 1263 bp
Protein Length: 420 aa
Translation table: 11
GC content: 60%
IMG OID: 642738434
Product: YgbK domain protein
Protein accession: YP_002042158
Protein GI: 194445270
COG category: [S] Function unknown
COG ID: [COG3395] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
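
The start and end coordinates, gene length, and protein length listed above are consistent with one another. A minimal sketch of that arithmetic follows. It assumes the coordinates are 1-based and inclusive (the record does not state its convention) and that the last codon of the CDS is a stop codon. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 3037696
END_BP = 3038958


def gene_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Residues encoded by a CDS of cds_bp bases, excluding the terminal stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1263 420
```

The 1263 bases form 421 codons; dropping the stop codon gives the 420 aa reported for the protein.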


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.827819
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 51
Fosmid unclonability p-value: 0.0402148
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTAAAAA TTGGCGTCAT TGCCGACGAT TTTACCGGCG CGACTGACAT CGCCAGTTTT 
CTGGTCGAAA ACGGGATGCC GACAGTGCAG ATCAATGATG TCCCAACCGG GACGCAACCG
GAAGGATGCG ACGCGGTAGT TATCAGCCTG AAAACCCGCT CATGCCCGGC GCAAGAGGCG
ATAAAACAAT CGCTGGCGGC GCTGGTATGG CTGAAAAAAC AGGGCTGCCA GCAAGTCTAT
TTCAAATATT GCTCGACTTT CGATAGTACC GCCGAAGGCA ATATCGGCCC GGTCACCGAT
GCGCTGATGG TGGCGCTGGA TACCTCATTT ACCGTGATTT CTCCCGCGCT GCCGGTTAAC
GGACGCACGG TTTATCAGGG CTATCTGTTT GTCATGAACC ACTTGCTGGC GGAGTCCGGT
ATGCGCCACC ACCCTATCAA TCCGATGACC GACAGCTACC TGCCGCGTCT GATGGAAGCG
CAGGCGCAAG GGCGCTGCGG CGTTATTCCG GCTCAGACGC TTGATGAAGG CGTTGCCGCG
ACCCGTGCGG CGCTGTCGCG TTTACAGCAG GAAGGATATC GCTACGCGGT ACTTGACGCG
CTCAATGAGC GGCACCTGGA AATCCAGGGC GAGGTTTTGC GTGATGCCCC GCTAGTGACC
GGCGGTTCCG GGCTGGCAAT GGGGCTGGCG CGTCAGTGGG CGAAGCACGG CGTTTCTCAG
GCCCGTTCCG CAGGCTATCC GCTGAGCGGT CGCGCGGTGG TGCTTTCCGG TTCCTGTTCG
CAAATGACGA ATCAGCAGGT GGCCTTCTAT CGACAACATG CTCCCACACG CGACGTTGAC
GTGGCGCGCT GCCTGTCATC CGAGGCGCGC GAGGCCTACG CTGAAGCGCT GGCGCAGTGG
GTGCTCAGTC AGGACAGCGA ACTGGCGCCA ATGATTAGCG CCACCGCCTC CACGCAGGCG
CTGGCCGCCA TCCAGCAGCA ATATGGCGCT ACCGAAGCCA GCCATGCGGT AGAGGCGCTC
TTTTCCCTGC TGGCCGCTCG CTTAACGGAA GGCGGTATCA CCCGGTTTAT CGTGGCGGGC
GGCGAAACCT CGGGCGTGGT GACGCAAAGC CTCGGTATTA CCGGTTTTCA CATTGGACCG
TGCATTTCAC CCGGCGTGCC GTGGGTCAAC GCGCTCCATG CGCCAGTCTC GCTGGCGCTA
AAGTCAGGTA ATTTTGGCGA TGAATCCTTT TTCATCCGCG CTCAAAGGGA GTTTCAGGTA
TGA
 
Protein sequence
MLKIGVIADD FTGATDIASF LVENGMPTVQ INDVPTGTQP EGCDAVVISL KTRSCPAQEA 
IKQSLAALVW LKKQGCQQVY FKYCSTFDST AEGNIGPVTD ALMVALDTSF TVISPALPVN
GRTVYQGYLF VMNHLLAESG MRHHPINPMT DSYLPRLMEA QAQGRCGVIP AQTLDEGVAA
TRAALSRLQQ EGYRYAVLDA LNERHLEIQG EVLRDAPLVT GGSGLAMGLA RQWAKHGVSQ
ARSAGYPLSG RAVVLSGSCS QMTNQQVAFY RQHAPTRDVD VARCLSSEAR EAYAEALAQW
VLSQDSELAP MISATASTQA LAAIQQQYGA TEASHAVEAL FSLLAARLTE GGITRFIVAG
GETSGVVTQS LGITGFHIGP CISPGVPWVN ALHAPVSLAL KSGNFGDESF FIRAQREFQV
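
The protein sequence above can be reproduced from the gene sequence by codon-wise translation. The sketch below is a hand-rolled illustration, not the tool used to generate this record. It relies on the fact that translation table 11 assigns the same amino acids to elongation codons as the standard genetic code (the two differ in which codons may serve as starts), and this gene begins with ATG. The names `CODON_TABLE` and `translate` are chosen for this example only.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored,
    and any trailing partial codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, as grouped in the record.
print(translate("ATGTTAAAAA TTGGCGTCAT TGCCGACGAT"))  # prints: MLKIGVIADD
```

Passing the last five codons of the gene, `GAGTTTCAGGTATGA`, returns `EFQV`, which matches the end of the protein sequence; the final `TGA` is the stop codon and contributes no residue.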