Gene SNSL254_A1702 details

Gene Information

Locus tag: SNSL254_A1702
Symbol:
ID: 6485612
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 1670243
End bp: 1672363
Gene Length: 2121 bp
Protein Length: 706 aa
Translation table: 11
GC content: 53%
IMG OID: 642737082
Product: putative TonB-dependent receptor yncD
Protein accession: YP_002040834
Protein GI: 194445165
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID: [TIGR01783] TonB-dependent siderophore receptor
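
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1670243
end_bp = 1672363
protein_length_aa = 706

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# A CDS of N bp holds N / 3 codons; the final codon is assumed to be a stop
# codon and therefore does not contribute an amino acid.
codon_count = gene_length_bp // 3
expected_protein_length = codon_count - 1

print(gene_length_bp)                                  # 2121
print(gene_length_bp % 3 == 0)                         # True
print(expected_protein_length == protein_length_aa)    # True
```

Under these assumptions the reported gene length (2121 bp) and protein length (706 aa) agree with the coordinates.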


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.770108
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 66
Fosmid unclonability p-value: 0.940102
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAATTA TTTCCGTTAG ACAGCGACTT TATCCCGCCC TCTTATTGCC TCTGACCTTC 
TCCCCGGTCC TTCAGGCCGC CAGCGCGTCC AATGAACAAA CCATGATAGT CACCGCAACG
CCGCAAACCG TATCCGAACT GGATACACCT GCCGCCGTTA GCGTCATTGA GGGAGAAGAC
ATGCGTCTGG CGACGCCGCG CGTTAATTTA TCTGAGTCTT TAACCAGCGT CCCGGGATTG
CAGGTGCAGA ACCGGCAGAA CTACGCGCAG GATCTGCAAA TCTCTATCCG TGGATTTGGC
TCGCGTTCCG CCTTTGGCGT GCGCGGCATT CGTCTGTATG TTGATGGCAT TCCCGCCACC
ATGCCGGATG GACAAGGCCA GATTTCGAAT ATTGATATTA ACAGCATACA AGACGTTGAA
GTGTTACGCG GCCCCTTCTC AGCGCTATAC GGCAATGCTT CCGGCGGCGT AATAAATGTC
ACCACCGAAA CCGGGAGACA GCCGCCCACC CTTGAGGCCA GCAGCTACTA CGGCAGTTAT
GGAAGCTGGC GCTATGGGCT AAAAGCCACG GGCGCGATGG GTGACGGCAC ACAGCCTGGC
GACGTGGATT ACACGGTATC AACTACCCGT TTCACCACCC ACGGCTACCG CGATCACAGC
GGCGCGCGAA AAAATCTGGC TAATGCCAAA CTGGGCGTGC GCCTTGACGA TGTCAGTAAA
CTCTCGCTGA TTTTTAATAG TGTGGACATT AAGGCGGACG ATCCCGGCGG ACTTACCGAA
TCTGAATGGA AAGCAGATCC GCAACAGGCG CCCCGCGCTG AACAGTACAA TACGCGTAAA
ACCATCAAAC AAACTCAGGC AGGATTGCGT TATGAACGTC AGCTCAGCGC GCAGGATGAT
ATCAGCGTGA TGGCCTACGC CGGAGAGCGG GAAACCACGC AATACCAGTC TATTCCCCTG
GTGGCACAGC TAAAACCGGC CCAGGCTGGC GGCGTAATTA CACTGCAACG CCACTATCAG
GGTATTGATT CCCGCTGGAC TCACCGGGGA GAATTAGGTG TGCCGGTCAC CTTCACTGGC
GGAGTAAACT ATGAAAACAT GAGCGAAAAC CGCAAGGGTT ACAATAACTT CCGTCTCAAC
AACGGCACCC CTGAGTTTGG GCATAAAGGC GATTTACGGC GGGATGAGCG TAACCTGATG
TGGAATGTCG ATCCTTATCT GCAAACCCAG TGGCAGCTTA CGCAAAAACT CTCGCTGGAT
GCTGGCGTGC GCTACAGTTC CGTGTGGTTC GATTCTAACG ATCACTATAT CGCGCCCGGC
AATGGCGATG ATAGCGGCGA CGCCAGCTAT CACCACTGGC TACCCGCCGG ATCGCTAAAA
TATGCGTTAA CTGACGCCTG GAATCTCTAT CTTGCTGCCG GTCGCGGATT TGAAACTCCC
ACTATCAATG AGCTTTCTTA TCGCGCTGAC GGCCAAAGCG GGTTCAACTT TGATCTCAAA
CCGTCGACCA ATGATACGGT GGAAGTCGGC AGTAAAACGC GGATTGGCAA TGGCTTGTTG
ACCGCCGCGT TATTTCAAAC CGATACCGAT GATGAAATTG TCGTCGCCAG TAGTATGGGA
GGACGCACTA CCTACAAAAA CGCGGGCAAA ACTCGCCGCC AGGGCGCAGA ACTCGCGTTG
GACCAGCGCT TCGCCGGCGA CTGGCGGGTG AAAGCATCAT GGACCTGGCT GGATGCGACC
TATCGCAGTA ACGTTTGTCA GGGGCAAAAC TGTGATGGAA ACCGAATGCC CGGCATCGCC
CGTAATATGG GATTCGCCTC ATTAGGTTTT ATCCCGGATG AGGGGTGGTA CGCCGGAACA
GATGTTCGGT ATATGGGCGA TATCATGGCC AATGATGAAA ATACCGCCAA AGCGCCTTCG
TATACCGTTG TTGGACTAAA TACAGGGTAT AAATTTAATT ACAGCCAGCT TACCGTCGAT
ATTTTTGGGC GGGTAGATAA TCTATTTGAT AAAGAATATA TCGGTTCGGT GATCGTTAAT
GAATCTAATG GCCGATATTA TGAGCCAGCG CCAGGGCGTA ACTATGGCGT GGGGATTAAT
CTGGCATGGC GATTTGAATA A
 
Protein sequence
MKIISVRQRL YPALLLPLTF SPVLQAASAS NEQTMIVTAT PQTVSELDTP AAVSVIEGED 
MRLATPRVNL SESLTSVPGL QVQNRQNYAQ DLQISIRGFG SRSAFGVRGI RLYVDGIPAT
MPDGQGQISN IDINSIQDVE VLRGPFSALY GNASGGVINV TTETGRQPPT LEASSYYGSY
GSWRYGLKAT GAMGDGTQPG DVDYTVSTTR FTTHGYRDHS GARKNLANAK LGVRLDDVSK
LSLIFNSVDI KADDPGGLTE SEWKADPQQA PRAEQYNTRK TIKQTQAGLR YERQLSAQDD
ISVMAYAGER ETTQYQSIPL VAQLKPAQAG GVITLQRHYQ GIDSRWTHRG ELGVPVTFTG
GVNYENMSEN RKGYNNFRLN NGTPEFGHKG DLRRDERNLM WNVDPYLQTQ WQLTQKLSLD
AGVRYSSVWF DSNDHYIAPG NGDDSGDASY HHWLPAGSLK YALTDAWNLY LAAGRGFETP
TINELSYRAD GQSGFNFDLK PSTNDTVEVG SKTRIGNGLL TAALFQTDTD DEIVVASSMG
GRTTYKNAGK TRRQGAELAL DQRFAGDWRV KASWTWLDAT YRSNVCQGQN CDGNRMPGIA
RNMGFASLGF IPDEGWYAGT DVRYMGDIMA NDENTAKAPS YTVVGLNTGY KFNYSQLTVD
IFGRVDNLFD KEYIGSVIVN ESNGRYYEPA PGRNYGVGIN LAWRFE
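
The gene and protein sequences above are printed in space-separated blocks of ten. The sketch below shows one way to strip that grouping, compute a GC fraction, and translate a DNA fragment. It assumes the standard codon assignments (which translation table 11 shares with table 1; the two differ in permitted start codons), and the helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical, not part of any database API. Only the first and last lines of the gene sequence are used as input here.

```python
# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at a stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the listing above.
first_line = clean_sequence(
    "ATGAAAATTA TTTCCGTTAG ACAGCGACTT TATCCCGCCC TCTTATTGCC TCTGACCTTC"
)
last_line = clean_sequence("CTGGCATGGC GATTTGAATA A")

print(translate(first_line))  # MKIISVRQRLYPALLLPLTF
print(translate(last_line))   # LAWRFE
```

The two translated fragments match the start (`MKIISVRQRL YPALLLPLTF`) and end (`LAWRFE`) of the protein sequence listed above. Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare with the reported GC content.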