Gene SNSL254_A1701 details


Gene Information

Locus tag: SNSL254_A1701
Symbol:
ID: 6486556
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 1668918
End bp: 1669979
Gene Length: 1062 bp
Protein Length: 353 aa
Translation table: 11
GC content: 52%
IMG OID: 642737081
Product: hypothetical protein
Protein accession: YP_002040833
Protein GI: 194445562
COG category: [S] Function unknown
COG ID: [COG3391] Uncharacterized conserved protein
TIGRFAM ID: [TIGR02276] 40-residue YVTN family beta-propeller repeat
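
The coordinate and length fields in this record are consistent with each other, which a little arithmetic confirms. The Python sketch below assumes 1-based inclusive coordinates and a complete CDS whose final codon is a stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(1668918, 1669979)
print(length, protein_length(length))  # prints: 1062 353
```

Both values match the Gene Length (1062 bp) and Protein Length (353 aa) fields of the record.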


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.117434
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 67
Fosmid unclonability p-value: 0.828136
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACTTAC GTCATCTTTT TTCGCCGCGC CTGCGTGGTT CTTTATTGTT AGGTTCGCTC 
CTCGTCGCAT CCTCATTTAG CACGCTGGCG GCGGAAGACA TGCTGCGTAA AGCGGTAGGC
AAAGGCGCTT ATGAGATGGC CTGGAGTCAG CAAGAAAACG CGCTCTGGCT GGCTACATCG
CAAAGCCGTA AACTGGATAA AGGCGGCGTA GTTTATCGTC TCGACCCGGT GACGCTGGAA
ATCACGCAAG CGATTCATAA CGATCTCAAG CCGTTCGGCG CCACCATCAA TGCCGCGACC
CAAACGCTGT GGTTTGGCAA TACCATTAAC AGCGCTGTTA CCGCGATTGA TGCCAAAACG
GGTGATGTAA AAGGTCGTCT GGTACTTGAT GCGCGCAAAC GTACTGAAGA GGTTCGTCCG
TTACAGCCCC GTGAGCTGGT TGCCGATGCG TCTACCAACA CGATCTACAT TAGCGGTGTT
GGTAAAGAGA GTGCTATTTG GGTAGTGGAT GGCGAAACCA TCAAACTGAA AACGACGATC
GAAAATACCG GCAAAATGAG TACGGGTCTG GCGCTCGACA GTAAAGCGCA ACGCCTGTAC
ACCACCAATG CGGATGGCGA ATTTATCACC ATCGATACCG CCAGCAATAA AATTCTCAGT
CGTAAGAAGT TGCTGGATGA CGGTAAAGAA CACTTCTTTA TTAACCTGAG TCTCGATACC
GCAGGTCATC GCGCGTTTAT CACCGACTCG AAGGCGACTG AGGTTCTGGT TGTCGATACC
CGTAATGGCA ATATTCTTGC CAAAATCGCG GCGCCTGCTT CTTTGGCCGT CCTGTTTAAC
CCGACACGTA ACGAGGCGTA TGTGACACAT CGTCAGGCAG GTCAGGTCAG CGTGATCGAT
GCGAAGACCT ATAACGTTGT TAAAACGTTC GATACGCCGA CGTACCCGAA TAGCCTGGCG
CTATCGGCAG ACGGTAAAAC GCTCTACGTC AGCGTGAAGC AGAAATCGAC ACGTGAACAG
GAAGCGACGC AGCCGGATGA TGTTATTCGC ATTGCTCTGT AA
 
Protein sequence
MHLRHLFSPR LRGSLLLGSL LVASSFSTLA AEDMLRKAVG KGAYEMAWSQ QENALWLATS 
QSRKLDKGGV VYRLDPVTLE ITQAIHNDLK PFGATINAAT QTLWFGNTIN SAVTAIDAKT
GDVKGRLVLD ARKRTEEVRP LQPRELVADA STNTIYISGV GKESAIWVVD GETIKLKTTI
ENTGKMSTGL ALDSKAQRLY TTNADGEFIT IDTASNKILS RKKLLDDGKE HFFINLSLDT
AGHRAFITDS KATEVLVVDT RNGNILAKIA APASLAVLFN PTRNEAYVTH RQAGQVSVID
AKTYNVVKTF DTPTYPNSLA LSADGKTLYV SVKQKSTREQ EATQPDDVIR IAL
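
The sequences above are laid out in space-separated blocks of ten characters, so they must be joined before any length or composition check. The following is a minimal Python sketch of that cleanup together with a GC-fraction calculation. The helper names `join_blocks` and `gc_fraction` are hypothetical, and the example input is only the first line of the gene sequence, not the full record.

```python
def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence listed above, used as example input.
first_line = "ATGCACTTAC GTCATCTTTT TTCGCCGCGC CTGCGTGGTT CTTTATTGTT AGGTTCGCTC"
seq = join_blocks(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full gene sequence through `join_blocks` gives a string whose length can be compared with the Gene Length field (1062 bp), and `gc_fraction` on that string can be compared with the reported GC content of 52%.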