Gene SNSL254_A2125 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A2125 
Symbol 
ID6484051 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp2055927 
End bp2057411 
Gene Length1485 bp 
Protein Length494 aa 
Translation table11 
GC content51% 
IMG OID642737480 
Productcytoplasmic alpha-amylase 
Protein accessionYP_002041227 
Protein GI194444955 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.172028 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.0000021601 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAAACC CCACGTTATT GCAGTACTTC CACTGGTATT ATCCCGACGG CGGTAAACTC 
TGGTCTGAGC TGGCGGAACG TGCTGATGGG CTGAATGATA TCGGTATCAA TATGGTCTGG
CTACCGCCCG CCTGTAAAGG CGCCTCCGGC GGCTATTCCG TAGGCTATGA TAGCTACGAC
CTGTTTGACC TCGGCGAATT TGACCAAAAA GGAACTATCG CGACAAAGTA CGGCGATAAA
CGCCAGTTAC TGACGGCGAT AGACGCGCTC AAAAAAAATA ATATTGCGGT GCTGCTCGAC
GTCGTCGTGA ACCACAAAAT GGGCGCAGAC GAAAAAGAAC GTATCCGCGT TCAGCGCGTG
AATCAGGATG ACCGCACGCA AATCGATGAC AACATCATTG AATGCGAAGG CTGGACGCGC
TACACCTTCC CTGCCCGCGC GGGCCAGTAT TCCAACTTTA TTTGGGACTA TCACTGTTTC
AGCGGCATTG ATCACATCGA GAATCCCGAC GAAGACGGCA TTTTTAAGAT CGTCAATGAC
TATACCGGCG ATGGCTGGAA CGATCAGGTT GATGATGAGC TGGGTAATTT CGACTATCTG
ATGGGGGAAA ATATCGATTT TCGCAATCAC GCGGTTACGG AAGAGATTAA ATATTGGGCT
CGTTGGGTCA TGGAACAAAC CCACTGTGAC GGCTTTCGCC TGGACGCGGT AAAACATATA
CCCGCCTGGT TTTATAAAGA ATGGATTGAG CATGTACAGG CGGTTGCGCC AAAACCGCTG
TTTATTGTCG CAGAATACTG GTCGCATGAA GTGGATAAAC TGCAAACGTA CATCGATCAG
GTCGACGGGA AAACCATGCT GTTCGACGCG CCGTTGCAGA TGAAATTTCA CGAGGCCTCG
CGCCAGGGCG CGGAGTATGA CATGCGCCAC ATATTCACCG GCACTCTGGT AGAAGCCGAC
CCTTTTCATG CGGTGACGCT GGTCGCTAAC CACGATACAC AACCGTTACA GGCGCTGGAA
GCGCCGGTAG AACCCTGGTT CAAACCATTG GCCTATGCGC TGATCCTGCT TCGTGAAAAC
GGCGTACCGT CAGTGTTTTA TCCCGATTTA TACGGCGCCA GCTATGAAGA TAGCGGCGAA
AATGGCGAGA CCTGTCGGGT CGACATGCCG GTGATTAACC AACTGGATCG GCTGATCCTC
GCTCGTCAGC GTTTTGCGCA CGGTATACAA ACACTCTTTT TCGATCATCC TAACTGTATC
GCCTTTAGTC GCAGCGGTAC TGAAGAGAAT CCAGGCTGTG TGGTTGTACT TTCCAATGGC
GACGACGGTG AAAAAACCCT CCTGCTCGGC GACAATTACG CTAACAAGAC CTGGCGTGAT
TTTCTGGGAA ACCGCAGTGA GCATGTTGTA ACTAATGATC AAGGCGAAGC GACGTTCTTC
TGCAACGCAG GCAGCGTCAG CGTGTGGGTC ATTGAGGACG TGTGA
 
Protein sequence
MKNPTLLQYF HWYYPDGGKL WSELAERADG LNDIGINMVW LPPACKGASG GYSVGYDSYD 
LFDLGEFDQK GTIATKYGDK RQLLTAIDAL KKNNIAVLLD VVVNHKMGAD EKERIRVQRV
NQDDRTQIDD NIIECEGWTR YTFPARAGQY SNFIWDYHCF SGIDHIENPD EDGIFKIVND
YTGDGWNDQV DDELGNFDYL MGENIDFRNH AVTEEIKYWA RWVMEQTHCD GFRLDAVKHI
PAWFYKEWIE HVQAVAPKPL FIVAEYWSHE VDKLQTYIDQ VDGKTMLFDA PLQMKFHEAS
RQGAEYDMRH IFTGTLVEAD PFHAVTLVAN HDTQPLQALE APVEPWFKPL AYALILLREN
GVPSVFYPDL YGASYEDSGE NGETCRVDMP VINQLDRLIL ARQRFAHGIQ TLFFDHPNCI
AFSRSGTEEN PGCVVVLSNG DDGEKTLLLG DNYANKTWRD FLGNRSEHVV TNDQGEATFF
CNAGSVSVWV IEDV