Gene SNSL254_A3994 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A3994 
Symbol 
ID6483603 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp3881999 
End bp3883144 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content41% 
IMG OID642739254 
Productlipopolysaccharide 1,2-N-acetylglucosaminetransferase 
Protein accessionYP_002042964 
Protein GI194443168 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones69 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTAAAA AAATCATATT TACTGTTACT CCTATATTTT CAATTCCTCC TCGTGGTGCG 
GCTGCGGTAG AAACCTGGAT TTACCAGGTT GCAAAACGAC TATCAATACC GAGTGCTATT
GCTTGTATAA AGAATGCTGG CTATCCTGAA TATAATAAAA TAAACGATAA TTGTGATATT
CATTACATTG GGTTTAGTAA AGTTTATAAG CGTCTTTTTC AGAAATGGAC TCGTCTCGAC
CCACTACCCT ATTCCCAGCG CGTCCTTAAT ATTAGAGATA AAGTGACTAC CCAGGAAGAT
AGCGTCATTG TTATTCATAA TAGTATGAAA CTGTATCGGC AGATCAGAGA GCGCAATCCG
AATGCAAAAC TGGTTATGCA CATGCATAAC GCATTTGAAC CAGAACTTCC TGATAACGAT
GCAAAAATTA TCGTGCCCAG TCAGTTTCTT AAAGCGTTTT ATGAAGAAAG ATTGCCTGCC
GCTGCTGTTA GTATTGTGCC TAATGGTTTT TGTGCTGAGA CTTATAAAAG AAACCCACAA
GATAATCTTC GTCAGCAATT AAATATTGCG GAAGATGCCA CCGTTCTCTT ATATGCCGGG
AGAATTTCGC CTGATAAAGG CATCCTGTTG CTTTTGCAGG CGTTCAAACA ATTACGTACC
TTAAGAAGTA ATATTAAACT TGTCGTTGTT GGCGACCCTT ATGCAAGCCG CAAGGGTGAA
AAAGCAGAGT ATCAAAAGAA AGTACTGGAC GCCGTAAAAG AGATTGGAAC GGATTGTATT
ATGGCAGGGG GGCAATCTCC CGACCAGATG CATAACTTCT ATCATATAGC CGATCTGGTT
ATTGTGCCAT CTCAGGTTGA AGAAGCATTT TGCATGGTAG CTGTAGAAGC GATGGCAGCA
GGAAAAGCGG TTCTTGCCAG CAAAAAAGGG GGGATTAGCG AATTTGTGTT AGATGGCATA
ACGGGCTATC ACCTCGCAGA GCCTATGTCG AGCGACAGTA TAATTAATGA TATTAAACGT
GCGCTTGCTG ATAAGGAACG CCACCAGATT GCCGAAAAAG CAAAATCCCT GGTGTTTTCA
AAATACAGTT GGGAAAATGT AGCGCAGCGT TTCGAGGAAC AAATGAAAAG CTGGTTTGAT
AAGTGA
 
Protein sequence
MIKKIIFTVT PIFSIPPRGA AAVETWIYQV AKRLSIPSAI ACIKNAGYPE YNKINDNCDI 
HYIGFSKVYK RLFQKWTRLD PLPYSQRVLN IRDKVTTQED SVIVIHNSMK LYRQIRERNP
NAKLVMHMHN AFEPELPDND AKIIVPSQFL KAFYEERLPA AAVSIVPNGF CAETYKRNPQ
DNLRQQLNIA EDATVLLYAG RISPDKGILL LLQAFKQLRT LRSNIKLVVV GDPYASRKGE
KAEYQKKVLD AVKEIGTDCI MAGGQSPDQM HNFYHIADLV IVPSQVEEAF CMVAVEAMAA
GKAVLASKKG GISEFVLDGI TGYHLAEPMS SDSIINDIKR ALADKERHQI AEKAKSLVFS
KYSWENVAQR FEEQMKSWFD K