Gene SNSL254_A2281 details

Gene Information

Locus tag: SNSL254_A2281
Symbol: wcaM
ID: 6483184
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 2191468
End bp: 2192871
Gene Length: 1404 bp
Protein Length: 467 aa
Translation table: 11
GC content: 51%
IMG OID: 642737627
Product: putative colanic acid biosynthesis protein
Protein accession: YP_002041369
Protein GI: 194446744
COG category:
COG ID:
TIGRFAM ID:
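
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2191468
end_bp = 2192871

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in codons of 3 bases; the last codon is assumed to be a stop
# codon, which does not contribute an amino acid.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, codon_count, protein_length_aa)
```

Under these assumptions the computed values agree with the listed Gene Length (1404 bp) and Protein Length (467 aa).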


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value: 0.00000130996
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCCCGCGA CTAAATTCTC CCGACGTACT CTCCTGACGG CAGGTTCTGC GCTTGCTGTT 
CTTCCTTTTC TGCGCGCCTT GCCGGTACAG GCGCGTGAAC CTCGCGAGAC CGTCGATATT
AAGGATTATC CGGCGGATGA CGGTATCGCC TCGTTCAAAC AGGCCTTCGC CGACGGACAG
ACCGTGGTCG TACCGCCAGG ATGGGTGTGT GAAAATATCA ATGCGGCGAT AACGATTCCG
GCGGGAAAAA CTCTGCGGGT ACAGGGCGCG GTGCGTGGGA ATGGCCGGGG ACGGTTTATT
TTGCAGGACG GGTGTCAGGT GGTGGGGGAG CAGGGCGGCA GTCTGCACAA TGTGACGCTG
GATGTTCGCG GGTCGGACTG TGTGATTAAA GGCGTGGCGA TGAGCGGCTT TGGCCCCGTC
GCGCAAATTT TCATCGGTGG TAAGGAACCG CAGGTGATGC GTAATCTCAT TATCGATGAC
ATCACCGTTA CCCACGCCAA CTACGCCATT CTCCGCCAGG GATTTCATAA CCAAATGGAC
GGCGCGCGGA TTACGCATAG CCGCTTTAGC GATTTGCAGG GGGACGCCAT TGAGTGGAAT
GTCGCGATTC ACGACCGCGA CATCCTGATT TCCGATCATG TCATCGAACG CATTGATTGT
ACCAATGGCA AAATCAACTG GGGGATCGGC ATCGGGCTGG CGGGTAGCAC CTATGACAAC
AGTTATCCTG AAGACCAGGC AGTAAAAAAC TTTGTGGTGG CCAATATTAC CGGATCTGAT
TGCCGACAGC TGGTGCACGT AGAAAATGGC AAACATTTCG TCATTCGCAA TGTCAAAGCC
AAAAACATCA CGCCCGATTT CAGTAAAAAT GCGGGTATTG ATAACGCAAC GATCGCAATT
TATGGCTGTG ATAATTTCGT CATTGATAAT ATTGATATGA CGAATAGTGC CGGGATGCTC
ATCGGCTATG GCGTCGTTAA AGGAAAATAC CTGTCAATTC CGCAAAACTT TAAATTAAAC
GCTATTCGGT TGGATAATCG CCAGGTTGCT TATAAATTAC GCGGCATTCA AATTTCATCC
GGTAACGCCC CCTCATTTGT TGCCATCACC AATGTACGGA TGACGCGTGC TACGCTGGAA
CTGCATAATC AACCGCAGCA CCTCTTTTTG CGTAATATCA ACGTGATGCA AACTTCAGCG
ATTGGCCCGG CGTTAAAAAT GCATTTCGAT TTGCGTAAAG ATGTCCGTGG TCAATTTATG
GCCCGCCAGG ACACGCTGCT TTCCCTCGCT AATGTTCATG CCATCAATGA AAACGGGCAG
AGTTCCGTGG ATATCGACAG GATTAATCAC CAAACCGTGA ATGTCGAAGC AGTGAATTTT
TCGCTGCCGA AGCGGGGAGG GTAA
 
Protein sequence
MPATKFSRRT LLTAGSALAV LPFLRALPVQ AREPRETVDI KDYPADDGIA SFKQAFADGQ 
TVVVPPGWVC ENINAAITIP AGKTLRVQGA VRGNGRGRFI LQDGCQVVGE QGGSLHNVTL
DVRGSDCVIK GVAMSGFGPV AQIFIGGKEP QVMRNLIIDD ITVTHANYAI LRQGFHNQMD
GARITHSRFS DLQGDAIEWN VAIHDRDILI SDHVIERIDC TNGKINWGIG IGLAGSTYDN
SYPEDQAVKN FVVANITGSD CRQLVHVENG KHFVIRNVKA KNITPDFSKN AGIDNATIAI
YGCDNFVIDN IDMTNSAGML IGYGVVKGKY LSIPQNFKLN AIRLDNRQVA YKLRGIQISS
GNAPSFVAIT NVRMTRATLE LHNQPQHLFL RNINVMQTSA IGPALKMHFD LRKDVRGQFM
ARQDTLLSLA NVHAINENGQ SSVDIDRINH QTVNVEAVNF SLPKRGG
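
The sequences above are displayed in space-separated blocks of 10 characters, so they need cleaning before any analysis. The following is a minimal sketch, not a definitive tool: the helper names (`clean_sequence`, `gc_fraction`, `looks_like_complete_cds`) are hypothetical, and the completeness check is simplified to accept only ATG as a start codon, although translation table 11 also permits alternative starts.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Stop codons used by translation table 11 (bacterial, archaeal, plant plastid).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def looks_like_complete_cds(seq: str) -> bool:
    """Simplified check: whole codons, ATG start, recognised stop codon."""
    return len(seq) % 3 == 0 and seq[:3] == "ATG" and seq[-3:] in STOP_CODONS


# First and last lines of the gene sequence as displayed in the record.
first_line = "ATGCCCGCGA CTAAATTCTC CCGACGTACT CTCCTGACGG CAGGTTCTGC GCTTGCTGTT"
last_line = "TCGCTGCCGA AGCGGGGAGG GTAA"

first_bases = clean_sequence(first_line)
last_bases = clean_sequence(last_line)

print(len(first_bases))               # bases on the first displayed line
print(first_bases[:3], last_bases[-3:])  # start codon and final codon
print(gc_fraction(first_bases[:10]))  # GC fraction of the first 10-base block
```

Passing the full cleaned gene sequence to `gc_fraction` would give a value to compare against the GC content field in the record; the sketch only demonstrates the functions on short excerpts.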