Gene SeAg_B2040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeAg_B2040 
SymbolmdoC 
ID6796101 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Agona str. SL483 
KingdomBacteria 
Replicon accessionNC_011149 
Strand
Start bp1977972 
End bp1979126 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content46% 
IMG OID642776264 
Productglucans biosynthesis protein 
Protein accessionYP_002146895 
Protein GI197247793 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000145807 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCTCTG TACCCGCGCC GCGTGAATAT TTTCTTGACT CTATCCGCGC ATGGCTGATG 
TTGTTAGGGA TTCCCTTTCA TATCTCGTTG ATCTATTCCA CTCACAGTTG GCATGTCAAT
AGCGCCACGC CATCATGGTG GCTAACCCTG TTTAACGATT TTATCCACGC TTTTCGTATG
CAGGTGTTTT TTGTTATTTC TGGTTATTTT TCGTACATGT TGTTTTTACG TTATCCGTTA
AAACGCTGGT GGAAAGTACG GGTAGAACGT GTGGGTATTC CTATGCTTAC CGCAATCCCT
TTGCTTACCT TGCCGCAATT TATCCTGTTG CAATATGTCA AAGAGAAAAC AGAGAACTGG
CCTACACTCT CTGCCTATGA AAAATATAAT ACGTTAGCGT GGGAACTCAT TTCACATCTG
TGGTTTTTAC TGGTGCTGGT GATATTAACC ACAGTCAGCA TCGGGATTTT TACCTGGTTC
CAAAAAAGGC AGGAAACAAG CAAGCCTCGT CCCGCCGCTA TTTCACTAGC CAGGCTTTCG
CTTATTTTTT TCCTGCTGGG GATGGCTTAC GCCGCTATCA GGCGCATTAT TTTCATCGTA
TATCCGGCAA TCCTCAGTGA CGGCATGTTC AATTTTATTG TGATGCAAAC GCTATTTTAT
GTGCCGTTTT TTATTCTCGG CGCGTTGGCC TTCATTCACC CCGATCTGAA AGCGCGCTTC
ACCACGCCCT CACGCGGATG CACTTTAGGC GCTGCCGTTG CTTTTATCGC CTATCTGCTG
AATCAACGTT ATGGGAGCGG CGACGCCTGG ATGTACGAAA CCGAATCCGT GATTACGATG
GTCATGGGGC TGTGGATGGT GAACGTAGTA TTTTCACTGG GGCATCGCTT GTTAAACTTC
CAGTCCGCGC GCGTCACCTA TTTCGTGAAT GCTTCGCTGT TTATTTATCT GGTGCATCAT
CCCTTAACGC TTTTCTTTGG CGCGTATATT ACGCCGCATA TCTCCTCCAA CCTGATCGGG
TTCTTGTGCG GGCTGATATT TGTTATGGGT ATTGCGTTAA TTCTGTATGA AATTCATTTA
CGCATCCCGC TTCTGAAATT TCTCTTTTCA GGTAAACCGC CGGTAAAACA AGAAAGCCGC
GCCGCGATCG GGTAG
 
Protein sequence
MSSVPAPREY FLDSIRAWLM LLGIPFHISL IYSTHSWHVN SATPSWWLTL FNDFIHAFRM 
QVFFVISGYF SYMLFLRYPL KRWWKVRVER VGIPMLTAIP LLTLPQFILL QYVKEKTENW
PTLSAYEKYN TLAWELISHL WFLLVLVILT TVSIGIFTWF QKRQETSKPR PAAISLARLS
LIFFLLGMAY AAIRRIIFIV YPAILSDGMF NFIVMQTLFY VPFFILGALA FIHPDLKARF
TTPSRGCTLG AAVAFIAYLL NQRYGSGDAW MYETESVITM VMGLWMVNVV FSLGHRLLNF
QSARVTYFVN ASLFIYLVHH PLTLFFGAYI TPHISSNLIG FLCGLIFVMG IALILYEIHL
RIPLLKFLFS GKPPVKQESR AAIG