Gene SeD_A4057 details

Gene Information

Locus tag: SeD_A4057
Symbol:
ID: 6873474
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 3898932
End bp: 3900209
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 52%
IMG OID: 642787006
Product: 2,3-diketo-L-gulonate TRAP transporter large permease YiaN
Protein accession: YP_002217633
Protein GI: 198242335
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1593] TRAP-type C4-dicarboxylate transport system, large permease component
TIGRFAM ID: [TIGR00786] TRAP transporter, DctM subunit
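
The lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming 1-based inclusive coordinates and a single untranslated stop codon at the end of the CDS; the function names are illustrative and not part of any database API:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming one terminal stop codon is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3898932, 3900209)
print(length, protein_length(length))  # 1278 425
```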


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.998541
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 61
Fosmid unclonability p-value: 0.502761
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGTGG TGATATTTCT CTGCTGCCTG CTCGGCGGGA TCGCGATAGG TTTACCCATC 
GCCTGGTCGC TGCTGCTTTG CGGCGCTGCT CTGATGGCAT ACCTGGATAT GTTTGACGTG
CAGATTATGG CGCAAACCCT GGTTAACGGC GCGGACAGTT TCTCCCTGCT GGCTATTCCC
TTTTTTGTTT TGGCCGGTGA AATCATGAAC GCGGGCGGCC TGTCAAAGCG AATTGTCGAC
CTGCCGATGA AGCTGGTCGG CCATAAACCC GGCGGCCTGG GCTACGTGGG CGTTATTGCG
GCAATGATTA TGGCCAGCCT TTCCGGCTCT GCGGTAGCAG ATACCGCTGC GGTCGCCGCG
CTGCTGGTGC CGATGATGCG CTCCGCAAAC TACCCGATCA ACCGCTCCGT TGGGTTAATC
GCTTCCGGCG GGATCATTGC GCCAATTATT CCACCCTCGA TTCCTTTTAT TATCTTCGGC
GTTTCCAGCG GCTTGTCGAT CAGCAAGCTG TTTATGGCCG GGATCGCACC GGGCATCATG
ATGGGCGCGG CGCTTATGCT CACCTGGTGG TGGCAGGCCG GGCGATTAAA TCTCCCTTCT
CAGCCTAAAG CAACACCGCG TGAAATCTGG CAATCATTGG TTTCAGGTAT CTGGGCGCTG
TTTTTACCGG TGATTATTAT CGGCGGCTTC CGTTCCGGAC TTTTCACGCC AACGGAGGCA
GGGGCGGTTG CCGCGTTTTA CGCCCTCTTT GTCGCCGTTG TTATCTATCG GGAATTAACG
TTTTCCAGTC TCTACCACGT GCTGGTCAAT GCCGCCAAAA CGACGTCAGT CGTCATGTTT
CTGGTGGCCG CGGCCCAGGT ATCCGCCTGG CTGATTACGA TCGCGGAATT ACCCATGATG
GTGTCAGATT TGCTGCAGCC GCTGGTCGAC TCTCCGCGAC TCTTATTTAT CGTCATTATG
ATCTCAATTA TGGTCGTCGG TATGGTGATG GATTTAACGC CAACGGTGTT AATTCTTACC
CCTGTATTAT TGCCATTAGT TAAAGAAGCC AATATTGACC CGATTTATTT CGGCGTCATG
TTCATTATTA ACTGCTCTAT TGGATTAATC ACACCGCCCG TTGGCAACGT CCTCAACGTT
ATTTCCGGGG TAGCAAAATT GAAATTTGAT GACGCGGTAA GAGGCGTATT CCCTTACGTT
GTCGTACTGA TGTCGCTGCT GGTTTTATTT ATTTTTATTC CCGAGCTAAT TATCACACCG
CTTAAATGGA TTAATTAA
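
The gene sequence is printed in blocks of ten bases. A minimal sketch, assuming the listing is pasted in as plain text, of how one might strip that grouping and compute length and GC content; `clean_sequence` and `gc_percent` are hypothetical helper names, not functions of any existing tool:

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Only the first line of the listing is used here; paste the full
# listing instead to check the whole gene.
first_line = "ATGGCCGTGG TGATATTTCT CTGCTGCCTG CTCGGCGGGA TCGCGATAGG TTTACCCATC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Run on the full listing, the length is expected to match the reported 1278 bp, and the GC content can be compared with the reported 52%.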
 
Protein sequence
MAVVIFLCCL LGGIAIGLPI AWSLLLCGAA LMAYLDMFDV QIMAQTLVNG ADSFSLLAIP 
FFVLAGEIMN AGGLSKRIVD LPMKLVGHKP GGLGYVGVIA AMIMASLSGS AVADTAAVAA
LLVPMMRSAN YPINRSVGLI ASGGIIAPII PPSIPFIIFG VSSGLSISKL FMAGIAPGIM
MGAALMLTWW WQAGRLNLPS QPKATPREIW QSLVSGIWAL FLPVIIIGGF RSGLFTPTEA
GAVAAFYALF VAVVIYRELT FSSLYHVLVN AAKTTSVVMF LVAAAQVSAW LITIAELPMM
VSDLLQPLVD SPRLLFIVIM ISIMVVGMVM DLTPTVLILT PVLLPLVKEA NIDPIYFGVM
FIINCSIGLI TPPVGNVLNV ISGVAKLKFD DAVRGVFPYV VVLMSLLVLF IFIPELIITP
LKWIN
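
The protein sequence corresponds to the gene sequence translated with translation table 11. A minimal standard-library sketch that translates the first and last stretches of the gene; it assumes the standard codon-to-amino-acid assignments, which table 11 shares with the standard code (the two differ in their permitted start codons, which does not matter here because the gene starts with ATG). `translate` is a hypothetical helper:

```python
from itertools import product

# Codons enumerated in TCAG order, with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a nucleotide sequence codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGCCGTGGTGATATTTCTCTGCTGCCTG"))  # MAVVIFLCCL
print(translate("CTTAAATGGATTAATTAA"))              # LKWIN
```

The two outputs match the first ten and last five residues of the protein sequence above.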