Gene SeHA_C3995 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C3995 
Symbol 
ID6490486 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp3871775 
End bp3873052 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content52% 
IMG OID642744096 
Product2,3-diketo-l-gulonate trap transporter large permease yian 
Protein accessionYP_002047701 
Protein GI194451797 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1593] TRAP-type C4-dicarboxylate transport system, large permease component 
TIGRFAM ID[TIGR00786] TRAP transporter, DctM subunit 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones80 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGTGG TGATATTTCT CTGCTGCCTG CTCGGCGGGA TCGCGATAGG TTTACCCATC 
GCCTGGTCGC TGCTGCTTTG CGGCGCTGCT CTGATGGCAT ACCTGGATAT GTTTGACGTG
CAGATTATGG CGCAAACCCT GGTTAACGGC GCGGACAGTT TCTCCCTGCT GGCTATTCCC
TTTTTTGTTT TGGCCGGTGA AATCATGAAC GCGGGCGGCC TGTCAAAGCG AATTGTCGAC
CTGCCGATGA AGCTGGTCGG CCATAAACCC GGCGGCCTGG GCTACGTGGG CGTTATTGCG
GCAATGATTA TGGCCAGCCT TTCCGGCTCT GCGGTAGCAG ATACCGCTGC GGTCGCCGCG
CTGCTGGTGC CGATGATGCG CTCCGCAAAC TACCCGATCA ACCGCTCCGT TGGGTTAATC
GCTTCCGGCG GGATCATTGC GCCAATTATT CCACCCTCGA TTCCTTTTAT TATCTTCGGC
GTTTCCAGCG GCTTGTCGAT CAGCAAGCTG TTTATGGCCG GGATCGCACC GGGCATCATG
ATGGGCGCGG CGCTTATGCT CACCTGGTGG TGGCAGGCCG GGCGATTAAA TCTCCCTTCT
CAGCCTAAAG CAACACCGCG TGAAATCTGG CAATCATTGG TTTCAGGTAT CTGGGCGCTG
TTTTTACCGG TGATTATTAT CGGCGGCTTC CGTTCCGGAC TTTTCACGCC AACGGAGGCA
GGGGCGGTTG CCGCCTTTTA CGCCCTCTTT GTCGCCGTGG TTATCTATCG GGAATTAACG
TTTTCCAGTC TCTACCACGT GCTGGTCAAT GCCGCCAAAA CGACGTCAGT CGTCATGTTT
CTGGTGGCCG CGGCCCAGGT ATCCGCCTGG CTGATTACGA TCGCGGAATT ACCCATGATG
GTGTCAGATT TGCTGCAGCC GCTGGTCGAC TCTCCGCGAC TCTTATTTAT CGTCATTATG
ATCTCAATTA TGGTCGTCGG TATGGTGATG GATTTGACGC CAACGGTGTT AATTCTTACC
CCTGTATTAT TGCCATTAGT TAAAGAAGCC AATATTGACC CGATTTATTT CGGCGTCATG
TTCATTATTA ACTGCTCTAT TGGATTAATC ACACCGCCCG TTGGCAACGT CCTCAACGTT
ATTTCCGGGG TAGCAAAATT GAAATTTGAT GACGCGGTAA GAGGCGTATT CCCTTACGTT
GTCGTCCTGA TGTCGCTGCT GGTTTTATTT ATTTTTATTC CCGAGCTAAT TATCACACCG
CTTAAATGGA TTAATTAA
 
Protein sequence
MAVVIFLCCL LGGIAIGLPI AWSLLLCGAA LMAYLDMFDV QIMAQTLVNG ADSFSLLAIP 
FFVLAGEIMN AGGLSKRIVD LPMKLVGHKP GGLGYVGVIA AMIMASLSGS AVADTAAVAA
LLVPMMRSAN YPINRSVGLI ASGGIIAPII PPSIPFIIFG VSSGLSISKL FMAGIAPGIM
MGAALMLTWW WQAGRLNLPS QPKATPREIW QSLVSGIWAL FLPVIIIGGF RSGLFTPTEA
GAVAAFYALF VAVVIYRELT FSSLYHVLVN AAKTTSVVMF LVAAAQVSAW LITIAELPMM
VSDLLQPLVD SPRLLFIVIM ISIMVVGMVM DLTPTVLILT PVLLPLVKEA NIDPIYFGVM
FIINCSIGLI TPPVGNVLNV ISGVAKLKFD DAVRGVFPYV VVLMSLLVLF IFIPELIITP
LKWIN