Gene SeHA_C4572 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C4572 
SymbolmalK 
ID6491972 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp4446831 
End bp4447940 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content58% 
IMG OID642744644 
Productmaltose/maltodextrin transporter ATP-binding protein 
Protein accessionYP_002048221 
Protein GI194449504 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3839] ABC-type sugar transport systems, ATPase components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.535764 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones80 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAGCG TACAGCTACG AAATGTAACG AAAGCCTGGG GTGACGTGGT GGTATCGAAA 
GATATTAACC TCGATATCCA TGACGGGGAG TTCGTGGTGT TTGTGGGACC GTCAGGCTGT
GGTAAATCGA CCTTGCTGCG TATGATCGCC GGGCTTGAAA CCATCACCAG TGGCGACCTG
TTTATCGGGG AAACCCGTAT GAATGATATT CCGCCTGCCG AGCGCGGCGT GGGCATGGTA
TTCCAGTCTT ATGCGCTCTA TCCCCATCTC TCCGTTGCAG AAAACATGTC TTTCGGCCTC
AAGCTGGCGG GCGCCAAAAA AGAGGTAATG AATCAACGCG TCAATCAGGT GGCGGAAGTG
CTGCAACTGG CGCATCTGCT GGAACGTAAG CCAAAAGCGC TTTCCGGCGG GCAGCGTCAG
CGCGTAGCGA TTGGCCGCAC GCTGGTGGCG GAGCCGCGTG TGTTTTTGCT GGATGAACCG
CTCTCTAACC TGGACGCCGC GCTGCGCGTG CAGATGCGCA TTGAAATTTC TCGCCTGCAT
AAACGTCTGG GCCGCACGAT GATTTACGTC ACCCACGATC AGGTCGAGGC GATGACGCTG
GCCGACAAAA TCGTGGTGCT GGACGCCGGT CGCGTCGCTC AGGTCGGTAA GCCGCTGGAG
CTGTACCACT ATCCGGCGGA CCGCTTTGTC GCGGGCTTCA TCGGCTCGCC GAAGATGAAC
TTCCTGCCGG TGAAAGTGAC CGCCACCGCG ATTGAACAAG TCCAGGTCGA ACTGCCGAAT
CGCCAGCAAA TCTGGCTGCC GGTCGAAAGT CGCGGCGTGC AGGTCGGCGC CAATATGTCT
TTAGGCATTC GGCCGGAACA CCTGCTGCCG AGCGATATCG CCGATGTCAC CCTGGAAGGC
GAAGTCCAGG TGGTCGAGCA GTTAGGGCAC GAAACACAAA TTCATATCCA GATCCCCGCC
ATCCGTCAAA ACCTGGTTTA TCGCCAGAAT GACGTGGTGT TGGTAGAAGA GGGCGCCACA
TTCGCTATCG GCCTGCCGCC AGAGCGCTGT CATCTGTTCC GCGAGGATGG CAGCGCATGT
CGTCGTCTGC ATCAAGAGCC GGGTGTTTAA
 
Protein sequence
MASVQLRNVT KAWGDVVVSK DINLDIHDGE FVVFVGPSGC GKSTLLRMIA GLETITSGDL 
FIGETRMNDI PPAERGVGMV FQSYALYPHL SVAENMSFGL KLAGAKKEVM NQRVNQVAEV
LQLAHLLERK PKALSGGQRQ RVAIGRTLVA EPRVFLLDEP LSNLDAALRV QMRIEISRLH
KRLGRTMIYV THDQVEAMTL ADKIVVLDAG RVAQVGKPLE LYHYPADRFV AGFIGSPKMN
FLPVKVTATA IEQVQVELPN RQQIWLPVES RGVQVGANMS LGIRPEHLLP SDIADVTLEG
EVQVVEQLGH ETQIHIQIPA IRQNLVYRQN DVVLVEEGAT FAIGLPPERC HLFREDGSAC
RRLHQEPGV