Gene SeHA_C3876 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C3876 
SymbollivK 
ID6489775 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp3745494 
End bp3746603 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content54% 
IMG OID642743983 
Producthigh-affinity branched-chain amino acid ABC transporter periplasmic leucine-specific-binding protein LivK 
Protein accessionYP_002047589 
Protein GI194449084 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones86 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACGGA AAGCGAAAAC AATAATCGCA GGGATTGTTG CATTAGCAGC CTCGCAGGGG 
GCAATGGCAG ATGATATTAA AGTCGCCATA GTCGGGGCGA TGTCCGGCCC GGTAGCGCAA
TGGGGCGATA TGGAATTTAA CGGCGCGCGC CAGGCCATTA AAGACATCAA CGCTAAAGGC
GGGATTAAAG GCGATAAGCT GGTCGGCGTA GAGTACGATG ATGCCTGCGA TCCAAAACAG
GCGGTGGCGG TGGCCAACAA AATCGTTAAC GACGGTATTC AATACGTTAT TGGTCACTTG
TGTTCTTCTT CTACTCAGCC AGCATCCGAT ATCTATGAAG ATGAAGGTAT TCTGATGATC
TCCCCGGGGG CGACTAACCC GGAGCTGACC CAGCGCGGCT ATCAGTACAT TATGCGTACC
GCCGGCCTGG ACTCCTCCCA GGGGCCAACG GCCGCGAAAT ACATCCTGGA AACGGTGAAA
CCGCAGCGCA TCGCTATCAT TCACGATAAA CAGCAATACG GCGAAGGACT GGCGCGCTCC
GTGCAGGATG GCCTGAAGCA GGGTAATGCC AATATTGTCT TTTTTGATGG TATTACCGCC
GGTGAAAAAG ATTTCTCCGC CCTGATTGCC CGCTTGCAAA AAGAGAATAT CGACTTTGTG
TATTACGGCG GCTACTACCC GGAAATGGGG CAGATGCTGC GCCAGGCGCG GGCTAATGGC
CTGAAAACGC AATTTATGGG GCCGGAAGGC GTGGGTAACG CGTCGCTGTC CAATATTGCG
GGCGGTGCGG CGGAAGGCAT GTTGGTGACG ATGCCAAAAC GTTATGACCA GGACCCGGCG
AATAAAGCGA TTGTCGAGGC GCTGAAAGCC GACAAGAAAG ATCCCAGCGG TCCGTATGTC
TGGATCACCT ACGCCGCCGT CCAGTCGCTG GCGACCGCAA TGACGCGTAG CGCCAGCCAT
GCTCCGCTGG ATCTGGTGAA AGATCTTAAA GCTAACGGGG CTGATACCGT TATTGGGCCG
CTGAAATGGG ATGAAAAAGG CGATCTTAAG GGATTTGAAT TTGGCGTCTT CCAGTGGCAC
GCCGACGGCT CGTCAACCGT CGCGAAGTAA
 
Protein sequence
MKRKAKTIIA GIVALAASQG AMADDIKVAI VGAMSGPVAQ WGDMEFNGAR QAIKDINAKG 
GIKGDKLVGV EYDDACDPKQ AVAVANKIVN DGIQYVIGHL CSSSTQPASD IYEDEGILMI
SPGATNPELT QRGYQYIMRT AGLDSSQGPT AAKYILETVK PQRIAIIHDK QQYGEGLARS
VQDGLKQGNA NIVFFDGITA GEKDFSALIA RLQKENIDFV YYGGYYPEMG QMLRQARANG
LKTQFMGPEG VGNASLSNIA GGAAEGMLVT MPKRYDQDPA NKAIVEALKA DKKDPSGPYV
WITYAAVQSL ATAMTRSASH APLDLVKDLK ANGADTVIGP LKWDEKGDLK GFEFGVFQWH
ADGSSTVAK