Gene SeHA_C3894 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C3894 
Symbol 
ID6488572 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp3762674 
End bp3763723 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content52% 
IMG OID642744001 
Producthypothetical protein 
Protein accessionYP_002047607 
Protein GI194449282 
COG category[R] General function prediction only 
COG ID[COG0628] Predicted permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones94 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAACCG CTCAACCCGA CAAAACGGGT ATGCATATTC TGCTGAAACT GGCCTCCCTG 
GTTATTATTC TCGCCGGGAT TCATGCCGCG GCGGATATTA TTGTACAACT CTTGCTGGCG
CTCTTTTTCG CCATTGTTCT TAATCCATTA GTGACCTGGT TTATTCGCCG GGGCGTGAAG
CGACCGCTGG CGATTACTAT CGTGGTGGTC GTCATGCTGA TCGTGCTTAC CGCGCTGGTG
GGTGTGCTTG CCGCATCGCT TAATGAATTT ATCGCTATGC TGCCTAAATA CAGCAAGGAG
CTGACGCGTA AAGTCTTACA CCTTCAGGAG TTAATGCCCT TCCTGAATTT ACATATGTCG
CCGGAACGTA TGCTCCGCGG GATGGATTCC GATAAAATTA TGCTCTTCAC CACAACATTA
ATGACCGGCG TATCGGGCGC GATGGCAAGC ATTGTGCTGC TGGTGATGAC CGTGGTTTTT
ATGCTGTTTG AGGTGCGTCA CGTGCCTTAT AAATTACGTT TCGCGCTAAA CAATCCACAA
ATCCATATTG CCGGCCTGCA CCGCGCCCTG AAAGGCGTCT CGCATTATCT GGCGCTGAAA
ACGCTGCTCA GTTTATGGAC GGGCGCGATT ATCTGGCTGG GACTGGCGTT AATGGATATT
CAGTTTGCGC TAATGTGGGG CGTACTGGCC TTTTTGCTCA ATTACGTTCC TAATATCGGC
TCGGTGATTT CCGCCGTTCC CCCCATGATT CAGGCGCTGT TATTTAACGG TTTTTACGAG
TGCGTACTGG TCGGCGCGCT CTTTTTGGTG GTCCATATGG TGATTGGCAA CATTATGGAG
CCACGCATGA TGGGCCATCG TCTGGGGATG TCCACGCTGG TGGTATTTCT TTCGTTACTG
GTCTGGGGAT GGCTATTAGG CCCGGTCGGG ATGCTCTTGT CCGTTCCTTT GACCAGCGTC
TGCAAAATCT GGATGGAAAC CACTAAAGGC GGCAGCAAAC TGGCGATCTT ACTGGGGCCA
GGCCGACCGA AAAGCCGTTT ACCGGGATGA
 
Protein sequence
MATAQPDKTG MHILLKLASL VIILAGIHAA ADIIVQLLLA LFFAIVLNPL VTWFIRRGVK 
RPLAITIVVV VMLIVLTALV GVLAASLNEF IAMLPKYSKE LTRKVLHLQE LMPFLNLHMS
PERMLRGMDS DKIMLFTTTL MTGVSGAMAS IVLLVMTVVF MLFEVRHVPY KLRFALNNPQ
IHIAGLHRAL KGVSHYLALK TLLSLWTGAI IWLGLALMDI QFALMWGVLA FLLNYVPNIG
SVISAVPPMI QALLFNGFYE CVLVGALFLV VHMVIGNIME PRMMGHRLGM STLVVFLSLL
VWGWLLGPVG MLLSVPLTSV CKIWMETTKG GSKLAILLGP GRPKSRLPG