Gene SeHA_C2994 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C2994 
SymbolproX 
ID6489041 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp2932791 
End bp2933786 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content54% 
IMG OID642743150 
Productglycine betaine transporter periplasmic subunit 
Protein accessionYP_002046774 
Protein GI194449221 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.805 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value0.00724532 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGACATA CCGTGATATT TGCCTCAGCG TTTGCCACCC TTGTCACCGC CAGCGCTTTC 
GCTGCCGACC TGCCTGGCAA AGGCATTACC GTCCAACCTA TCCAAAGCAC GATTTCTGAA
GAGACCTTCC AGACGTTACT GGTCAGTCGG GCGCTGGAAA AGTTGGGTTA TACCGTAAAT
AAGCCGAGTG AAGTCGATTA CAACGTAGGC TACACCTCCA TCGCCTCTGG CGATGCGACG
TTTACCGCCG TGAACTGGCA GCCGCTGCAT GATGATATGT ATGCCGCAGC AGGCGGAGAC
AAAAAATTTT ACCGCGAGGG CGTCTTCGTC TCCGGCGCTG CGCAGGGCTA TCTGATCGAT
AAAAAAACCG CCGAGCAGTA CAACATCACC AATATCGCTC AGCTAAAAGA TCCGAAAATC
GCCAAAATCT TCGATACCAA TGGCGACGGA AAAGCGGACA TGATGGGCTG CTCGCCGGGC
TGGGGCTGCG AAGCCGTGAT CAATCACCAG AATAAAGCGT TTGATCTGCA AAAAACCGTT
GAGGTCAGCC ACGGTAACTA TGCGGCGATG ATGGCTGATA CCATTACCCG TTTTAAAGAA
GGCAAGCCGG TGCTGTATTA CACCTGGACC CCGTACTGGG TGAGCGACGT AATGAAGCCG
GGTAAAGATG TCGTCTGGCT ACAGGTGCCG TTCTCCTCTC TGCCGGGCGA ACAGAAAAAT
ATTGATACTA AACTGCCGAA CGGCGCGAAC TATGGGTTCC CGGTTAATAC CATGCATATT
GTCGCCAATA AGGCATGGGC GGAGAAAAAC CCGGCGGCGG CGAAACTGTT CGCCATCATG
AAGCTGCCGC TGGCGGATAT CAACGCGCAG AACGCCATGA TGCATGCCGG TAAATCGTCT
GAAGCCGATG TTCAGGGCCA CGTGGACGGC TGGATCAACG CCCACCAGCA GCAGTTTGAC
GGCTGGGTGA AAGAAGCGCT GGCCGCGCAG AAATAA
 
Protein sequence
MRHTVIFASA FATLVTASAF AADLPGKGIT VQPIQSTISE ETFQTLLVSR ALEKLGYTVN 
KPSEVDYNVG YTSIASGDAT FTAVNWQPLH DDMYAAAGGD KKFYREGVFV SGAAQGYLID
KKTAEQYNIT NIAQLKDPKI AKIFDTNGDG KADMMGCSPG WGCEAVINHQ NKAFDLQKTV
EVSHGNYAAM MADTITRFKE GKPVLYYTWT PYWVSDVMKP GKDVVWLQVP FSSLPGEQKN
IDTKLPNGAN YGFPVNTMHI VANKAWAEKN PAAAKLFAIM KLPLADINAQ NAMMHAGKSS
EADVQGHVDG WINAHQQQFD GWVKEALAAQ K