Gene SeHA_C1994 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C1994 
SymboltreA 
ID6491655 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp1942878 
End bp1944587 
Gene Length1710 bp 
Protein Length569 aa 
Translation table11 
GC content55% 
IMG OID642742199 
Producttrehalase 
Protein accessionYP_002045842 
Protein GI194450913 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1626] Neutral trehalase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones83 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATACCCC CAGAGATTCG CCGTTCTGTT CTACTGCAGA AAGCCATAAA ACTGGCGCTG 
GCAGGGACGC TGCTGACGTT TGCATCGTTT TCGGCGACTG CCGCAGACCC GTCTTCCGAC
ACTGAAACTC CGCAGCCGCC GGATATTTTG CTTGGCCCGC TCTTTAATGA TGTCCAGAAT
GCAAAACTCT TCCCCGATCA GAAAACCTTT GCTGACGCCA TACCTAATAG CGATCCGCTT
ATGATTCTTG CGGATTATCG TATGCAGCGG AACCAGTCCG GCTTCGATTT ACGTCATTTT
GTTGATGTTA ACTTCACCCT GCCGAAAGCG GGTGAAAAAT ATGTCCCGCC TGCCGGGCAG
TCATTGCGTG AACATATTGA TGGCCTGTGG CCGGTGCTGA CACGTTCAAC TAAAAACGTC
GAAAAGTGGG ACTCGCTCTT GCCGTTGCCT GAATCCTATG TCGTGCCGGG TGGTCGATTC
AGAGAGATTT ACTACTGGGA CAGCTACTTT ACGATGCTGG GGCTGGCGGA AAGCGGGCAC
TGGGATAAGG TGGCGGATAT GGTGGCGAAC TTTGGTTACG AAATTGACGC CTGGGGGCAT
ATTCCTAACG GCAACCGTAC CTACTACCTG AGTCGTTCGC AGCCGCCTTT CTTTGCGTTT
ATGGTTGAGT TACTGGCGCA ACATGAAGGT GACGATGCGC TGAAAGAATA CCTGCCGCAA
CTGCAAAAAG AGTACGCCTA CTGGATGGAG GGCGTTGAGA CATTGCAGCC AGGGCAACAA
AACCAACGCG TCGTCAAACT GGAAGACGGC AGCGTTCTCA ACCGCTACTG GGACGATCGG
GATACGCCCC GCCCTGAATC CTGGGTTGAA GATATCGCTA CCGCCAAAAG CAACCCCAAC
CGCCCGGCAA CGGAGATCTA TCGAGACCTC CGTTCTGCTG CCGCCTCCGG CTGGGATTTC
AGCTCCCGCT GGATGGATAA TCCGCAGCAG CTCAGTACCA TTCGTACCAC CACTATTGCC
CCTGTCGATC TTAACGCTCT GCTGTATCAA CTGGAGAAAA CCCTCGCCCG CGCCAGCGCT
GCGGCGGGCG ATCGGGCCAA AGCCTCGCAC TATGACGCGC TGGCCAACGC GCGGCAAAAA
GCCATTGAAA TGCATCTGTG GAATAACAAA GAGGGTTGGT ATGCCGACTA CGATCTGAAG
AACAATAAAA TCCGTGACCA ACTCACCGCT GCCGCGCTGT TCCCGCTCTA TGTAAACGCC
GCCGCGAAAG ATCGCGCCGC GAAAGTGGCG GCGGCCCAGG CGCATCTGCT ACAGCCTGGC
GGGCTGGCTA CCACCTCGGT TAAAAGCGGA CAGCAATGGG ATGCGCCAAA CGGCTGGGCG
CCGTTACAAT GGGTCGCTGC CGAAGGATTG CAAAATTATG GGCAGGATGA CGTGGCAATG
GAAGTCACCT GGCGCTTTTT AACCAATGTG CAGCACACCT ACGATCGCGA GAAAAAACTG
GTCGAAAAAT ATGATGTCAG CAGCACCGGA ACCGGCGGTG GCGGCGGCGA ATATCCACTT
CAGGACGGCT TTGGCTGGAC CAACGGCGTG ACGCTGAAAA TGCTCGATCT GATTTGTCCG
CAGGAAAAAC CGTGCGATAG CGTACCGTCT ACTCGTCCGG CATCGTTAAG CGCAACGCCG
ACAAAAACGC CGTCTGCAGC GACGCAGTAA
 
Protein sequence
MIPPEIRRSV LLQKAIKLAL AGTLLTFASF SATAADPSSD TETPQPPDIL LGPLFNDVQN 
AKLFPDQKTF ADAIPNSDPL MILADYRMQR NQSGFDLRHF VDVNFTLPKA GEKYVPPAGQ
SLREHIDGLW PVLTRSTKNV EKWDSLLPLP ESYVVPGGRF REIYYWDSYF TMLGLAESGH
WDKVADMVAN FGYEIDAWGH IPNGNRTYYL SRSQPPFFAF MVELLAQHEG DDALKEYLPQ
LQKEYAYWME GVETLQPGQQ NQRVVKLEDG SVLNRYWDDR DTPRPESWVE DIATAKSNPN
RPATEIYRDL RSAAASGWDF SSRWMDNPQQ LSTIRTTTIA PVDLNALLYQ LEKTLARASA
AAGDRAKASH YDALANARQK AIEMHLWNNK EGWYADYDLK NNKIRDQLTA AALFPLYVNA
AAKDRAAKVA AAQAHLLQPG GLATTSVKSG QQWDAPNGWA PLQWVAAEGL QNYGQDDVAM
EVTWRFLTNV QHTYDREKKL VEKYDVSSTG TGGGGGEYPL QDGFGWTNGV TLKMLDLICP
QEKPCDSVPS TRPASLSATP TKTPSAATQ