Gene SeHA_C2093 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C2093 
Symbol 
ID6490809 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp2020826 
End bp2022877 
Gene Length2052 bp 
Protein Length683 aa 
Translation table11 
GC content51% 
IMG OID642742289 
Productprotease 2 
Protein accessionYP_002045932 
Protein GI194448248 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1770] Protease II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.559794 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones77 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGCCAA AAGCCAATCG AATTCCCTAT GCCATGACCG TACATGGCGA TACGCGCATT 
GATAATTATT ACTGGCTGCG AGATGACACT CGCTCGCAGC CGGAAGTCCT TGATTACCTG
CATCAGGAAA ATGAGTATGG CCGGAAGGTC ATGACCTCTC AGCAGGCGTT ACAGGACCGC
ATTCTAAAAG AAATTATCGA TCGCATCCCG CCCAGAGAAG TTTCCGCTCC GTATGTGAAA
AACGGCTATC GCTACCGTTA TATCTATGAA CCCGGCTGCG AATATGCCAT CTATCAACGA
CAATCGGCGT TAAGCGAAGA GTGGGATGTG TGGGAAACCT TGCTCGATGC GAACCAGCGG
GCCGCGCACA GCGAATTTTA TACGCTCGGC GGACTTGCCA TTACGCCGGA TAATACCATC
ATGGCGCTGG CGGAAGATTA TTTATCCCGT CGTCAGTATG GGTTGCGTTT TCGTAACCTC
GAAAGCGGTA ACTGGTATCC GGAACTGCTG GATAACGTTG CGCCTGAATT TGTCTGGGCC
AATGATTCCC TGACCCTTTA CTATGTGCGT AAGCATAAGA AGACGCTGCT GCCCTATCAG
GTTTGGCGAC ACACGATTGG CACTCCGTCA TCGCAAGATG AACTGGTATA TGAAGAGAAA
GACGATACCT TTTATGTCAG CCTGCATAAA ACCACTTCGC AGCATTATGT GGTGATTCAT
CTTGCCAGCG CCACCACTAG CGAAGTGCTA TTACTTGACG CGGAACTGGC CGATGCCGAG
CCGTTTTCAT TTTTACCGCG CCGCAAAGAC CATGAATATA GTCTCGATCA CTATCAACAT
AAGTTTTACC TGCGTTCTAA CCGGAACGGT AAAAACTTTG GGTTGTACCG TACCCGCGTG
CGCAATGAAA ACGCCTGGGA AGAGCTGATC CCTCCGCGCG AGCATATTAT GCTGGAAGGG
TTTACCCTGT TTACCGACTG GTTAGTGGTC GAAGAGCGTC AACGGGGGCT TACCAGCCTG
CGGCAAATTA ACCGTAAAAC CCGTGAAGTG ATAGGCATTG CCTTTGACGA TCCGGCTTAC
GTGACGTGGC TTGCCTATAA TCCCGAACCT GAGACCTCCC GGCTGCGTTA CGGCTATTCT
TCAATGACGA CGCCAGATAC CTTGTTTGAA CTGGATATGG ATACCGGAGA ACGACGGGTA
CTTAAACAGA CGGAAGTGCC TGGGTTTGAT TCTGGCTGTT ATCAGAGCGA ACACCTGTGG
ATCACCGCAC GCGACGGCGT CGAAGTGCCG GTATCGCTGG TTTATCATCA GAAGTATTTT
CGTAAAGGGC AAAATCCGCT TCTGGTTTAC GGCTACGGAT CTTACGGTTC CAGTATTGAC
GCCGACTTCA GCAGCAGCCG ACTGAGCTTG CTGGATCGTG GCTTTGTTTA CGCAATCGTA
CACGTTCGCG GTGGCGGTGA GCTGGGGCAG CAGTGGTATG AAGATGGCAA ATTCCTCAAA
AAGCGGAATA CTTTTAATGA CTATCTTGAT GCCTGCGATG CCTTATTAAA ACTGGGTTAC
GGTTCGCCGT CGCTGTGTTA CGGGATGGGC GGGAGCGCGG GCGGAATGCT AATGGGCGTC
GCTATCAACG AACGCCCCGA GCTTTTCCAC GGCGTTATTG CCCAGGTACC CTTTGTTGAT
GTATTAACCA CGATGCTGGA TGAGTCGATC CCACTAACGA CAGGAGAGTT TGAAGAGTGG
GGGAACCCGC AGGATATTGA GTATTATGAC TATATGAAAA GCTATAGTCC TTATGACAAT
GTCAAAGCGC AGGACTATCC GCACCTGCTG GTGACGACAG GATTGCACGA TTCCCAGGTG
CAATACTGGG AACCGGCGAA GTGGGTGGCA AAATTACGCG AGCTAAAAAC GGACCAACGT
CTGCTGCTGC TATGTACGGA TATGGACTCC GGGCACGGTG GTAAGTCGGG GCGGTTTAAA
TCCTACGAAG GCGTCGCGCT GGAGTTCGCC TTTTTAATCG GCCTGGCGCA GGGAACCTTA
CATAGCGCAT AG
 
Protein sequence
MLPKANRIPY AMTVHGDTRI DNYYWLRDDT RSQPEVLDYL HQENEYGRKV MTSQQALQDR 
ILKEIIDRIP PREVSAPYVK NGYRYRYIYE PGCEYAIYQR QSALSEEWDV WETLLDANQR
AAHSEFYTLG GLAITPDNTI MALAEDYLSR RQYGLRFRNL ESGNWYPELL DNVAPEFVWA
NDSLTLYYVR KHKKTLLPYQ VWRHTIGTPS SQDELVYEEK DDTFYVSLHK TTSQHYVVIH
LASATTSEVL LLDAELADAE PFSFLPRRKD HEYSLDHYQH KFYLRSNRNG KNFGLYRTRV
RNENAWEELI PPREHIMLEG FTLFTDWLVV EERQRGLTSL RQINRKTREV IGIAFDDPAY
VTWLAYNPEP ETSRLRYGYS SMTTPDTLFE LDMDTGERRV LKQTEVPGFD SGCYQSEHLW
ITARDGVEVP VSLVYHQKYF RKGQNPLLVY GYGSYGSSID ADFSSSRLSL LDRGFVYAIV
HVRGGGELGQ QWYEDGKFLK KRNTFNDYLD ACDALLKLGY GSPSLCYGMG GSAGGMLMGV
AINERPELFH GVIAQVPFVD VLTTMLDESI PLTTGEFEEW GNPQDIEYYD YMKSYSPYDN
VKAQDYPHLL VTTGLHDSQV QYWEPAKWVA KLRELKTDQR LLLLCTDMDS GHGGKSGRFK
SYEGVALEFA FLIGLAQGTL HSA