Gene SeHA_C1204 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C1204 
Symbol 
ID6487615 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp1185364 
End bp1186926 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content48% 
IMG OID642741444 
Productpeptidase family C69 
Protein accessionYP_002045095 
Protein GI194447920 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4690] Dipeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.376431 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones77 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGCAA AGAATATTGC TGGCTCGATC ACTTTCTCGG TTTCCAATCC AGGGTCAATT 
CATCCCATAA CGCGTTATTC AGTGGGAACC TTTATGAAAA AGTATCTTGC ATTCGCCGTT
ACGCTGCTGG GTATGGGTAA AGTCATCGCC TGTACTACCC TTTTGGTAGG CAATCAGGCT
TCGGCTGACG GCTCCTTCAT TATCGCGCGC AACGAAGATG GCTCGGCAAA TAACGCCAAG
CATAAGGTTA TTCATCCCAT CGCGTTTCAT CAACAAGGCG AGTATAAAGC ACATCGGAAC
AATTTTAGCT GGCCGCTTCC GGAGACAGCG ATGCGCTATA CGGCAATTCA TGACTTTGAT
ACTAACGATA ACGCCATGGG TGAAGCCGGT TTCAATTCGG CGGGCGTCGG AATGAGCGCA
ACGGAAACCA TTTATAACGG CAGAGCGGCG CTGGCTGCCG ATCCTTACGT GACAAAAACG
GGGATCACGG AAGACGCCAT TGAGTCCGTG ATCCTGCCAG TGGCGCAATC GGCGCGTCAG
GGCGCCAAAT TACTGGGAGA CATTATTGAA CAAAAAGGCG CTGGCGAAGG TTTCGGCGTC
GCGTTTATTG ATAGCAAAGA GATATGGTAT CTGGAGACGG GAAGCGGTCA TCAATGGCTG
GCAGTACGAC TTCCGGCAGA TAGCTATTTC GTTTCCGCCA ATCAGGGACG TTTACGCCAT
TACGATCCGA ATGATAACGC GAATTATATG GCGTCACCAA CGTTAGTAAG CTTTGCGAAA
AAGCAGGGAT TATATGATCC GGCCCGCGGC GAATTCGACT TTCATCAAGC CTATTCGCAG
GATAACAAAA ACGATACCAC CTATAATTAT CCGCGCGTCT GGACGCTACA ACACCAGTTT
AATCCGCATC TGGATACGGT CGTTAGCGAA GGGGAAACAT TTCCTGTTTT TTTAACGCCA
ATAACGAAGA TCAGCGTGGC GGCAGTAAAA AACGCGCTAC GCAATCACTA TCAGGGAACG
TCGCACGACC CTTATGCCAG TCATAATCCA CAAGAACCAT GGCGACCGAT ATCCGTTTTT
CGTACCCAGG AGTCACATAT TTTACAGGTC AGACCGAAAT TACCGCAGGC TATCGGCAAC
GTAGAATACA TCGCCTATGG AATGCCATCT CTTAGCGTCT ATCTCCCCTA TTATCAGGGG
ATGCGTCATT ATCAACCCGG AGATGATAAA GGAACCGATC GGGCGAGCAA CGACTCTACC
TACTGGACAT TCCGCACGCT GCAAACGCTG GTTATGCAGG ACTACAATGC GTTTGCGCCA
GATGTGCAAC ACGCCTGGAA AACATTTGAA CAGCAAACAG CTAAGCAGCA GTATAAGATG
GAGCAGAGCT ATCTGAGATT ATATGCGTCG CATCCGAAAG AAGCGCAACG CTTACTGCAA
AATTTTGAAG ATAAAACGAT GCAAAATGCG CAGACGCTCG CCCGTCGCCT GACCAATAAT
ATTATTACGA CAATGACTTA CCGTACAGAT ATGAAATATC ACTTTTCAAG TACGCAACCA
TAA
 
Protein sequence
MPAKNIAGSI TFSVSNPGSI HPITRYSVGT FMKKYLAFAV TLLGMGKVIA CTTLLVGNQA 
SADGSFIIAR NEDGSANNAK HKVIHPIAFH QQGEYKAHRN NFSWPLPETA MRYTAIHDFD
TNDNAMGEAG FNSAGVGMSA TETIYNGRAA LAADPYVTKT GITEDAIESV ILPVAQSARQ
GAKLLGDIIE QKGAGEGFGV AFIDSKEIWY LETGSGHQWL AVRLPADSYF VSANQGRLRH
YDPNDNANYM ASPTLVSFAK KQGLYDPARG EFDFHQAYSQ DNKNDTTYNY PRVWTLQHQF
NPHLDTVVSE GETFPVFLTP ITKISVAAVK NALRNHYQGT SHDPYASHNP QEPWRPISVF
RTQESHILQV RPKLPQAIGN VEYIAYGMPS LSVYLPYYQG MRHYQPGDDK GTDRASNDST
YWTFRTLQTL VMQDYNAFAP DVQHAWKTFE QQTAKQQYKM EQSYLRLYAS HPKEAQRLLQ
NFEDKTMQNA QTLARRLTNN IITTMTYRTD MKYHFSSTQP