Gene SeD_A3989 details


Gene Information

Locus tag: SeD_A3989
Symbol:
ID: 6875033
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 3828338
End bp: 3829783
Gene Length: 1446 bp
Protein Length: 481 aa
Translation table: 11
GC content: 54%
IMG OID: 642786945
Product: hypothetical protein
Protein accession: YP_002217573
Protein GI: 198242408
COG category: [R] General function prediction only
COG ID: [COG0612] Predicted Zn-dependent peptidases
TIGRFAM ID:
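The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(3828338, 3829783)
print(length)                     # 1446
print(protein_length_aa(length))  # 481
```

Both results match the Gene Length and Protein Length fields in the record.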


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.801432
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 52
Fosmid unclonability p-value: 0.0802476
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGGCCT CTGCCGGCTA TGTGCAGGCA GATGCGCTCC AGCCCGATCC GGCATGGCAA 
CAGGGGACGC TGGCTAATGG GTTACAGTGG CAAGTGTTGG CTACGCCTCA GCGCCCCAGC
GATCGTATTG AAGTTCGTCT CCAGGTTAAT ACCGGTTCGC TCACCGAAAG TACACAACAG
AGCGGGTTCA GCCATGCGAT TCCCCGTATC GCGCTGACGC AAAGCGGTGG TCTGGATGCC
GCACAGGCAC GTTCTTTATG GCAGCAAGGG TTTGATCCGA AACGTCCCAT GCCGCCCGTT
ATTGTTTCTT ATGATTCCAC GCTCTATAAC CTCAGTTTAC CCAATAACCG TAACGATCTG
CTGAAAGAAG CGCTGACCTA TCTGGCTAAC GTCTCCGGTA AATTAACCAT TACGCCAGAG
ACGGTGAATC ATGCGTTAAG CAGCGAAGAT ATGGTTGCGA CGTGGCCAGC AGATACTAAA
GAGGGCTGGT GGCGTTATCG GCTGAAAGGG TCGGCGTTAT TGGGGCACGA TCCCGCGGAA
CCGTTAAAGC AGCCGGTAGA CGCAGCCAAA ATTCAGGCTT TCTATGAAAA ATGGTACACC
CCGGATGCCA TGACGCTGAT TGTTGTCGGC AACATTGATG CGCGCTCCGT CGCCGAGCAG
ATCAACAAAA CGTTCGGTAC GCTGAAAGGT AAACGCGAAA CGCCCGCCCC GGTGCCGACG
CTTTCGCCGC TGCGGGCGGA ATCAGTGAGC ATTATGACCG ATGCGGTGCG CCAGGATCGT
CTCTCCATTA TGTGGGATAC GCCGTGGCAA CCGATTCGCG AATCGGCGGC GCTGTTGCGC
TACTGGCAGG CGGATCTGGC GCGCGAAGCG CTGTTCTGGC ATATCCAGCA AGAGCTTACT
AAAAATAACG CGAAAGATAT TGGTCTGGGG TTTGACTGCC GGGTTCTGTT CCTGCGCGCG
CAGTGCGCCA TCAACATTGA ATCACCTAAT GATAAGCTCA ATACCAATTT GAGCCTGGTG
GCGAATGAAC TGGCGAAAGT ACGCGATAAA GGTTTGTCGG AAGAGGAGTT TACTGCGCTG
GTGGCGCAGA AAAATCTCGA ATTGCAAAAG CTGTTCGCGA CCTACGCGCG TACCGATACT
GACATTTTGA CTGGACAGCG TATGCGCTCG CTGCAGAATC AAGTGGTGGA TATCGCGCCG
GAGCAGTATC AGAAGCTGCG TCAGAATTTC CTCAACAGCC TGACCGTTGA TATGCTCAAT
CAGAATTTAC GTCAGCAGCT ATCGCAGGAG ATGGCATTAA TTTTGCTGCA ACCGCAAGGC
GAGCCGGAAT TTAATATGAA GGCGTTAAAG GCGACGTGGG ATGAAATCAT GGTCCCGACA
ACTGCCGCCG CTGTTGAAGC AGATGAGGCG CATCCGGAAG TGACGGAGAC ACCGGCGGCA
CAGTAA
 
Protein sequence
MLASAGYVQA DALQPDPAWQ QGTLANGLQW QVLATPQRPS DRIEVRLQVN TGSLTESTQQ 
SGFSHAIPRI ALTQSGGLDA AQARSLWQQG FDPKRPMPPV IVSYDSTLYN LSLPNNRNDL
LKEALTYLAN VSGKLTITPE TVNHALSSED MVATWPADTK EGWWRYRLKG SALLGHDPAE
PLKQPVDAAK IQAFYEKWYT PDAMTLIVVG NIDARSVAEQ INKTFGTLKG KRETPAPVPT
LSPLRAESVS IMTDAVRQDR LSIMWDTPWQ PIRESAALLR YWQADLAREA LFWHIQQELT
KNNAKDIGLG FDCRVLFLRA QCAINIESPN DKLNTNLSLV ANELAKVRDK GLSEEEFTAL
VAQKNLELQK LFATYARTDT DILTGQRMRS LQNQVVDIAP EQYQKLRQNF LNSLTVDMLN
QNLRQQLSQE MALILLQPQG EPEFNMKALK ATWDEIMVPT TAAAVEADEA HPEVTETPAA
Q
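
The gene and protein sequences above can be cross-checked against each other and against the GC content field. The following is a minimal sketch using only the Python standard library. It assumes the sequences are pasted in as shown, in space-separated blocks of ten, and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code for internal codons. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. Alternative start codons, which table 11 permits, are not handled; that does not matter here because this gene starts with ATG.

```python
from itertools import product

# Codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate complete codons from position 0; a trailing partial codon is ignored."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases and the final 6 bases of the gene sequence above.
print(translate("ATGTTGGCCT CTGCCGGCTA TGTGCAGGCA"))  # MLASAGYVQA
print(translate("CAGTAA"))                            # Q*
```

Passing the full gene sequence to `translate` and dropping the trailing `*` should give a string that can be compared with the protein sequence after `clean` is applied to it. `gc_fraction` on the full gene sequence can be compared with the GC content field in the same way.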