Gene Sked_21040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSked_21040 
Symbol 
ID8633739 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSanguibacter keddieii DSM 10542 
KingdomBacteria 
Replicon accessionNC_013521 
Strand
Start bp2346530 
End bp2348629 
Gene Length2100 bp 
Protein Length699 aa 
Translation table11 
GC content72% 
IMG OID 
Productpeptidyl-dipeptidase Dcp 
Protein accessionYP_003314860 
Protein GI269795405 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.547895 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.690567 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGACT CTTCCCACGA GCTCGACGCG AGCAACCCCT TCGCGAGCCG GTCCACCCTT 
CCCTACGCCC TGCCCGACTT CTCGGCGATC CGCGACGAGC ACTACATCCC GGCGGTGCGC
GCCGGGATGG CCGCGGAGCT CGCCGAGATC GAGGCGATCG TCACCGACCC GAACCCGCCG
ACCGTCGAGA ACACCCTCGA GGCGCTCGAG ACGAGCGGTG AGGTGCTCGA CCGAGCCCTC
ACCGTCCTCT ACAACGTCGC CTCGGCCGAC GCGAGCCCTG CGCTCGAGGA CATCGAGGAG
ACGCTCGCCC CCGAGCTCTC GGCGCACCAC GACACCATCT ACATGGACGC GCGCCTCTAC
GCCCGCGTCG TCGCCCTCGA CACCGCGGTC CGCGCGGGCG AGGTCGAGGC CGGTGACGAC
ACCCGCTGGC TGCTCGAGAA CCTCCTCCAG GACTTCCGCC GCTCGGGCAT CGACCTGAGC
GCCGAGGACC AGGCGACGCT CCGCGACCTC AACGCGCGCA CCACGTCGCT GGAGGCGGCC
TTCGGGCGGC GCCTCCTGGC CGGTGCGAAC GCGGCGAGCG TCTTCGTCGA CGACGTCGCC
GACCTCGAGG GCCTCGCCGA CGACGCGATC GCGGCAGCCG CGCAGGCCGC CGCCGATCGC
GGCGAGGAGG GGCGCTACCT CCTCGAGATG CAGCTGCCGA CGCAGCAGAC CGTGCTCGCC
TCGCTCGCCC GTCGTGACGT GCGCCGCCGG GTGCACGAGG CCTCGGTGAC GCGCGGGGCG
ACCGGTGACG ACACCGACAC CCGCGAGATC GTCGTCGAGC TCGCCCGGCT GCGGGCCGAG
CACGCCCGCC TGCTCGGCTA CGACCACCAC GCGGCGTACA TCGCCGAGGA CGCCACCGCC
AAGACGACCG AGGCCGTGAA CGCGATGCTC GCCCCGCTGG CCCCGGCCGC GGCCGCGAAC
GCCCGCAAGG AGGCCCTCGA CCTCACCGAG GCGCTCGTCG CCGACCTCGG CGACCCGGGC
GCGACGCTCG AGGCCTGGGA CTGGGCCTAC TACGCCGAGC GTGTCCGCAA GCAGCGCTAC
TCCCTCGACG ACGCCCTGCT CCGCCCGTAC CTCGAGCTCG AGAAGGTCGT CCAGGACGGC
GTCTTCAAGG CCGCCACCAA GCTCTACGGC ATCACCTTCT CCGAGCGCAC CGACCTCGTG
GGCTACCACC CCGACGTGCG GGTCTTCGAG GTCTTCGACA CCGACGGCGC CGGCATGGGC
CTGTTCCTCG CCGACTACTA CACGCGCGAG TCCAAGCGCG GCGGCGCGTG GATGAACAAC
CTCGTCGACC AGAGCTACCT CACCGGTGAG CTGCCGGTCG TGGTCAACAA CCTGAACATC
GTCAAGCCGC CGGCGGGGGA GCCCACGCTG CTCGTCTTCG ACGAGGTCAT CACGCTCTTC
CACGAGTTCG GCCACGCGCT GCACGGGCTC TTCTCCGCCG TCCGGTACCC CTCGCACTCG
GGCACCGACG TGCCGCGCGA CTTCGTCGAG TACCCCTCGC AGGTCAACGA GATGTGGGCG
TGGGACGAGT CGATCCTGCG CTCCTACGCG GTCCACCACG TCACGGGCGA GCCGCTCCCG
GAGCAGTGGG TGCGCACCAT GCTCGACTCC CGCCTGTTCA ACGAGGGCTT CGCGACGACC
GAGTACCTCG CCGCGACGCT CCTCGACCAG GCCTGGCACC AGGTGACCCC GGAGCAGGTC
CCGAGCTCCG TGGACGAGGT CCTGCCCTTC GAGGCGGCCG CTCTCGAGGC CGCCGGCGTC
GCGCTCTCCG AGGTCCCGGC CCGCTACCGC ACCACGTACT TCAACCACGT GTTCGGCGGT
GGCTACTCGG CCGGCTACTA CTCGTACATC TGGTCCGAGG TGCTCGACGC GGAGACCGTC
GAGTGGTTCC GCGAGAACGC CGACGCCGAC GGCCGGGTCC TCAACCGCGC CTCGGGCGAC
CGCTTCCGCG CCGCGCTGCT GTCCCGCGGA GGGTCGACCG ACCCGATGCA GGCCTTCGGC
GAGCTGCGCG GGCGGGCGCC CGAGATCGCC CCGCTGCTGG CCCGCCGCGG CCTCGTCTGA
 
Protein sequence
MSDSSHELDA SNPFASRSTL PYALPDFSAI RDEHYIPAVR AGMAAELAEI EAIVTDPNPP 
TVENTLEALE TSGEVLDRAL TVLYNVASAD ASPALEDIEE TLAPELSAHH DTIYMDARLY
ARVVALDTAV RAGEVEAGDD TRWLLENLLQ DFRRSGIDLS AEDQATLRDL NARTTSLEAA
FGRRLLAGAN AASVFVDDVA DLEGLADDAI AAAAQAAADR GEEGRYLLEM QLPTQQTVLA
SLARRDVRRR VHEASVTRGA TGDDTDTREI VVELARLRAE HARLLGYDHH AAYIAEDATA
KTTEAVNAML APLAPAAAAN ARKEALDLTE ALVADLGDPG ATLEAWDWAY YAERVRKQRY
SLDDALLRPY LELEKVVQDG VFKAATKLYG ITFSERTDLV GYHPDVRVFE VFDTDGAGMG
LFLADYYTRE SKRGGAWMNN LVDQSYLTGE LPVVVNNLNI VKPPAGEPTL LVFDEVITLF
HEFGHALHGL FSAVRYPSHS GTDVPRDFVE YPSQVNEMWA WDESILRSYA VHHVTGEPLP
EQWVRTMLDS RLFNEGFATT EYLAATLLDQ AWHQVTPEQV PSSVDEVLPF EAAALEAAGV
ALSEVPARYR TTYFNHVFGG GYSAGYYSYI WSEVLDAETV EWFRENADAD GRVLNRASGD
RFRAALLSRG GSTDPMQAFG ELRGRAPEIA PLLARRGLV