Gene Ksed_18060 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagKsed_18060 
Symbol 
ID8373311 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameKytococcus sedentarius DSM 20547 
KingdomBacteria 
Replicon accessionNC_013169 
Strand
Start bp1878762 
End bp1880063 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content70% 
IMG OID644992063 
ProductZn-dependent dipeptidase, microsomal dipeptidase 
Protein accessionYP_003149575 
Protein GI256825615 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.00000416179 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.24272 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAGCT CCGGTCACCC CCAGATGAGC TCGGCCGGCC TGCCGCACGA GGCCCTGGCC 
ACCGCCGGGC GAGGCAGGAC ACCCGCCGGG CACAATGGCG GTATGACTGA CATCGCTGCC
GTGCGTTCCA TGCTGCGCGA GTTCCCACTC GTCGACGGCC ACAACGACCT TCCGTGGCGG
CTGCACGCCC TGTCGCAGGC CGATGCCGAG TCGACCGATC TGACCGTCGC CGACATCGGA
GCCGGGACCC TGGGGACGGC CACGCAGACC CACACCGACC TGCCTCGCCT GCTGGACGGC
GGCATCGGTG CGCAGTTCTG GAGCGTGTTC GTCCCGGCCC ACCTCTCCGG TGACGACGCC
GTCTCCATGA CCCTGGAGCA GATCGACCGG GTGCGCGCCC TGGTCGAGAT GTTCCCCGAC
CGCCTCGAGC TGGCAGACAC GGCCGCCGAC GTGCGCCGCA TCCACGCGTC GGGCCGCATC
GCGAGCCTCA TGGGTGCTGA GGGCGGCCAC AGCATCAACA ACTCCCTGGC GACGCTCCGC
ATCCTGCGTG AGCTCGGGGT GCGGTACATG ACCCTGACCC ACAACAGCAA TGTCGACTGG
GCGGACAGCG CGACCGACGA CGAGAACATC GGCGGGCTGA GCCGCTTCGG CACCGAGGTG
GTGGCCGAGA TGAACCGGAT CGGCATGCTC GTGGACCTTT CGCACGTCTC GGCAGGCACG
ATGCGGGACG CTCTGGCCGC CTCCCGCGCC CCCGTCGTGT TCACCCACTC CGGTGCGCGC
TCGGTCACTG ACCACCCGCG CAACGTCCCC GATGACGTCC TGGACCGCTT GGCCTCGAAC
GGTGGCGTCT GCATGGCGAC CTTCGTGCCC AGCTTCGTCA ACCAGGGCGC CGCGGACCAC
CGCTTCGAGC GGGAGGACGC TGCCCGGGCT GCGGGCCTCG AACCCACCTC CGACGGATGG
CACCCCTTCC TCGACCGGTA CATGGAGGAG CACCCGCCCC CCGTGGCCAC GATGGACGAC
GTGGTGGCAC ACATCGAGCA CCTGCGTGAG GTCGCGGGGA TCGACCACAT CGGGCTCGGC
GGCGACTATG ACGGCACGCC CACCCTGCCC GAGGGTCTCG AGGACGTGAC CGGCTACCCA
CGCCTGCTGG CCGCGCTGGC GGACCGCGGA TGGAGCCGGG ACGACCTGGA GAAGCTCGCG
GGGGCCAACA TCCTGCGTGT GCTCGAGGCT GCGGACACCG TGGCCGACGT GTTGAGCGAC
GAGCCCGGGC GGCGGTGGCG CATCGAGCAG CTGGACAGCT GA
 
Protein sequence
MTSSGHPQMS SAGLPHEALA TAGRGRTPAG HNGGMTDIAA VRSMLREFPL VDGHNDLPWR 
LHALSQADAE STDLTVADIG AGTLGTATQT HTDLPRLLDG GIGAQFWSVF VPAHLSGDDA
VSMTLEQIDR VRALVEMFPD RLELADTAAD VRRIHASGRI ASLMGAEGGH SINNSLATLR
ILRELGVRYM TLTHNSNVDW ADSATDDENI GGLSRFGTEV VAEMNRIGML VDLSHVSAGT
MRDALAASRA PVVFTHSGAR SVTDHPRNVP DDVLDRLASN GGVCMATFVP SFVNQGAADH
RFEREDAARA AGLEPTSDGW HPFLDRYMEE HPPPVATMDD VVAHIEHLRE VAGIDHIGLG
GDYDGTPTLP EGLEDVTGYP RLLAALADRG WSRDDLEKLA GANILRVLEA ADTVADVLSD
EPGRRWRIEQ LDS