Gene Ksed_11020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagKsed_11020 
Symbol 
ID8372610 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameKytococcus sedentarius DSM 20547 
KingdomBacteria 
Replicon accessionNC_013169 
Strand
Start bp1129044 
End bp1130144 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content73% 
IMG OID644991380 
Productamidohydrolase, imidazolonepropionase 
Protein accessionYP_003148907 
Protein GI256824947 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value0.161229 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.0546284 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACCG CACAGCAGCC CGTCTGGCAC CTCACGGGCC AGGTCATCAC CGGCCCCGAG 
GAGGTCCGCT CCGAGGCGTG GGTGGTCGAC GGCCGCCTCA CCTTCGAGGC GCCGCCGGCC
GCGATGCCGA CCGAGCGCCT GGAGGGCTAC GTGCTGCCCG GCCTCGTGGA CGCCCACTGC
CACGTGGGGC TGGAGGCCGA TGGGGGAGTG CCGGCCGAGC GGGCCGAGGA GCACGCGGTC
TCCGAACGCG AGGCCGGTGC GCTGCTCCTG CGCGATGCCG GCTCGCCGGT GGACACCTCG
TGGATCCAGG AGCGCGAGGA CCTGCCGCGG CTCATCCGGG CCGGGCGACA CATCGCCCGC
CCGAAGCGCT ACATCCGCAA CTTCGCCCAC GAGATCGAGC CGGACCAGCT GGTCGAGTAC
GTCCGTCGCG AGGCCCGCGC CGGTGACGGC TGGGTGAAGC TGGTGGGGGA CTGGATCGAC
CGCGACGCCG GTGACCTGCG ACCCCTGTGG CCCGTCGACG TGCTCACCGA GGCGATCGCG
GCGGCCCACG AGGAGGGGGC GCGGGTCACC GCGCACTGCT TCGACGAGCA GTCCCTCTTC
GACTTCGCCG CCGCCGGGAC CGACTGCATC GAGCACGCGA CCGGCCTGAC GCCGGAGTCG
GTGGAGATCT TCGCCGCGCA GGACATCGCG ATCGTCCCGA CGCTCATCAA CATCGAGAAC
TTCCCGGCCT TCGCAGCGGC GGGCGAGGCC AAGTTCCCCG CCTACGCCGC CCACATGCGC
GACATGTTCG AGCGCCGCTT CGAGACCGTC GCCCTCGCAC GGGAGGCCGG GGTGCGCATC
TACGCCGGGA CCGATGCGGG GGGCCAGCTC CCGCACGGCC TGATCGCCCG CGAGGTCGAG
GCCCTGATGT CGGTGGGCAT GAGCGCTACC GAGGCCATCG GGGCCGCGAC CTGGGAAGCC
CGGGAGTGGC TGGGCCACGA GGGCCTGGTC GAGGGCGCGA GTGCCGACGT GGTCGTGTAC
GCCGACGACC CGCGCCAGGA CGTGCGGGTG CTGGCGGACC CGCAGCACGT GCTCCTGCGC
GGCGCGCGCC ACGGTGGCTG A
 
Protein sequence
MSTAQQPVWH LTGQVITGPE EVRSEAWVVD GRLTFEAPPA AMPTERLEGY VLPGLVDAHC 
HVGLEADGGV PAERAEEHAV SEREAGALLL RDAGSPVDTS WIQEREDLPR LIRAGRHIAR
PKRYIRNFAH EIEPDQLVEY VRREARAGDG WVKLVGDWID RDAGDLRPLW PVDVLTEAIA
AAHEEGARVT AHCFDEQSLF DFAAAGTDCI EHATGLTPES VEIFAAQDIA IVPTLINIEN
FPAFAAAGEA KFPAYAAHMR DMFERRFETV ALAREAGVRI YAGTDAGGQL PHGLIAREVE
ALMSVGMSAT EAIGAATWEA REWLGHEGLV EGASADVVVY ADDPRQDVRV LADPQHVLLR
GARHGG