Gene Ksed_24400 details

Gene Information

Locus tag: Ksed_24400
Symbol:
ID: 8373943
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Kytococcus sedentarius DSM 20547
Kingdom: Bacteria
Replicon accession: NC_013169
Strand:
Start bp: 2521573
End bp: 2522649
Gene Length: 1077 bp
Protein Length: 358 aa
Translation table: 11
GC content: 76%
IMG OID: 644992666
Product: hydroxymethylbilane synthase
Protein accession: YP_003150170
Protein GI: 256826210
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0181] Porphobilinogen deaminase
TIGRFAM ID: [TIGR00212] porphobilinogen deaminase
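
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the record above.
start_bp = 2521573
end_bp = 2522649
gene_length_bp = 1077
protein_length_aa = 358

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is the stop codon and contributes no residue.
expected_protein_length = codons - 1

print(span, codons, remainder, expected_protein_length)  # 1077 359 0 358
```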


Plasmid Coverage information

Num covering plasmid clones: 40
Plasmid unclonability p-value: 0.485393
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGTGCCG CGCAGGAGAC CGCCGCCGGG GCGCAGGGCC GGGTGGTCCG GCTCGGGACG 
CGCCGCAGCG ACCTGGCCAC CACCCAGTCC ACCCACGTGG CGGACGCGCT GCGTGCCCGC
GGTCACGAGG TCGAGCTGGT GCTGGTGACC ACCGAGGGCG ACGTCAACCG GGCCCCGCTG
AGCCGCATCG GGGGCACCGG CGTGTTCGTC AGCGCCCTGC GCGACGCGCT GCTGGCGGGG
GAGATTGACA TCGCGGTGCA CTCGCTCAAG GACCTGCCGG TGGCGCAGCC GCAGGAGCTC
GTCATCGCCG CCGTGCCCGA GCGGGAGGAC CCTCGTGACG CACTGGTGGC CCGGGACGGC
CTCACCCTGG AGACGCTGCC GGCCGGCGCG CGGGTGGGCA CCGGCTCCCC GCGCCGCGAG
GCCCAGCTGC GCGTGGCCCG CCCCGACCTC GAGGTCGTGG ATCTGCGCGG CAACGTGCCG
CGCCGGTTGG CCACGGTCGC CGAGGGCGAG CTCGATGCCG TCGTGCTGGC CGGCGCCGGT
CTGCGCCGCC TCGGCCTCAC CGAGCACGTG ACCCAGTGGC TGGAGCCGCA GGTGATGGTG
CCGGCCCCGG GGCAGGGCGC GCTGGCCGTG GAGTGCCGCA GTGACGACGC CGAGAGCATC
GCGATGCTGG AGCCGCTGGA CCACCTACCC ACGCGCACGG CCACCACGGC CGAGCGTGCC
GTCCTGGGCA CCCTCGAGGC CGGGTGCTCC GCGCCGATCG GTGCGCTGGC CCGCCACGAC
GGTGCCCGCG TGGACCTGAT GACTTTCGCC GGCCCTCGCG ACGGCAGCCG CGCGCTGCTC
GAGGCCGACT GCGCGCGCGG CGTGCCCGCC GGCCACCCCG CAGACACGCC AGACGACACC
CCCGCAGACA CCCCCGACGA CAGCAGGAAC GACGACCGCG TCGCCCTGGA CACCGCCGCC
CGTGAGCTGG GCCGGCGGGT GGCCGAGGCG CTGCGAGGTG GAGGGGCCGA CGTCCTCCCC
GGCGTCGGAG AGGTTCCTGC CCACCCCGAC CCGACGTCCC CGAAGGAGAA CTCGTGA
 
Protein sequence
MSAAQETAAG AQGRVVRLGT RRSDLATTQS THVADALRAR GHEVELVLVT TEGDVNRAPL 
SRIGGTGVFV SALRDALLAG EIDIAVHSLK DLPVAQPQEL VIAAVPERED PRDALVARDG
LTLETLPAGA RVGTGSPRRE AQLRVARPDL EVVDLRGNVP RRLATVAEGE LDAVVLAGAG
LRRLGLTEHV TQWLEPQVMV PAPGQGALAV ECRSDDAESI AMLEPLDHLP TRTATTAERA
VLGTLEAGCS APIGALARHD GARVDLMTFA GPRDGSRALL EADCARGVPA GHPADTPDDT
PADTPDDSRN DDRVALDTAA RELGRRVAEA LRGGGADVLP GVGEVPAHPD PTSPKENS
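
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the tool that generated this record. It uses the standard codon assignments, which translation table 11 shares with the standard code, and it assumes that the first codon is a valid initiator and is therefore read as methionine (the gene starts with GTG, an accepted start codon under table 11). The names `translate_cds`, `gc_fraction`, and `FIRST_60` are hypothetical helpers introduced for illustration.

```python
# Codon table in NCBI order: first, second and third base each cycle through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate_cds(seq):
    """Translate a CDS codon by codon, stopping at the first stop codon.

    Assumption: the first codon is an initiator, so it is emitted as 'M'
    whatever amino acid it would encode internally.
    """
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        residues.append("M" if pos == 0 else aa)
    return "".join(residues)


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence shown above.
FIRST_60 = "GTGAGTGCCG CGCAGGAGAC CGCCGCCGGG GCGCAGGGCC GGGTGGTCCG GCTCGGGACG"

print(translate_cds(FIRST_60))  # MSAAQETAAGAQGRVVRLGT
```

Passing the full gene sequence to `translate_cds` and `gc_fraction` gives values that can be compared with the protein sequence and the GC content field in this record.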