Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sde_2648 |
Symbol | |
ID | 3968506 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Saccharophagus degradans 2-40 |
Kingdom | Bacteria |
Replicon accession | NC_007912 |
Strand | - |
Start bp | 3349193 |
End bp | 3350344 |
Gene Length | 1152 bp |
Protein Length | 383 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 637921746 |
Product | uroporphyrinogen decarboxylase HemE |
Protein accession | YP_528120 |
Protein GI | 90022293 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000000000278082 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.0916898 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTAAAT GCCTGATGGA AAGTATAGTG GAATTAGAGT TGCCTCAAGG TAACCCAATT AAAATAAAAT CCGTAGCAGT CGAGCATTAC AAGGTTCCGC TAGCTGAAGT GTTGTCTGAT GCAAAGCACG GGGATCATAC CTATTTTGAG CTAATAGTTA GCCGTATCAC TTGCCAAAAT GGTGTAGAAG GTGTGGGTTA TACCTATACC GGTGGGTCCG GCGGTTCGGC AATCTATTCG CTATTAGTCG ATGAAATTAA GCCTATGCTT GTTGGGCGGG ACGCCACCCA AATTCTAGCC ATATGGGAAG AAATTTATTG GCGCTTACAT TATGTTGGGC GCGGTGGTTT AGTTAGCTTT GCTCAGTCAG CGGTTGATAT TGCATTGTGG GATATTCGCT GCAAGTTGTT GGGGCAACCC CTGTGGAAAG TGGCGGGCGG TTTAAGCAAT AAAACACGCT GCTATGCCGG CGGTATAGAT TTAAATTTTT CGCAAGAAAA ACTATTAAGC AATATACAAG GTTATTTAGA CGCGGGCTTT AATGCTGTAA AAATTAAAGT TGGCAAAGAT AATATTAAAG AAGATATTGC GCGTGTACGC GCAGTGCGAG AGTTAATTGG CAAAGATACC ACATTTATGG TGGATGCCAA CTACTCCATG ACCAAAGAAA AAGCCATTCG TTTTGCTAAC GCCATAGAAG ACCAAAATAT TACTTGGTTT GAAGAGCCAA CATTGCCAGA CGATTACCAA GGCTATGCCG ATATCGCTCA AGCAATATCA ATACCCCTAG CTATGGGTGA AAACCTACAC ACTATTCACG AATTTACCTA TGCCGTGCAA CAAGCCAAGC TTGGTTTTTT GCAGCCCGAT GCTTCTAATA TTGGTGGTAT TACTGGTTGG TTGAACGTTG CAAGTTTAGC AAACGCACAC AACTTACCGG TGTGCAGTCA CGGCATGCAA GAGTTGCACG TTTCACTTAT GTCGTCTCAG CCCAATGCGG GTTATTTAGA AGTTCACTCC TTTCCTATCG ACCAATACAC AACACAACCG CTAGCAATGG AAAACGGTTA CGCACTAGCA CCAGATATAG AAGGCACGGG TGTTGTGTTT GTCGATGAAT TATTACGTGG CCATTTGGCT AAAAAATCCT AA
|
Protein sequence | MSKCLMESIV ELELPQGNPI KIKSVAVEHY KVPLAEVLSD AKHGDHTYFE LIVSRITCQN GVEGVGYTYT GGSGGSAIYS LLVDEIKPML VGRDATQILA IWEEIYWRLH YVGRGGLVSF AQSAVDIALW DIRCKLLGQP LWKVAGGLSN KTRCYAGGID LNFSQEKLLS NIQGYLDAGF NAVKIKVGKD NIKEDIARVR AVRELIGKDT TFMVDANYSM TKEKAIRFAN AIEDQNITWF EEPTLPDDYQ GYADIAQAIS IPLAMGENLH TIHEFTYAVQ QAKLGFLQPD ASNIGGITGW LNVASLANAH NLPVCSHGMQ ELHVSLMSSQ PNAGYLEVHS FPIDQYTTQP LAMENGYALA PDIEGTGVVF VDELLRGHLA KKS
|
| |