Gene Information |
Locus tag | Ent638_1740 |
Symbol | |
ID | 5112479 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Enterobacter sp. 638 |
Kingdom | Bacteria |
Replicon accession | NC_009436 |
Strand | - |
Start bp | 1888837 |
End bp | 1889865 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640491929 |
Product | hemin-degrading family protein |
Protein accession | YP_001176470 |
Protein GI | 146311396 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3720] Putative heme degradation protein |
TIGRFAM ID | |
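The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1888837, 1889865)
print(length, protein_length(length))  # prints: 1029 342
```

Both values agree with the Gene Length and Protein Length rows of this record.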
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00804322 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0442861 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAATCACT ACACCCGCTG GCTTGAGCTA AAAGAAGAAA ACCCAGGAAA ATACGCGCGT GACATCGCCG GGCTGATGAA CATCCGCGAA GCGGAACTGA CGTTCGCCCG CGTGGGCCAC GACGCCCGGC GTTTGCGCGA GGACATCCGT GCAATCCTCG GCGCACTGGA AACCGTCGGT GAAACAAAAT GCATCTGCCG CAACGAATAC GCCGTTCACG AACAAGTCGG CACGTTCACC CATCAACACC TCAACGGCCA TGCCGGGCTT GTGCTGAACC CGCGCGCGCT GGATCTGCGC CTGTTCCTGA ACCAATGGGC GAGCGTGTTT CACATCAGCG AAGCCACCGC ACGCGGTGAA CGCCAGAGCA TTCAATTCTT TGACCATCAG GGCGATGCGC TGCTCAAGGT TTATACCACC GATAACACTG ACGCTGGTGC ATGGGGCGAT GTTTTGACCC GTTTTATCAT TGCCGATAAT CCTCCGCTGG AACTGAAAGC GGCTGATGTG GCAGTCAACA GCACGTCGCC CGATGCTGAA AAAGTGGATG CAGAATGGCG AGCCATGACC GACGTGCATC AGTTCTTTAG CCTCCTGCAG CGCCACAGTC TGAGCCGTCA GCAGGCGTTC CGTCTGGTGA GTGATGACCT GGCCTGCAAA GTGGACAATA CCGCGTTGGC GCAACTGCTG GATGCGGCGC ATCAAAGCGG AAACGAAATT ATGATCTTCG TGGGCAACCG TGGCTGCGTG CAGATCTTTA CCGGCGCAGT GGAAAAAGTG GTGCCGATGA AGGGCTGGCT GAACATTTTC AATCCGACGT TTACCCTGCA CTTACTGGAA GAGACGATCG CGGAGACGTG GATTACGCGT AAACCAACCA CCGACGGCCA CGTCACCAGC CTCGAACTGT TCGCCGCAGA CGGCACGCAG ATTGCGCAAC TCTATGGCCA ACGTACCGAA GGCGAACCGG AGCAAACGCA GTGGCGCGCG CAGATCGACG CCCTGACACC AAAAGGGCTC GCTGCATGA
Protein sequence | MNHYTRWLEL KEENPGKYAR DIAGLMNIRE AELTFARVGH DARRLREDIR AILGALETVG ETKCICRNEY AVHEQVGTFT HQHLNGHAGL VLNPRALDLR LFLNQWASVF HISEATARGE RQSIQFFDHQ GDALLKVYTT DNTDAGAWGD VLTRFIIADN PPLELKAADV AVNSTSPDAE KVDAEWRAMT DVHQFFSLLQ RHSLSRQQAF RLVSDDLACK VDNTALAQLL DAAHQSGNEI MIFVGNRGCV QIFTGAVEKV VPMKGWLNIF NPTFTLHLLE ETIAETWITR KPTTDGHVTS LELFAADGTQ IAQLYGQRTE GEPEQTQWRA QIDALTPKGL AA
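The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes that the gene sequence is listed on the coding strand (it begins with ATG and ends with TGA, although the gene lies on the minus strand of the replicon) and that the amino-acid assignments of translation table 11 are the standard ones. The `translate` helper and `CODON_TABLE` are hypothetical names for illustration; alternative start codons are not handled.

```python
from itertools import product

# Codon order follows the usual T, C, A, G convention for each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = dna.replace(" ", "").upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGAATCACT ACACCCGCTG GCTTGAGCTA"))  # prints: MNHYTRWLEL
```

The output matches the first ten residues of the protein sequence; applying the same function to the full gene sequence is one way to check the record for consistency.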