Gene Information |
Locus tag | Sde_2887 |
Symbol | |
ID | 3968113 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Saccharophagus degradans 2-40 |
Kingdom | Bacteria |
Replicon accession | NC_007912 |
Strand | + |
Start bp | 3659982 |
End bp | 3661409 |
Gene Length | 1428 bp |
Protein Length | 475 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 637921984 |
Product | prolyl 4-hydroxylase, alpha subunit |
Protein accession | YP_528356 |
Protein GI | 90022529 |
COG category | [I] Lipid transport and metabolism; [P] Inorganic ion transport and metabolism; [R] General function prediction only |
COG ID | [COG2146] Ferredoxin subunits of nitrite reductase and ring-hydroxylating dioxygenases; [COG3239] Fatty acid desaturase |
TIGRFAM ID | |
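The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 3659982
END_BP = 3661409
GENE_LENGTH_BP = 1428
PROTEIN_LENGTH_AA = 475


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks agree with the values reported in the table.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 3661409 - 3659982 + 1 gives 1428 bp, and 1428 / 3 - 1 gives 475 aa, matching the reported gene and protein lengths.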
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0125851 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00158698 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATAAACC CGAGCGAATA TATTATTGAA GAGCAGTGGT TTAACGCCGA GCTACCACGG CAAGAATTTA GGCAATTTAT GCAAAGAAAT AACTATAGGG GCCTGCTAGA TACAGCACTA TGGTTTACCT GCTTAGCCTT GAGCGGATAT TTAGCCTACA CATTGTTAGG CACTTATTGG GCAATACCCG CGTTTTTTGT TTACGGGGTG TTTTATTGTG CTTGCGATGC GCGCTGGCAT GAGAGCAGCC ACGGCACGGT TTTCCATACA GCATGGCTTA ATACAGCGCT GTGCTTTATA GCAACCGCTA TGCAGCAGCG GGATATTATT TTTACCCACT GGTCACATGT GCGCCATCAC TCTTATACAT TAATTAATGA TATAGACCCA GAAATAACAG TTACCCGCCC GCCCACATTT TGGGAGCACT TTTTAAATTT TTGGAGCCTA GGCGAAGCTA AATACTATAT ACCTATTCTC TTTCAGCATG CATTTGGCAT TGTAAGTAAA GATGCAAAGA AGTTTGTACC CAAAGAGGAG TACACCAAAA TGTTTTGGTG GGCCCGCGCA AGCCTATTGG TAAACCTTAT CCCCATAGCA CTGTGTTTCT ACTTACATAG CGTAGTGCCA ATTTTATTTT TTGGTTTACC TAAAATATAT GGCAACTTAA TTCAGCGCGC ATTTATTTTG GCGCAACATG CAGGGCTAGA TGACGACACA TGGGATCATC GCCGCAATTC GCGCACTATA AAGGTTAACC CACTACTTGG GTTTCTTTAT ATGAATATGC AGTACCACAC CGAGCACCAT ATGAACCCAT TAATGCCGTT TCACCAATTG CCTAAGTTTT CGAAACGCAT TGCAGACCAA ATGCCCAAGC CCTACAACGG ACTGATTGCG GTGTACAAAG AAATGCTACC AGCACTGCGC AAACAAGCCA AAGACCCAAC TTATTTTGTA GAGCGCGAAC TACCCACCGC CAAACAAGAA CGAAAAGTAA AGCGTATTAA ACCGTTAGAG CCCGATGCAG TAATGGTTCC CATGGCGGCG AAAAAGCAAA CATCTGGCCA AACAGACGTT ATTGCAAGCG ATAGCCCCCT TGCCGAAAAC ATACAATGGC ATAAAGCAAT AGTAGCGCAT GAGCTAGGCG AAAATGATGC CGTAAAAGTT TGCATTAACG GGAATCACTA CGCTATTTAC CAATTACAAC ATGGCAGCTA TCACGCCACC GATGGTATTT GCAGCCACGA ACACGCATTC CTCGCCGACG GGCTAGTAGA CGATGGCAAA GTGCTGTGCC CGAAGCACAA TTCAAAGTTT TGCATTAAAA CGGGAAAGGC AATGAACAGA CCGGCTAAGC AGCCAATTAA CGTTTACCCC ATTAAAAAAG AAGGCGATGA CTTGCTAATT GGTTTAAGTA TTCAATAG
Protein sequence | MINPSEYIIE EQWFNAELPR QEFRQFMQRN NYRGLLDTAL WFTCLALSGY LAYTLLGTYW AIPAFFVYGV FYCACDARWH ESSHGTVFHT AWLNTALCFI ATAMQQRDII FTHWSHVRHH SYTLINDIDP EITVTRPPTF WEHFLNFWSL GEAKYYIPIL FQHAFGIVSK DAKKFVPKEE YTKMFWWARA SLLVNLIPIA LCFYLHSVVP ILFFGLPKIY GNLIQRAFIL AQHAGLDDDT WDHRRNSRTI KVNPLLGFLY MNMQYHTEHH MNPLMPFHQL PKFSKRIADQ MPKPYNGLIA VYKEMLPALR KQAKDPTYFV ERELPTAKQE RKVKRIKPLE PDAVMVPMAA KKQTSGQTDV IASDSPLAEN IQWHKAIVAH ELGENDAVKV CINGNHYAIY QLQHGSYHAT DGICSHEHAF LADGLVDDGK VLCPKHNSKF CIKTGKAMNR PAKQPINVYP IKKEGDDLLI GLSIQ
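The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any computation. The following is a minimal sketch, assuming plain Python with no external libraries, of how the display blocks could be cleaned and a GC percentage computed. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example uses only the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA sequence."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


if __name__ == "__main__":
    # First two display blocks of the gene sequence above (illustration only).
    first_blocks = "ATGATAAACC CGAGCGAATA"
    dna = clean_sequence(first_blocks)
    print(len(dna))         # 20
    print(gc_percent(dna))  # 40.0
```

Applied to the full gene sequence, the same functions could be used to compare the cleaned length and GC percentage against the Gene Length and GC content fields reported in the record.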