Gene Information |
Locus tag | Tery_1127 |
Symbol | |
ID | 4242859 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 1774989 |
End bp | 1775900 |
Gene Length | 912 bp |
Protein Length | 303 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 638106350 |
Product | prolyl 4-hydroxylase, alpha subunit |
Protein accession | YP_720962 |
Protein GI | 113474901 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG3751] Predicted proline hydroxylase |
TIGRFAM ID | |
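The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon (the sequence in this record ends in TAA).

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS that includes exactly one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which is not translated.
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(1774989, 1775900)
print(length, protein_length_aa(length))
```

The strand field does not affect this calculation, since the length depends only on the span between the two coordinates.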
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.328998 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.000557231 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGATAACTA CTGTATCCGA AAGCGAACCC CAAGAAGTAA AAATTCAACT CCTATTAGCA GGAGGGCATC AATATACTAT TTATCTCAAA TCTGATGCAC CCTTACTACA ACTTTTAGCC AAAACATTAT TATCTAAACA TCAAAAAAAT GAAAATTCTA CCCTATTTCA AATTCCTCTA GCAGAAGGTC ATTCTGCTTT ATGTTTTCCC AGTGAACATT TAGTCGGAAT GGTAACCGAA CCACCAATTT CTTTACAGAA TTTACAGCCA CAAGATACTA AAAATAATCT CAAAAATTCT ACTAATACTA CTCCTAAAAT AAACAATTTA GAATCTCATT ATTTACTGAT AGAAAATTTT CTAAGCCAAT CAGAAAATAA ACAATTATTG AATTATGTTT TGCAAAGAAA ATCAGATTTT TCTCCAACTA CAACATCAAC TAAAGCTGAA AATTATCGTC GTTCTTTAGT ATTGTACTCT TTCCCAAAAT TCCAAGAATT GATAGTTAAT AGAATTAAAG AAATTTTTCC TGATGTTTTA AATAAGTTGA GTATTCCTGT ATTTTCAATT GCTGAAATTG AAAGTCAGTT AACAGCTCAT AACAATAATA ACTTTTATAA AATTCATAAT GATAATGGTT CTCCTGATAC TGCAACTAGA GTTCTTACTT ATGTTTACTA TTTTTATCGA GAACCTAAAG CATTTACAGA AGGTAAGCTA ATAATTTATG ATAGTAAAAT TCAAGGAAAA TACTATGTAA AAGCTCAAAC ATTTAAAAGT ATAGAACCTA CAAATAATAC TATTGTATTT TTCCTCAGTC GTTATATGCA TGAAGTTTTG CCGGTAACTT GTCCTTCTCA AGATTTTGCG GATAGTCGCT TTACTATTAA TGGTTGGATA CGTCGCAGTT AA
Protein sequence | MITTVSESEP QEVKIQLLLA GGHQYTIYLK SDAPLLQLLA KTLLSKHQKN ENSTLFQIPL AEGHSALCFP SEHLVGMVTE PPISLQNLQP QDTKNNLKNS TNTTPKINNL ESHYLLIENF LSQSENKQLL NYVLQRKSDF SPTTTSTKAE NYRRSLVLYS FPKFQELIVN RIKEIFPDVL NKLSIPVFSI AEIESQLTAH NNNNFYKIHN DNGSPDTATR VLTYVYYFYR EPKAFTEGKL IIYDSKIQGK YYVKAQTFKS IEPTNNTIVF FLSRYMHEVL PVTCPSQDFA DSRFTINGWI RRS