Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPS_4357 |
Symbol | purU4 |
ID | 3521955 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Colwellia psychrerythraea 34H |
Kingdom | Bacteria |
Replicon accession | NC_003910 |
Strand | + |
Start bp | 4589287 |
End bp | 4590147 |
Gene Length | 861 bp |
Protein Length | 286 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 637286798 |
Product | formyltetrahydrofolate deformylase |
Protein accession | YP_271006 |
Protein GI | 71281483 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0788] Formyltetrahydrofolate hydrolase |
TIGRFAM ID | [TIGR00655] formyltetrahydrofolate deformylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.578575 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTTCAG CTACTCATTA TATTTTAACT TGGCAATGCC CGGACAACAC TGGCGTGTTA GCCAAAGTAT CGCAAAGTTT GTTTTCACAC GGTGCTTTTA TCACCGAAAC ATCACAGTAC AGTGACCCGT ATAGTGAAAC CTTTTTTTCA CGTATTGCTT TTGATGATCG TAACTTAACG GTTTCCAGTA GTGAATTCGT TAAAGCATTA AATGAGTTAG CCAAACCACT CGCCATGCAA TATCAATTAC GTAAACGTGC TGATGTACCC AATGTCCTGA TTGCGGTGTC AAAAGACGAT CATTGTTTAG TCTCATTGTT AACTAAGTGG CGTTCTGGTG CTTTACCGAT CAATATCGTT GGCGTTATCT CTAATCATCA ATATTGTCAG GCGTTAAGTG AATGGCATAA TGTTCCCTTT TACCATTTAC CCGTCAATGC AGAGACCAAA CTCGAACAAG AAGCTCAAAT TACCGACTTA ATGGAAGAAC TTAATATCGA CTTATTAGTA TTAGCCCGTT ACATGCAAAT ACTTTCTGAT GGTTTATGCC AGCAACTACA AGGTAAGGCC ATTAATATCC ATCACTCATT CTTACCTAGC TTTAAAGGTG CTCGCCCTTA TCACCAAGCT CATGCTCGTG GTGTTAAAGT CATTGGGGCA ACCGCGCACT ATGTAACAGC GAATCTTGAT GAAGGTCCAA TTATTGCTCA GGAAGTAAAA CCAATTAACC ACGCTTTTAC CATAGAGCAA ATGGTACATA TGGGCCATGA TTTAGAAGCG ACAGCATTAA GCCATGCCGT AAGGATACAT GCTGAACAAC GCGTTTGTAT CAATGGCGAT AAAACAGTTA TTCTGGCTTA G
|
Protein sequence | MSSATHYILT WQCPDNTGVL AKVSQSLFSH GAFITETSQY SDPYSETFFS RIAFDDRNLT VSSSEFVKAL NELAKPLAMQ YQLRKRADVP NVLIAVSKDD HCLVSLLTKW RSGALPINIV GVISNHQYCQ ALSEWHNVPF YHLPVNAETK LEQEAQITDL MEELNIDLLV LARYMQILSD GLCQQLQGKA INIHHSFLPS FKGARPYHQA HARGVKVIGA TAHYVTANLD EGPIIAQEVK PINHAFTIEQ MVHMGHDLEA TALSHAVRIH AEQRVCINGD KTVILA
|
| |