Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cag_0242 |
Symbol | |
ID | 3747902 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | + |
Start bp | 273517 |
End bp | 274563 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 637772767 |
Product | dihydrouridine synthase TIM-barrel protein nifR3 |
Protein accession | YP_378561 |
Protein GI | 78188223 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0042] tRNA-dihydrouridine synthase |
TIGRFAM ID | [TIGR00737] putative TIM-barrel protein, nifR3 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0191961 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAATTG GAGCCCTCAG TATTGAGCGA CCGATAATTC TTGCCCCAAT GGAAGATGTT ACCGATCGCT CATTTCGTCA ACTCTGCAAA CGGCATGGTG CCGATATTGT TTATACCGAA TTTATTAGTG CCGAAGCGTT ACGGCGCGGT GCTGAAAAGT CGCTCCGTAA GCTCAAAGTT GACGACATGG AACGCCCTTA CGCTATCCAA ATTTTTGGGA GCACGGTTGA GTCCATGGTT GAAGCGGCAA TCATTGCAGC CTCCTACCAG CCCGATTATC TCGATATTAA CTTTGGCTGT CCCGTAAAGA AAGTTGCAGG CAAAGGGGCT GGCGCGGCAC TGTTACGTGA GCCTGAAAAA ATGGCAGCAA TTAGTGCAGC AGTGGTTAAA GCCGTTGCAA TTCCCGTAAC GGCTAAAACC CGCATTGGCT GGGATTTCGA TTCCATTAAC ATTCTTGATA TTATTCCCCG CCTTGAAGAT GCTGGTATTC AAGCCTTAGC GCTGCACGGG CGTACTCGTA GCCAGATGTA TAAAGGCACT GCCGATTGGC AATGGATTCG TGCGGCAAAA GAGAAAGCTC ACATTCCGCT GATTGCCAAT GGCGACATTT GGAACGCTGA AGATGCCGCT CGCATGTTTG ATGTTACCGC TGCTGATGGC GTTATGATTG GGCGAGGTTC CATTGGCAAC CCCTTTATTT TTCAGCAAGC CAAACATTTC CTTGCGCACG GTACGTTGCT CCCACCGCCC GATTTTCGCC AACGCATTGC CGTTGCCATT GAGCACTTCC AGCTCTCCTT AGCATATAAA GGTGAAAAAT ATGGCGTGCT TGAAATGCGC CGCCACTATT CCACCTACCT CAAAGGCTTA CCAATGGTAT CGCGTGTGCG CAACAAGTTG GTGCGTGAAG ATAATCCTAA CAATATTGTG GAACTGTTGT TAGCGTACCG CGAAGAGTGC GAAAGTTACG CTCGTGAAGG GAGGTTAAGC GAAGGGGTAG AGTTTCTTAA TGATCATTCA CCAAAGCTGG AAATGAGGGA GGGTTAA
|
Protein sequence | MRIGALSIER PIILAPMEDV TDRSFRQLCK RHGADIVYTE FISAEALRRG AEKSLRKLKV DDMERPYAIQ IFGSTVESMV EAAIIAASYQ PDYLDINFGC PVKKVAGKGA GAALLREPEK MAAISAAVVK AVAIPVTAKT RIGWDFDSIN ILDIIPRLED AGIQALALHG RTRSQMYKGT ADWQWIRAAK EKAHIPLIAN GDIWNAEDAA RMFDVTAADG VMIGRGSIGN PFIFQQAKHF LAHGTLLPPP DFRQRIAVAI EHFQLSLAYK GEKYGVLEMR RHYSTYLKGL PMVSRVRNKL VREDNPNNIV ELLLAYREEC ESYAREGRLS EGVEFLNDHS PKLEMREG
|
| |