Gene Information |
Locus tag | Haur_1464 |
Symbol | |
ID | 5733349 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1706886 |
End bp | 1707974 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641278602 |
Product | nifR3 family TIM-barrel protein |
Protein accession | YP_001544236 |
Protein GI | 159897989 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0042] tRNA-dihydrouridine synthase |
TIGRFAM ID | [TIGR00737] putative TIM-barrel protein, nifR3 family |
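The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
# Assumption: coordinates are 1-based and inclusive.
START_BP = 1706886
END_BP = 1707974


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))  # 1089 362
```

Under these assumptions the results agree with the listed gene length (1089 bp) and protein length (362 aa).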
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00679382 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGCTTA TGCCTGATAC TCTGACCCAA GCATTGCCCG AATCATTTCA ACTTGGCTCA ATGCGCATTT TTCCAAACAT GGTGCTGGCC CCGATGGCTG GCGTAACCGA TTCAGTCTTT CGGCGGCTGG TGTTGTCGTT GGGTGGCTGT GGCTTGGTTG TTTCTGAGAT GACCAATGCT GCCAGTGTTT CGCCCAAGGC CATGAAACGC CATCGCTTGC TGGATTATCT GCCCGAAGAA CGGCCAATTT CGATTCAACT TTCGGGCAAC GACCCCGATT TGGTGGCAAC CGCCGCCCGC TTTGTTGAAG AACTTGGCCC CGATGTAATT GACATAAACT GTGGCTGCCC TTCGCCCAAA GTCACTGGTG GCGGCCATGG TTCGGCCTTG CTCAAAGATT TGCCCAAAAT GCAGCAAATG CTCAAAGCCG TGTATGCAGC GATCAACATT CCGTTTACGC TCAAATTTCG GGCTGGCTGG GATGAGCAAT CGCTAAATTA TATTGATACC GCTAAAATTG CTGAAGATGC TGGTTGTGCC GCGATTACCT TGCATCCACG TACCAAAGTG CAAGGCTACA GCGGCGATGC CGATTGGTCG CGGGTTGCTG AAGTTGTGCA AGCAGTCTCG ATTCCGGTGA TTGGCTCAGG CGATGTGCGC ACACCAGCCG ATGCTTTGGC GCGTTTGGAG CAAACTGGGG TTGCAGCAGT GATGATTGGT CGAGCTGCCA TGGCCAATCC ATGGATTTTT CGCCAAATTG CTCAATTGCG GGCAGGCGAA CCCATATTTG TGCCAACACC AAGCGATAAA CGTGATTTGC TGGTGCGCTA TGTTGATATG TGTGCTGAAA CTATGGTTGA GCGCCAAGCC CTTGGCAAGC TAAAACAACT GATTGGTCAA TTCAGCATTG GTTTGTATGC TAGCAACCAA TTGCGCCGCG ATGTACAACG TGCCAACGAA ATTGAAGCCG CCAAAGCGAT TATTGCCAAC TTTTTTGAGC CATTTATCAG CGGCGCGGTC GAGGCGGTCG AAGTGCCCGA CGAAATCGCC GTGATCAAAG AAGGTTGCGA GAACGGCGCG AATAATTAG
|
Protein sequence | MQLMPDTLTQ ALPESFQLGS MRIFPNMVLA PMAGVTDSVF RRLVLSLGGC GLVVSEMTNA ASVSPKAMKR HRLLDYLPEE RPISIQLSGN DPDLVATAAR FVEELGPDVI DINCGCPSPK VTGGGHGSAL LKDLPKMQQM LKAVYAAINI PFTLKFRAGW DEQSLNYIDT AKIAEDAGCA AITLHPRTKV QGYSGDADWS RVAEVVQAVS IPVIGSGDVR TPADALARLE QTGVAAVMIG RAAMANPWIF RQIAQLRAGE PIFVPTPSDK RDLLVRYVDM CAETMVERQA LGKLKQLIGQ FSIGLYASNQ LRRDVQRANE IEAAKAIIAN FFEPFISGAV EAVEVPDEIA VIKEGCENGA NN
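The gene and protein sequences above are shown in space-separated blocks of ten characters. The following is a minimal sketch of how such a block-formatted gene sequence might be cleaned, translated, and summarised for GC content. The helper names are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs in its permitted start codons, so the standard codon table is used here; this sketch does not force an alternative start codon to methionine.

```python
# Standard codon table in NCBI order (bases T, C, A, G).
# Table 11 uses the same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean_sequence(raw: str) -> str:
    """Remove whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty sequence)."""
    seq = clean_sequence(raw)
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def translate(raw: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean_sequence(raw)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        first, second, third = (BASES.index(base) for base in seq[i:i + 3])
        residue = AMINO_ACIDS[16 * first + 4 * second + third]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


if __name__ == "__main__":
    # First three blocks of the gene sequence listed above.
    print(translate("ATGCAGCTTA TGCCTGATAC TCTGACCCAA"))  # MQLMPDTLTQ
```

Passing the full gene sequence to `translate` and comparing the result with the listed protein sequence (after `clean_sequence`) is one way to confirm the two fields are consistent.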