Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_1399 |
Symbol | |
ID | 4270621 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 1603920 |
End bp | 1604990 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 638126155 |
Product | nifR3 family TIM-barrel protein |
Protein accession | YP_742238 |
Protein GI | 114320555 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0042] tRNA-dihydrouridine synthase |
TIGRFAM ID | [TIGR00737] putative TIM-barrel protein, nifR3 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.372686 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGACAC TTAGCCCTGA CTGGCGGGGA CCGACGCGCT CAGGGTATGC TCGCGCTCGA ACAGCGCAAC CTGATTGGAA CATGAACATC GGCCCTTGGA AACTCTCCGG CCGGGTGCTG CTCGCACCCA TGGCCGGCAT CACCGACCTC CCCTTTCGTC AGCTCTGTCG CCAATGGGGT GCCGCCCTCG CGGTTTCGGA GATGCTGTCC GCCGATCCAA CCCTGCGCAA GACGCGCAAG AGCCAATGGC GCGCCACCCT CGCCGACGAC GAGTGCCCAC GGGTGGCGCA GATCGCGGGC GCGGATCCGG TCGCCCTAGC GGAGGCGGCC CGCTACAACG TACAGCGGGG CGCCCAGGTC ATCGACATCA ACATGGGCTG TCCGGCAAAG AAGGTCTGCA ATCGCATGGC CGGCTCCGCA CTGCTCGCGG ACGAGCCCCT GGTGCGCCGG ATCCTGACCG CGGTCGTGTC CGCGGTGGAG GTCCCGGTCA CCCTGAAGTA TCGCACCGGT CCGTCGCCCG AGCGGCGCAA TGCCGTGGCC ATCGCCCGGA TGGCGGAGGA CGCCGGGGTG GCCGCACTGA CGCTTCACGG CCGGACACGG GTCCAGGCCT ACCAGGGTCA GGCCGAATAC CGCAGCGTGG AGGCGGTCTG TCGGGCGGTG GACATCCCGG TCGTCGCCAA CGGCGATATC GATAGCCCAG ACAAGGCGCG GCAGGTGCTG GACGAAACCG GCGCCGATGC GGTCATGGTC GGGCGCGCGG CCCAGGGCCG ACCCTGGCTC TTCCAGGCGA TTCACACCTA TCTGGAGACC GGTACGCGGG TCGCCACGCC TTCGCTGGCG GTGCGCAAGC AGACCCTGCT GACCCACCTG CGCGAGATTC ATCGCTTTTA CGGCGACTGG ATGGGGCCCC GCATCGCGCG CAAGCACATC AAGTGGTATC TACAGGCTCT GCAGGTGGAT CGGTGCCATG TGCAGCCACT GATGCAGCCC ACTGCCCCGG AGGCGCAACG GGTGGCGGTT GCCGATTGCC TGTCACGGCT CAACGAGGCG CCGGCGGCGG CGCCTGCATA G
|
Protein sequence | MQTLSPDWRG PTRSGYARAR TAQPDWNMNI GPWKLSGRVL LAPMAGITDL PFRQLCRQWG AALAVSEMLS ADPTLRKTRK SQWRATLADD ECPRVAQIAG ADPVALAEAA RYNVQRGAQV IDINMGCPAK KVCNRMAGSA LLADEPLVRR ILTAVVSAVE VPVTLKYRTG PSPERRNAVA IARMAEDAGV AALTLHGRTR VQAYQGQAEY RSVEAVCRAV DIPVVANGDI DSPDKARQVL DETGADAVMV GRAAQGRPWL FQAIHTYLET GTRVATPSLA VRKQTLLTHL REIHRFYGDW MGPRIARKHI KWYLQALQVD RCHVQPLMQP TAPEAQRVAV ADCLSRLNEA PAAAPA
|
| |