Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4822 |
Symbol | |
ID | 5736667 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 6148920 |
End bp | 6149900 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641281987 |
Product | nucleotidyl transferase |
Protein accession | YP_001547580 |
Protein GI | 159901333 |
COG category | [J] Translation, ribosomal structure and biogenesis [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1208] Nucleoside-diphosphate-sugar pyrophosphorylase involved in lipopolysaccharide biosynthesis/translation initiation factor 2B, gamma/epsilon subunits (eIF-2Bgamma/eIF-2Bepsilon) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTGCAG TTATTTTAGT TGGCGGATTA GGCACACGCC TCCGCCCATT GACCAATCAA CTCCCCAAGC CACTTGTGCC AATTGCCGGC GAAGCCTTGA TGAGCCGAAC CTTGCGGCGT TTGTATAAGC AAGGGGTGCG TCATGTGATT TTGGCGGTGC AATATTTGGC CGAACAATTT TTGGCGGCCT ATGGCGATGG CGCGGCTTTT GGGCTAGATT TGCAGATTGT TCAAGAGCCA GAAGCCCTAG GCACAGCTGG CGCAGTACGC TACGCCCTTG ATCAAACCAA TTTGCTTAAG GCTGGGCCGA TTTTAGTGCT GAATGGCGAT GAACTGACTG ATTTCGATGT GGCCCAACTC TGGCAAGCTC ATGGCCAATT TGGCGGTGTG GCGACGATTG CCGTGCGCCA AGTGGCCGAT ACCTCAGCCT TTGGGGTAGT TGCTAGCGAT GCGAATCAAC GAGTGTATGC CTTTCAAGAA AAACCTGCGG CTGGCACGGC CTTGGCCAAC ACCATCAATA GCGGAGCCTA TGTATTTGAG CCAGCGGCAC TTGCCCAGAT TCCAGCCCAA GGTTTTGCTA TGCTCGAACG CGATCTCTTC CCCAGCTTGC TAGCGACTCA AGCCCTGATT TACGCCTATC AACACAACGC CTACAGCCAA GATATTGGCA CATTGGCAGG CTATTTAGCC GCGAATGAAG CGGTATTGTT GGGCCATTTG CCGCATGAAA CCGTGCATGG CATACAATAT GCAGCAGGAG TGTGGGCTGC GGCTGATGCT CAAATCAGCC CTAGCGCCCA ATTAATTGCC CCGATTATGC TTGGCAGTGG CTGTGTGGTG GGCGAGCATG CCCGACTTGA ACGGGTGATC GCATGGGATC GTGTTACAAT TGAAGCCGCT GCAAACCTAA ACAATGTCGC CATTGCCAAT GATGTGCAGG TTGCCCACCA TGCAACTGTC GAAGGTCTCG CGCTTGGTTA A
|
Protein sequence | MRAVILVGGL GTRLRPLTNQ LPKPLVPIAG EALMSRTLRR LYKQGVRHVI LAVQYLAEQF LAAYGDGAAF GLDLQIVQEP EALGTAGAVR YALDQTNLLK AGPILVLNGD ELTDFDVAQL WQAHGQFGGV ATIAVRQVAD TSAFGVVASD ANQRVYAFQE KPAAGTALAN TINSGAYVFE PAALAQIPAQ GFAMLERDLF PSLLATQALI YAYQHNAYSQ DIGTLAGYLA ANEAVLLGHL PHETVHGIQY AAGVWAAADA QISPSAQLIA PIMLGSGCVV GEHARLERVI AWDRVTIEAA ANLNNVAIAN DVQVAHHATV EGLALG
|
| |