Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1221 |
Symbol | |
ID | 5733114 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1412296 |
End bp | 1413285 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641278361 |
Product | nucleotidyl transferase |
Protein accession | YP_001543997 |
Protein GI | 159897750 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1209] dTDP-glucose pyrophosphorylase |
TIGRFAM ID | [TIGR01208] glucose-1-phosphate thymidylylransferase, long form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGTAA TTATTCCTAC CGCCGGTCTT GGAACCCGCC TACGTCCACA TACCCATACC CGTCCAAAGC CGTTGGTTCC GGTTGCAGGC AAAGCCGTAA TTGGCCACCT GCTTGATAAG CTCAAGGTGC TACCACTCGA CGACGTGGTG TTTATTACCG GGTATTTGGG TACACAAATT GAGGAATATG TTCGTAAGAA CTATAACTTC AAGAGCCATT TTGTTGAACA AACTGAGCTA AAAGGCCAAG CCCATGCAAT TGCTTTGGCC CGCGAGATGG TCTCTGGCCC AACCTTGATT TTATTCGTCG ATACGATTTT TGAAGCAAAC CTGAATGTTT TGAACCAAAC TGATGCCGAC GGCGTGATCT ATGTAAGCGA AGTTGAAGAT CCCTCGCGCT TCGGGGTGGC ATTGCTCGAA GATGGGATTA TTACAAAACT GGTGGAAAAA CCAAGTACGC CAGTTTCCAA TTTGGCCTTG ATCGGCGCAT ACTATGTGCG TGAAGTCAAA GAATTGTTCG CAGCGATCGA TGTGCTGATC GAGCAGAATA TTCAAACCAA AGGCGAGTTT TATTTGGCCG ATGCACTCCA ACTCATGATT AGCAATGGCA CGCGATTTAG TGCCGAAACT GCAACCATGT GGGAAGATTG TGGCACAGCC CCCGCCTTGT TACGCACCAA TCGCTATTTA TTGCAACACG AAACTGGCAA CGTCGAACAA CGTGATGGCG CGATCATCGT TCCACCAGTC TTTATTGGCG AGAATGTTGA GATTCGCAAC TCAATTATTG GGCCATACGT CTCAGTTGCT GACCATAGCG TGATTGTCGA TTCGATTGTG CGCGATTCGA TTATCAATCA AGGAGCCAGC ATTCAATCAT CAACCTTGGA AGGCTCACTT ATCGGTGAAG GAGCGCATAT CAAAGGCGAA TTCCAACACC TTAACGTCGG TGATTCATCA GTTATTACAT TTGGTAGCAC GATTCAGTAA
|
Protein sequence | MNVIIPTAGL GTRLRPHTHT RPKPLVPVAG KAVIGHLLDK LKVLPLDDVV FITGYLGTQI EEYVRKNYNF KSHFVEQTEL KGQAHAIALA REMVSGPTLI LFVDTIFEAN LNVLNQTDAD GVIYVSEVED PSRFGVALLE DGIITKLVEK PSTPVSNLAL IGAYYVREVK ELFAAIDVLI EQNIQTKGEF YLADALQLMI SNGTRFSAET ATMWEDCGTA PALLRTNRYL LQHETGNVEQ RDGAIIVPPV FIGENVEIRN SIIGPYVSVA DHSVIVDSIV RDSIINQGAS IQSSTLEGSL IGEGAHIKGE FQHLNVGDSS VITFGSTIQ
|
| |