Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4165 |
Symbol | |
ID | 5736026 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5311411 |
End bp | 5312478 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641281319 |
Product | glucose-1-phosphate thymidyltransferase |
Protein accession | YP_001546925 |
Protein GI | 159900678 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1209] dTDP-glucose pyrophosphorylase |
TIGRFAM ID | [TIGR01208] glucose-1-phosphate thymidylylransferase, long form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.387622 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGGTT TAGTTCTGAG TGGTGGTAAA GGCACACGCC TGCGCCCAAT CACCTATACC AGCGCCAAGC AATTAGTGCC TGTCGCCAAC AAGCCAGTAC TTTTTCGGGT AATCGAAGCC CTGCGCGATG CCAATATTGA TGAAATTGGG ATTGTAATCG GTGATACTGG GGCCGAAGTG CGCAACGCCG TGGGCAATGG CTCACGCTGG GGCGTAAAGA TCGAATATAT TCCCCAAGAA GCTCCGCTCG GCTTGGCTCA CGCGGTCAAA ATTAGTCGTC CGTTTATTGG TGACGATAAA TTTGCACTCT TTTTGGGCGA TAACTGCATC GAAGGCGGAG TTAGTTCGTT GGTGTCGGGC TTTGCTACAT CCGATTACAA TGCCCAAATT GTGCTCAAAC AAGTTGCCAA TCCACAGCAA TATGGTGTCG CTGAGTTGCG TCACGATGGC TCGATCGAAC GCTTGACTGA AAAACCGCGC CAACCGCGCT CAGACTTGGC GTTGGTCGGC ATTTATATGT TCGATCAGCA TATTTGGGAA GCTGTTGAGG CGATCAAGCC TTCTTGGCGG GGCGAGTTGG AAATTACCGA TGCAATTCAA TGGCTGATCG AGCACGATTA CCATGTTCAT GCCCATATTC ACCAAGGCTG GTGGATCGAT ACTGGCAAAC GCGCTGATAT GCTCGATGCC AATCGTTTGG TGCTCGAAGA AATCACGCCC CATGTATCGG GCTTCGTTGA TCGCGATTCG CAATTAGTTG GCAAAGTAAC GATCGAAAAA GGCGCTCAAG TGATCAACAG CGTGATTCGT GGGCCAGCGA TCATCGGCGA AGATACGCGG ATCGTTAACT CATACGTTGG ACCGTTTACC TCAATCTATC ATCATTGTAC GATTGAGGAA AGCGAAATTG AGCACTCGAT TGTCTTGGAA AATAGCGAAA TTATTCGCCT GCCCAACCGC ATCGAAGATA GCTTGATTGG CCGCAATGTC AAATTGCATA CCTCGCCGAT GAAACCCAAA GCCTACCGCT TGATGCTCGG CGATAACTCT GACGTGGGGC TGTTGTAA
|
Protein sequence | MKGLVLSGGK GTRLRPITYT SAKQLVPVAN KPVLFRVIEA LRDANIDEIG IVIGDTGAEV RNAVGNGSRW GVKIEYIPQE APLGLAHAVK ISRPFIGDDK FALFLGDNCI EGGVSSLVSG FATSDYNAQI VLKQVANPQQ YGVAELRHDG SIERLTEKPR QPRSDLALVG IYMFDQHIWE AVEAIKPSWR GELEITDAIQ WLIEHDYHVH AHIHQGWWID TGKRADMLDA NRLVLEEITP HVSGFVDRDS QLVGKVTIEK GAQVINSVIR GPAIIGEDTR IVNSYVGPFT SIYHHCTIEE SEIEHSIVLE NSEIIRLPNR IEDSLIGRNV KLHTSPMKPK AYRLMLGDNS DVGLL
|
| |