Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4083 |
Symbol | |
ID | 5735942 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5216028 |
End bp | 5217227 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641281235 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001546843 |
Protein GI | 159900596 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCGTCT TGATGTTGGC TTGGGAATAT CCACCGCATA TTGTTGGCGG GATGGGCAAG CATATTGCTG AGTTGGTTCC AGTGCTTGAT GCGGCTGGAA TCGAAGTGCA TGTGCTTACG CCATGGTTGC GTGGCGGCCC CCAACATGAG CGTTTTGGCA TGCACAGCCA TATTTGGCGG GTTCAGCCAC CAGCCATGCC CGATTATGGC TTTGTTTCGT TTACCCAAGA AACCAACCGC TATCTTGAGC GCTTCGCCCA CGATTTGGGC AAAACCCATG GCCCATTTGA TCTGATTCAT GGCCATGATT GGCTGACCAG CTATTGTAGC GTCGCTTTGA AATATGCTTG GCATACTCCC TTGATTACAA CGATTCATGC AACTGAACGC GGGCGTGGAC GTGGCTCGCT GGGCGGCGAT CATGCCAAAA CGATTAATGG CTTGGAATGG TGGTTGGCCC ACGAAAGTTG GCGGGTAATT GTGTGCAGCG ATTTTATGGC CGACCAGTTG CATCAATTTT TTGGCACGCC CTTTGATAAA CTCGATGTGA TTGCCAACGG CGTGAATGTG CCAACGATTG AATGGCCGAG CCAAGAGCGC CAGCAATTTC GCCAAAAATA TGCCGCTGAT AACGAAAAAG TAGTGTTTAG CATTGCCCGC ATGGTCTACG AAAAAGGCAT TCAAGTGTTG GTTGAGGCAA TTCCACATGT CTTGGCGCAA CGCCGCGATA TCAAATTTGT GATTGCGGGC ATGGGGCCGT TAGCCGAACA ATTGCGCAAC CGCAGCCGTG AGCTTGGCAT CGATGCACAT GTTTATTGGA CGGGCTTCGT GACCGATCAA GATCGCAATT ATCTCTACAA TGTTGCTGAT GTGGCAGTAT TTCCCAGCAT CTACGAGCCA TTTGGGATTG TAGCCTTGGA AGCGATGGCG GCACATTGCC CGGTCATTGT TTCGGATACT GGTGGTTTGC GTGAGGTGGT GCAAATTCAC GAAACAGGCT TAACGGTTTA CCCTGATAAC CCTGAATCGT TGGCTTGGGG CATTTTGCAT ACGCTGTCCC ACCCCGAATG GACCCAGCAA CGAGTTGAAA ATGCCTTCAA AACGGTGGTT GAGATTTATA ATTGGCCATT AATTGCCTGC CAGACCCAAG CCGTCTATCA ACGGGTTTGC GACGAGCGTG CAACTAGTAT GTGGGGATGA
|
Protein sequence | MRVLMLAWEY PPHIVGGMGK HIAELVPVLD AAGIEVHVLT PWLRGGPQHE RFGMHSHIWR VQPPAMPDYG FVSFTQETNR YLERFAHDLG KTHGPFDLIH GHDWLTSYCS VALKYAWHTP LITTIHATER GRGRGSLGGD HAKTINGLEW WLAHESWRVI VCSDFMADQL HQFFGTPFDK LDVIANGVNV PTIEWPSQER QQFRQKYAAD NEKVVFSIAR MVYEKGIQVL VEAIPHVLAQ RRDIKFVIAG MGPLAEQLRN RSRELGIDAH VYWTGFVTDQ DRNYLYNVAD VAVFPSIYEP FGIVALEAMA AHCPVIVSDT GGLREVVQIH ETGLTVYPDN PESLAWGILH TLSHPEWTQQ RVENAFKTVV EIYNWPLIAC QTQAVYQRVC DERATSMWG
|
| |