Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2596 |
Symbol | |
ID | 5734474 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3331231 |
End bp | 3332178 |
Gene Length | 948 bp |
Protein Length | 315 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641279736 |
Product | glycosyl transferase family protein |
Protein accession | YP_001545362 |
Protein GI | 159899115 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00437692 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCATC ACATCGATAT TCTGATTCCA AATTATAACG GCGCTAGCTT ATTGGCTGCT TGCCTTGAGA GCTTGCGCCA GCAAACCCGC CGCGATTTTC TAATTACAGT GATTGATGAT GCCTCGCCTG ATGGGAGCGT TCCCGCTTTA CAGGCCGCCT ATCCTGAGGT CAACTGGCTG ATCCAGCCTG AAAATCAAGG CTTTGTGGCG GCAGTTAATC GTGGTTTTCA GGCAACCAGT GCGCCTTGGG TCATCTTGCT TAATAATGAT ACTGAAGTTG AGCCAAGCTT TGTTGCCGCA TTAATTGGTA CGCTTGAGCG TTTTCCAGCT TATGATTTTG CTGCCGCCAA AATGCTGCTC TACAGCCAAC CTGATCATTT GCACACTACT GGCGATGGCT ACAATTGGGA TGGTTTGCCA TGGAGTCGCG GGGTTTGGCA AGTTGATCGT GGTCAGTATG ATGCAATCAG CGAGGTTTTT GGGCCATGTG CTGGGGCAGC GGCCTATAAA CGCGCTAGTT TGCAACAACT TGTCAATCAC CATGGCCAAT TGCTTGATCC ATTGCTCGTG ATGTATTGCG AAGATGTTGA TCTCAATTTG CGGGCGCGGC GAGCTGGAAT GCGCACCCTC TTTGTGCCTC AAGCGCGAGT GCTGCATCAT TTAAGTGCGA CTGGTGGCGG GGTACGCGCC AGCTATTATT GTGGGCGGAA TTTTATTGTG CTGTGGTTGC GCCATATGCC ACTGCAAGCG TGGCCGTATG CCTTGCCAGC TTTTTTGTGG TCGCAACTGA CGATTTTTGG TCAAGCGCTG CGCCATTGGC GTGGCGAAGC TGCCCGTGCT CGCTTACGTG GTCAGTGGGC TGGCTTGCGG CTCATTCCTC AAATTTGGCG TGAACGAACG TTAGCCACAG CAGAAGCACA GCGGCTGCTC GCTTGGCTTG GACGATAG
|
Protein sequence | MSHHIDILIP NYNGASLLAA CLESLRQQTR RDFLITVIDD ASPDGSVPAL QAAYPEVNWL IQPENQGFVA AVNRGFQATS APWVILLNND TEVEPSFVAA LIGTLERFPA YDFAAAKMLL YSQPDHLHTT GDGYNWDGLP WSRGVWQVDR GQYDAISEVF GPCAGAAAYK RASLQQLVNH HGQLLDPLLV MYCEDVDLNL RARRAGMRTL FVPQARVLHH LSATGGGVRA SYYCGRNFIV LWLRHMPLQA WPYALPAFLW SQLTIFGQAL RHWRGEAARA RLRGQWAGLR LIPQIWRERT LATAEAQRLL AWLGR
|
| |