Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3709 |
Symbol | |
ID | 5735573 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4664340 |
End bp | 4665710 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641280861 |
Product | hypothetical protein |
Protein accession | YP_001546473 |
Protein GI | 159900226 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0516593 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCACAA TTATGTGGAT TAAGAATCAT CGTTGGCTGA GTTCGTTATT GGTAGCCGTG GTTTGCCTTG GTTTAGGCGG GCTGGCAGGT TTGATGCCGT TGCGCTATTT GGCAGGCGTA GCCTTTTGCC TTGGCCTAAC CGCAATTGGT TATGCTTGGG TTTGGCTAAT TAGCAAGCCC GACGAGTTGC GGGGTATGCT GCTGCTCTGT TTTGCAGCGA TGGGCCTGCG TTGGGGAGCC AGCTTTGCGC TCGAATTTCT CTGGCCCAGT TTTGAATCGC TGAGTGATGG CGCAGCGTAT GGCCCGCATG CCATGACGAT TGCCCAAGCT TGGAACGCTA ATTATTTCGC TAGTTACGAG GCGGTGGTTT CAACCCCGGT TGGTGCGCCG GGCTATGTTT ATTTTTCGGC AGTAATTTTT TGGTTATTTG GGCCAAATAC CTTGCTGGTA AAATTGGCTA ATGGCTTGTT TGCTGGCATG GCGGCGGTGT ACACGGCCAA ATTAGGCAAC CATTTTTTCG ATCAACGGGT TGGGCGGTTT GCCGCGTTGT GGATGTTAAT TATGCCATCG CTGATTTTGT GGACTTCGCA AAATCTCAAA GATAGCGCGG TGGTGCTGCT CTCAGTCTGG ATTTTGTATG TAGCCAGTCA AGGTTTGCGC TCGTCGTTAT GGCAAATTCC ATTGTTGGTG CTGCTGATTG GGGCGTTGAT GAGCGTGCGG CGCGAAACCT CAATTGGCAT TGCTTTGATG ATTGCCTTGA CAATTGGCTT TCAGCAAACC CGTCATTGGC TAACCCGCCT GAGCTTGAGC GCGATCACGA TTGTGGCCTT GGGCTTGGTG CTTTCGAGCA GCGGCTATGG CTTTTTGGGC AGCGATTATC TGCAAGAGCG GCTTTCGCTC AGCGCAATTA GCGAAAAACG CGAGGCCAAC TCAACCGGCA CAGGCACAAT TGAAAATACG ATTGATACGA CCACGCCGCT GGGTTTTGCC CGCTACTTGC CAATTGCCTT GATTAATTTC TGGTTGCGAC CGTGGCCGTG GGAAGCTACC AAGAGCACCG CCCAATTGCT GACCATTCCC GAAGCGGCGC TGTTGTGGTA TCCGTTGTGG GTTTTGGCGA TGATTGGCAT GATCTTGGCG TGGCGTAGCC GTTGGCGCGA AACAATGTTG CTCTGGCTCT ATCTGCTGGC TGGCAGTGCG GCGGCGGCTC CGCAGTATGG CAATTTTGGC ACGGCCTATC GCCATCGGGT GCAGCTGTGG CCAATTTTCT TTTTGTTTGC AGGCTATTGT TGGTATCGTT GGCGCGATGC TAGAGTTGAG CAACGTCAGG CGTTGTTGAC CCGCTATGTG CAGAGTATCA AGCAAATTTA G
|
Protein sequence | MRTIMWIKNH RWLSSLLVAV VCLGLGGLAG LMPLRYLAGV AFCLGLTAIG YAWVWLISKP DELRGMLLLC FAAMGLRWGA SFALEFLWPS FESLSDGAAY GPHAMTIAQA WNANYFASYE AVVSTPVGAP GYVYFSAVIF WLFGPNTLLV KLANGLFAGM AAVYTAKLGN HFFDQRVGRF AALWMLIMPS LILWTSQNLK DSAVVLLSVW ILYVASQGLR SSLWQIPLLV LLIGALMSVR RETSIGIALM IALTIGFQQT RHWLTRLSLS AITIVALGLV LSSSGYGFLG SDYLQERLSL SAISEKREAN STGTGTIENT IDTTTPLGFA RYLPIALINF WLRPWPWEAT KSTAQLLTIP EAALLWYPLW VLAMIGMILA WRSRWRETML LWLYLLAGSA AAAPQYGNFG TAYRHRVQLW PIFFLFAGYC WYRWRDARVE QRQALLTRYV QSIKQI
|
| |