Gene Information |
Locus tag | Haur_3837 |
Symbol | |
ID | 5735702 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4817267 |
End bp | 4818988 |
Gene Length | 1722 bp |
Protein Length | 573 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641280990 |
Product | hypothetical protein |
Protein accession | YP_001546601 |
Protein GI | 159900354 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1928] Dolichyl-phosphate-mannose--protein O-mannosyl transferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00126629 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTGCAT TACAACGCTT CCGCTATCAT TGGCCTTTGC TGCTCCTCCT GCTGGTGTAT TGCACGCTTG CGGTCTGGTG GAGTACCCGC ATTCCCTTGG GCGAAGGCCC AGATGAGCCA GGCCATGCCG AATATGCCCT ATTTGTGGCC CGCAATAATC GCCTGCCCGA TCAGCGTTTA GGCGATGTGC CAGGCGAAGG CCATCAGCCA CCACTGGCCT ATTGGTTGAT GCAACCATTA GCTCGCTCGG TTGCCGTCGA AGATCGGGTA GTGATTCTAG CGAATAATCA TCGCTGGATT TGGGCTGGGG GCAAGGAGCC AGCCGCCTTT GCTTGGCGCT CAATCGATCG ACCACCCTAC CAAGCCGATG TGCTGGTATG GCATCGTTTG CGTTGGTTTT CAATTGGCTG TGCCGCCCTG ACGATTGTTG TGGGTTATGC AATCGGTTTA CGTTTGGGCT TGAGTAAGAC CTTAGCAACC TTCGCCGCCA GTTTGTTGGC ATTTTGGCCG CAATTGCTAT TTCACTCAGT TTTAGTCTCG AATGATCCAT TGCTATGGCT GTTGTGTGCT GGCTTGTTAT GGCTGTTGCT TGAGCCAAAT CCGCAAGCCT GGTGGCCATG GGCGGTTGGG GCAACCCTAG GGGCAGCCTT ACTAACCAAA CAAAGTGCGG TGCTGCTAAT CCCTTTGATT GGGCTACGGC TTTGGCAATT GCGCCGCTCG ATCAACCTCT GGCCTGCGCT TGGTTGGCTG AGCCTAAGTA GTACAGCAAT AGCAGGTTGG TGGTATGTGC GCAATTTGCT GCTGTATGGC GACCCATTTG GTTTGGGCTT GTACAAAGGC GAATTTAGCA GCCCAACCTT TTCACCGTGG AGCTTGCACG ATTGGGGCGA GGCCATGCAT GCCTTGGGCA GTTCGCTGAT TGCCCGCTTT GGCTGGATGA GCGTGGCCGC GCCGCAATGG CTGTATTGGT TGTTAGCGAT TTGGCTTGGC CTGGCAATTT TAGGCATTCG ACGCTTGCGC ATCGATCAAC GCTGGCAAAT CGCTTGGGCC TTGGTTGGCT TGGCCTTTGC TTGGACTTTC GCTTTTGCAG TATTGAGTGG CCAAGTTGGC TGGCAAGCTC GTTTCTTATT TCCGGCGGCT CCGGTGGCGG CGTGTGCCTT GGCGATTGGG CTTGAGCGCA GCATCAAAAC GGCGGCTTGG CTCTTACCAG CGTTATTAGT AACAATGGCG GTGGGCTTGC CAAATACCGT GATCAGCGCG GCCTACCCCT ATTTGGTGGT TGAGCCACAG CCGGAGCGGC CAGTTTTGGC CTTGTGGCGA CCCGATACTG CTAATCCAAT TGAATTACGC GGCATTTTGC TGCCCCCAAC CGCCAAAGCA GGCCAACCAT GGCTGGTCAA CACGTTGTGG CGAGCGGTTG GCAAACAAGA TCGCCAATGG TCGTTATTTG TGCATTTGGC CGAGGTTGGC AGCAAAGATG AGAGTTTTGC CTTCGATGTG CAGCCGCAAG ATGGGGTTTG GCATTCGCTA CGCTGGACAC CCGACGATTG GTGGGAAGAT CTACTGACAA TCAATATTCC CAGCGATTTG GCTGCTGGCG AGTATGAAGT GCGAATCGGC TGGTTTGATG AATTTGGCGA TTGGAGCCGC GCCGGAGTTT GGAGCCAAGA CGGCATCCTG CTTGGTGATT ATGCAGTCGT TGGCAAAGTG GTGGTGGAGT AA
|
Protein sequence | MPALQRFRYH WPLLLLLLVY CTLAVWWSTR IPLGEGPDEP GHAEYALFVA RNNRLPDQRL GDVPGEGHQP PLAYWLMQPL ARSVAVEDRV VILANNHRWI WAGGKEPAAF AWRSIDRPPY QADVLVWHRL RWFSIGCAAL TIVVGYAIGL RLGLSKTLAT FAASLLAFWP QLLFHSVLVS NDPLLWLLCA GLLWLLLEPN PQAWWPWAVG ATLGAALLTK QSAVLLIPLI GLRLWQLRRS INLWPALGWL SLSSTAIAGW WYVRNLLLYG DPFGLGLYKG EFSSPTFSPW SLHDWGEAMH ALGSSLIARF GWMSVAAPQW LYWLLAIWLG LAILGIRRLR IDQRWQIAWA LVGLAFAWTF AFAVLSGQVG WQARFLFPAA PVAACALAIG LERSIKTAAW LLPALLVTMA VGLPNTVISA AYPYLVVEPQ PERPVLALWR PDTANPIELR GILLPPTAKA GQPWLVNTLW RAVGKQDRQW SLFVHLAEVG SKDESFAFDV QPQDGVWHSL RWTPDDWWED LLTINIPSDL AAGEYEVRIG WFDEFGDWSR AGVWSQDGIL LGDYAVVGKV VVE
|
| |
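The length fields in this record can be cross-checked against the coordinates. The sketch below is a minimal illustration, not part of the source database: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS is unspliced and ends in a stop codon (the gene sequence above ends in TAA).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


def gc_fraction(seq: str) -> float:
    """GC fraction of a sequence; whitespace between 10-base blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    length = gene_length(4817267, 4818988)
    print(length)                  # 1722, matching "Gene Length"
    print(protein_length(length))  # 573, matching "Protein Length"
```

Passing the full Gene sequence field to gc_fraction gives a value that can be compared with the reported GC content; the function strips the spaces between blocks, so the field can be pasted in as shown.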