Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1122 |
Symbol | |
ID | 5733014 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1284694 |
End bp | 1286736 |
Gene Length | 2043 bp |
Protein Length | 680 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641278261 |
Product | hypothetical protein |
Protein accession | YP_001543898 |
Protein GI | 159897651 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.180103 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTACGAG GTGTCTTCCC AAGCCAGCGC CTAGGCTGGC TTTGGCTTTT AGGCTTGCTT GGAACAGCCC TAGCGCTCAT GCTCAAGGTT CAAGCCCAAC TTGAGTTCCG CCATTTCAAT GGGGCAGTTA TGCTTGGGGC AGCGGTTGCG CTCGGCATTT GGTGGTGGCT GATCAAGCGC CAACCGCCAC AAGCTGATAT CTTGCCTGAA CTCCATGATC AGCCGTTAGA TCACCTAACG TTGCGGCTTG GCATAAGCGC TGTATCCTTG CTTAGCGGTT TCGTCGCATG GAATAATCTC TTTGGCAATG AATTTAACTC ACTCTCGACC TGGACATGGC TGTTTGCGAT TGGCTTATGG CTGTTGGCAT GGATGCCCTG GCAACGCCCA CGCTTGCCCA AAGCTGATCG AGCCGAAACC CAACGGGCAT GGCTGACCTT CGGCTGTTTG CTGATTGTCA CCGCTTTTGG TTTATGGATT CGGCTGTATC GGCTGGATCA AATGCCATAT GACATGACGG TTGACCACGG CTGGAAGATG GAAGATGTTT ACACCATTTT GCAGGGTGGT CGCCCATTAT TTTTGCCCAA TAACACAGGT CGCGAGCCAG GCCAATTCTA TTACATTGCC ATGTTGATTC GCTTTTTTGG CGTGCCGTTT GGTTTTATCG CGCTCAAACT TGGCAATGTG ATTATCGGCA CGCTAACAAT TCCGTTTATC TATTTGTTTG CGCGTGAGCT GGGCGGTCGC AAGTTGGGCA TTTTGAGTGC CGCCTTGTAT GCGCTAGGCA AATGGCCGCT CGAAACCACT CGCATGGGGT TGCGCTTTCC TTATGCCACC TTGCCCGCCG CGCTGGTGTT GTGGTCACTC TGGCGCTATG TGCGCTTGGG CAAACGCAGC GATGCCTTGC TGGTCGGGTT ATGGATGGGC ATGGGTTTAT ATGGCTATAT CGGCGTTCGA GCAGTGCCCT TCGTCATTGC CGCCGTATTT GGCCTGATGC TGTTTGAGCG ACGACGACGC AACCCCAAAG GCTGGCTCAA ATTGCTCGGC CATGGCAGCC TGACGTTAAT CACAACCGCC TTGATTTTCT TGCCGCTCGG CCATTTTATG CTCGATTACC CCGATGTTTT TTGGTTTCGC GTCAGCACCC GCACCAGCAA CCATACCGAC GATATTAGCC GCGAATTTTG CCAAACTACC AGCAGCGAAC GTGAATGCGA TATTAAAAAA TTCGTTGCCA ATAATGTTAA TTTAGCGGTC GCGTTTAACT GGCGTGGCGA TCGCAACGAA GTTAATAATG TGCGTTTCGA TCCATTGCTT GATGTTGTCA GCGCGGCTTT GCTCTTGTTA AGTTTGCCAA TCGTCGTGTG GCGGTTGTTA GTTGAACGTT CATGGCGTTG GTGGATGCTG GTCGTCGCAT TGCCATTACT TGGTTTAGCC ACAACGTTGA GCTTAGCATA CCCAATCGAA AACCCCAGCG CAGCGCGGAC TGGCGTGCTG ATGCCAGTGA TTTTTACCAT GGCCGCTGCA CCGCTAGCCT TGGCGCTTGA ATGGCTAACC AAAGGCCAGC CGTTTGAGCA ATGGTGGCGC GGCAAATCAG CCTTGGGTGG ATTGCTGAGC ATTGGCTTAA CAATCTGGCT ACTGGCATGG GCTGGACGCG AAAATTTCCA GCGCTACTTT GTTGATATGG CCCGCCAATA CACCGGTTTT ATCCCGAATA ATCGTGAGGT TGCCGATGCG ATTCGCTACT ATCGCGATGC TCAAGGCGTA CCCTACGAAA ACGCCTACCT CATGCTCAAC AGCTATTTCT GGAAGGAATC GCGCAATATC AGCGTGCATC TAAATGATAT GCAATGGTAT GTCAACAACA CGATTAAGCC CGAAATGGCG CTGGTTGTGC CTGGCAATCG CCCATTAATT TACATTCTCA ATCCTGACGA TCAAGCCCAT ATTGATCAAT TGCAACAGGA ATATCCTAAA GGCGAGTTGC GCCGCATCAG CAGCGCCGTT GGCAAGGATT TTCTGGTGTT TCATTTACGC TAA
|
Protein sequence | MLRGVFPSQR LGWLWLLGLL GTALALMLKV QAQLEFRHFN GAVMLGAAVA LGIWWWLIKR QPPQADILPE LHDQPLDHLT LRLGISAVSL LSGFVAWNNL FGNEFNSLST WTWLFAIGLW LLAWMPWQRP RLPKADRAET QRAWLTFGCL LIVTAFGLWI RLYRLDQMPY DMTVDHGWKM EDVYTILQGG RPLFLPNNTG REPGQFYYIA MLIRFFGVPF GFIALKLGNV IIGTLTIPFI YLFARELGGR KLGILSAALY ALGKWPLETT RMGLRFPYAT LPAALVLWSL WRYVRLGKRS DALLVGLWMG MGLYGYIGVR AVPFVIAAVF GLMLFERRRR NPKGWLKLLG HGSLTLITTA LIFLPLGHFM LDYPDVFWFR VSTRTSNHTD DISREFCQTT SSERECDIKK FVANNVNLAV AFNWRGDRNE VNNVRFDPLL DVVSAALLLL SLPIVVWRLL VERSWRWWML VVALPLLGLA TTLSLAYPIE NPSAARTGVL MPVIFTMAAA PLALALEWLT KGQPFEQWWR GKSALGGLLS IGLTIWLLAW AGRENFQRYF VDMARQYTGF IPNNREVADA IRYYRDAQGV PYENAYLMLN SYFWKESRNI SVHLNDMQWY VNNTIKPEMA LVVPGNRPLI YILNPDDQAH IDQLQQEYPK GELRRISSAV GKDFLVFHLR
|
| |