Gene Information |
Locus tag | Haur_3825 |
Symbol | |
ID | 5735689 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4800732 |
End bp | 4803611 |
Gene Length | 2880 bp |
Protein Length | 959 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641280977 |
Product | hypothetical protein |
Protein accession | YP_001546589 |
Protein GI | 159900342 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
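
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the stop codon; the record does not state either convention. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information table.
START_BP = 4800732
END_BP = 4803611
STATED_GENE_LENGTH = 2880      # bp
STATED_PROTEIN_LENGTH = 959    # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon
    is a stop codon that is not counted in the protein length."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


if __name__ == "__main__":
    computed_gene = gene_length(START_BP, END_BP)
    computed_protein = protein_length(computed_gene)
    print("gene length matches:", computed_gene == STATED_GENE_LENGTH)
    print("protein length matches:", computed_protein == STATED_PROTEIN_LENGTH)
```

Under these assumptions, 4803611 - 4800732 + 1 = 2880 bp, and 2880 / 3 - 1 = 959 aa, which agrees with the stated Gene Length and Protein Length.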
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.501815 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCTCTGGT ATAATGGCAC GTTGTTGTCG ATGAAACGAT TTGAGGCTGG CTTGGTACAA GACTGGTGGC AACGCCGCGC TGCATGGTCG CTGCGTCAAT GGATCATTAC AATCGCAAGT GTGTTGCTGA TGGCGCTATG CTTATTAACC GCGCTCTATC AGTTGCCGAC TCATGTGCGG ATCGCCAGCG ACGGCTTGGG TGATCAGCCG TTTTTGGTTT CAAGCGAGGC ACTAGGCCAA GCTGCCACCG ATCTTGGCAC GGTCTTCCCC GATGAACTCG ACGCAACTGG CGGGCGGTTT CGCTGGACAC GCGGCAGCAC CCAAATTCAG ATTCCGGCAC TCGGCGCACA TAGCAGTTAT AGGCTTGAGC TTGATGTAGT TGGCTGGCCC GACGATGTGC TACGCAGCGA TGTACGCCAG CCATTTGTGC ATGTGTTGGT CAATCAACAG CTGATCGCCG TATTTGAGGC CAGCAGCCAA CGCACCATCT ATCAATTAAA TACTCCAGCA CTCAATCAAA GCAACCTTGA GCTTGAATTA CGCCTGAGCC ACGACCCACA GCAGCTTGCC CCCTTGGATA ATACTGCCAC CTTCACTGGC ACGACGCTCT ATCCCAACGA CCGCCGCCCC CATGGTTTGC GCTTGTATGG CCTTAGCGCC ACGACCAACA CTGTAGGCTT TAATTTGCCT GCCTGGAGCT TGATTTGGCG TGTAGCAGTG GCCTTGGGCT TGATCCTCCT GGCTGGTTGG CTCTATCCGC GTGAGCGTTT GCCATTGTTT TTGCTTGGGT TGCTCTTGGT CGTTTTTTGG CTGACCCTGG GCTATCTTGC CCGCCACTGG CTGTTGCCAT CGTTTGAGTT GGTGCTGTTG GGGCTTGGTT TTGTGGTGGT TTGGAATTGG CAACGCCAGA TTATTGATAC CTGGTTGCAT TTCCGCCAGC GCTTGTTGCA AGGCCAAAAT CTTGATTATG GTTTAGCTGT GGCTGCACTG ATTGGCCTGC TGGCTATCAG TTGGCCCAGC CTTAGCCAAG CCGCCAACGA ATGGCTACCC AAAGCCAAAA AGCTCGATCC TGGGGCGATT ATTGTGGTTT CGTTGGCAGT AGCTAGCTTG TTTATTGCAG GCTTCTACTG GGGCGATCTC AATCGGTTGC TCGATCGGCT CAATCGGCGT TTGCGCACCC GCCAAGGCTT GGGCTGGGCC TTGCTAGGCT TAGTGGTTGG GGTTTGGTTA GTCTATAGCT GGGGCGTAAT TCGCCAAATT AACTACCTTG GCAACGCCGA TTACTCCGAT AACGGTGTAG TTGCCCGCAA CTTGGTGGCG GGGCGTGGCT GGGTCGTTGA TTATGTCACC CAATTTTTCA AACTGTATCC CGATGGCAGC GTGACCCGCG TGCAAGAAAC CTGGCCAATG TTGCAGCCAG TCTGGATTGC GCCATTCTTT GCCTTGTTTG GGCCAGAACC GTGGGCCGCC AAAATCCCCA ACCTGATCTT TTTTAGTTTG CTGTCGATCG TGGTCTATCG TATTGCCCGT CAGTTGTGGG ATCAACGGGT TGGTTTGATT GCGGTGCTGT TGGTGCTGAT CAATCGACAT ATGTTCCGCT TGATGATCTA TAGCACCTCG GATTTGGCGT TTGTGCTGTT CTACACAGCG GCGATTTGGT TGCTCTGGCG TAGTTTGGTA ACTCATAGCA AAGAGCGCTT GCTTGGCTCA GGCCTGATCA TCGGCCTCAT GTGTTGGCAA AAAACCAGCG CCGTAATTGT AGCGATTGGC ATGGGCTTGT GGCTGATTTG GCGTTTGTGG 
CAGGTCGAAG ATCGCTGGAA AGCCTATCGC ACTGCTGCTT TGTGGTGGGT ACTCCCAGCG GTTTTGGTGT TCTCACCTTA TATTGCCCGC AACCTGCACG AATTTGGCAA ACCCGCCTTT TCGACCGAAA GTTATGATGC TTGGATCATC GGCTACACAA ATTTCGATTC GATCTACAAT ATCTACACCA ACGAGCAAGG CTTGCCCGGG AGCAACGGCT TACCTGAACC AAGCTGGATT CTGCGTTGGG GCTATCAAAA GACCTTTGAT AAAATTGGTA ACCAATTTGA AGCGACTCGC AACTATTTGT TGCCAGCTAG CCCAGCTTTG GGCATGTTCA GCGGTCGCGG CAATTTGATG GGCAATAACG ATCAGGGCGT TAATGGTCAG CCCTATGTGT GGTTGATGTT GGGTGCGTGG CTAAGCTTAA TTGGTTTGAT TGCTGCTCGC CGCCAACAAG CCAGCTTGAT CGCCTTGGTT GGAGCGAGTT TCACACCATA TATTATCTTC TTGGCGCTCT ATTGGCACGC TGATGAAGAA CGCTATTTTG TGCCATTAGT GCCATTTTTG GCCTTACTGG CGGCTGGGGC GTTGGTGGCA ATTCACGATG CAATTGCTCG TTGCTGGAAC CAGCGTGGTC GTCCGTTGGC CTTGTTGGTA GCGGCTCAAT TGCTGGTCTT GGCGCTCACC CCGGGGTGGG TTGAGGCCGC TAACAAAAGT GCTACCGTCG CAGGCAGCGA GTATGCCGAA TGGCAACCCG ATTTGCAGGC CTTTGAATGG TTGCGCCAAA ACACACCACC AGAGGCGGTG GTTATGACCC GCGTGCCATG GCAACTCAAT TTTCATGCTG AACGCGCTGC TGTGATGAAC CCAAATGTGG CCGATTTAGC CGTGATTAAA CAGGTTGCTG ACTATTACAA GGCCTCGTTT ATTCTGGTCA ATGCTGTGCA AAACAACAAA GATCAAGCTC AAATTGGCTT AGGCCGCTTG CTCAAAGGCG AGGAGCTACC TGGGTTTGTG CTACGAGCCA GCTTTGCGGG GCCGAAAAAC CGCACAGTCT ATATCTACGA AATTCAATAA
Protein sequence | MLWYNGTLLS MKRFEAGLVQ DWWQRRAAWS LRQWIITIAS VLLMALCLLT ALYQLPTHVR IASDGLGDQP FLVSSEALGQ AATDLGTVFP DELDATGGRF RWTRGSTQIQ IPALGAHSSY RLELDVVGWP DDVLRSDVRQ PFVHVLVNQQ LIAVFEASSQ RTIYQLNTPA LNQSNLELEL RLSHDPQQLA PLDNTATFTG TTLYPNDRRP HGLRLYGLSA TTNTVGFNLP AWSLIWRVAV ALGLILLAGW LYPRERLPLF LLGLLLVVFW LTLGYLARHW LLPSFELVLL GLGFVVVWNW QRQIIDTWLH FRQRLLQGQN LDYGLAVAAL IGLLAISWPS LSQAANEWLP KAKKLDPGAI IVVSLAVASL FIAGFYWGDL NRLLDRLNRR LRTRQGLGWA LLGLVVGVWL VYSWGVIRQI NYLGNADYSD NGVVARNLVA GRGWVVDYVT QFFKLYPDGS VTRVQETWPM LQPVWIAPFF ALFGPEPWAA KIPNLIFFSL LSIVVYRIAR QLWDQRVGLI AVLLVLINRH MFRLMIYSTS DLAFVLFYTA AIWLLWRSLV THSKERLLGS GLIIGLMCWQ KTSAVIVAIG MGLWLIWRLW QVEDRWKAYR TAALWWVLPA VLVFSPYIAR NLHEFGKPAF STESYDAWII GYTNFDSIYN IYTNEQGLPG SNGLPEPSWI LRWGYQKTFD KIGNQFEATR NYLLPASPAL GMFSGRGNLM GNNDQGVNGQ PYVWLMLGAW LSLIGLIAAR RQQASLIALV GASFTPYIIF LALYWHADEE RYFVPLVPFL ALLAAGALVA IHDAIARCWN QRGRPLALLV AAQLLVLALT PGWVEAANKS ATVAGSEYAE WQPDLQAFEW LRQNTPPEAV VMTRVPWQLN FHAERAAVMN PNVADLAVIK QVADYYKASF ILVNAVQNNK DQAQIGLGRL LKGEELPGFV LRASFAGPKN RTVYIYEIQ
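
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the sequence is given on the coding strand in frame 1, ignores the 10-base spacing used in the display, and stops at the first stop codon. The codon-to-amino-acid assignments below are those of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons, which this sketch does not model). The `translate` helper is hypothetical, written for this example rather than taken from any library.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"   # codons starting with T
    "LLLLPPPPHHQQRRRR"   # codons starting with C
    "IIIMTTTTNNKKSSRR"   # codons starting with A
    "VVVVAAAADDEEGGGG"   # codons starting with G
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string in frame 1.

    Whitespace is removed first, translation stops at the first stop
    codon, and a trailing partial codon is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence shown above.
    print(translate("ATGCTCTGGT ATAATGGCAC GTTGTTGTCG"))
```

Applied to the first 30 bases of the gene sequence, this yields MLWYNGTLLS, the first block of the protein sequence listed above. The final nine bases (ATTCAATAA) translate to IQ followed by a TAA stop codon, consistent with the protein ending in ...YEIQ.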