Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4276 |
Symbol | |
ID | 5736135 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5458769 |
End bp | 5460976 |
Gene Length | 2208 bp |
Protein Length | 735 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641281436 |
Product | hypothetical protein |
Protein accession | YP_001547036 |
Protein GI | 159900789 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00531849 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGCGAT TTAAACGATT TCTACCATTG GCGCTAATTG CCTTGAGCGC CATTATTTTT CTCATTCCGC CTGCGCGGCC TAGCATCGAT GTTGTGATGG GCAGCGAAGG CGAATTTCTG GTTGAAGGCC AAACTCAAAA CTTCCGAAAT TTTTATGGAT TTAAGGAAGT TGAGCGGACA GAGCAAGGTG TATTTCGCTG GACAAGCGGA CGTGGCAGCA TTGCGTTTCC TTATTGGGAG AATGTTGCCG CCCCCTTACA GCTTAATCTA TTGCTCTGTG GCTGTCGCAC CGATGGTCAA ACCATGCCCT TAACAGTTAC GGTCAATCAG CAAGCGTTGC TCAAACTTAA TCTGAGCAAT CAATGGCAAT GGTATCATGT GCAACTTCCC GCTGGTTTAA CCCACCCTGA TCATGATATT TTGCTTGACT TGACCTCGGC AGAATGGCGT AGCCCTGATC AACGTACCTT GGGCATTGTG GTGCAACGCA TCCAAATTGA GGCGTTGGCA GCCCAACCAA CCCAATCGTT CGTCGCCTGG CTAGGCTTGG CACTTGGCTT AGGGTTAATG GTGTTGTGGC AGATTCCAGT GGGCTTTGCA GGAATTGTGT TTGGGCAGTG GCTGCTTGGG GTTTATGGCT ACCAGCCCCA GTTTTTGCCA AAGGATTTTC TTCTGATCAG TTTGGTTTTT GCCAACGTGA TTGGTTGGCA ATGCATGCGC AATCTGCCGC AGCTATGGCG CTGGGTAAGC TTGCCACTCA GTAGCTTTTG GCTGATTAGC AGCCCTCAAA TCCTTGGCTC GTGGATTTTA GATGATGCGT TTATCTCATT TCGCTATGCT GCCAATTTGG CCAATGGTCA GGGGTTGGTG TTCAACCTTG ACGAGCGGGT CGAAGGCTTT ACCAATTTCC TTTGGACCTT ATTAAGTAGC TTCAGCATTA AACTTGGGGC CGACCCAATT TTAGTCACCC ATGCAACCAA CCTGCTTTTG GGCATGGTGA TTGTGGTCTT AAGTTTACAA TTGGCCCATC GCTTGCATCA AAGTAGCTGG ATGACCCTAG CGATACCCTT GATGTTGCTC AATTTGCCGT TATTGCTCTA CACAAGTTTG GGTAGCGGCA TGGAAACCGC ACTCTTTTGC GCGGGTATTT TGGCAACCGT GCTACTTGCG ATCAATCAGG TCTGGGGTTA TGCAGCGATC GCCACCACGC TCACGATTAT GACTCGGCCT GATGGCATGC TGTTGGCTGG GATTATTGGC TTACTGGCAA TTTGGCAAAG TTGGCAAACC AAGCACTGGC AGCCGCTAAT ACGCTACGCC GGAATCTTGA GCCTGACCTT TGTACCCTAC TGGCTGGCGC GTTGGTGGTA TTATGGCTAT CCACTGCCCA ACACGTTTTA TGCCAAAGTT GGTGGCACAG CCAAACAAGC CGAACGAGGG ATTAATTATT TTATTGATTT TAACCGTAGC TACTACTTAG GTTGGCTTGG TTTAGGGCTG GGAGCACTAG CAAGTGGGCT GCGTTGGTAT CAAACTAAAA AAGTACCCAT CCTTGCCATC ATCACATGGG CAATCGTTGG CCTCTACACC AGCTACACGA TTAGCGTGGG CGGCGATTGG ATGCCAGGTT ATCGTTTTAT GGTCGCAACT GTGCCATGGT TTGCGCTTTT GGCAAGCTGG GGCATCAGCG AACTTTGGCA CTATCGGCGT TGGCTGGGCG CTAGTGCAAT CGTAGCCAGC AGCGCAATCC TGTTGCTCCT GCTCCAGCCA CTTCAGCAAG AACGCCCATT AACAGTTGGT TCGGCGGCTT GGTACGAGAC CGATGTGGTC AATCGCTATC GTGAAGTTGG GCTATGGATC AAAGCCAGCA CACCGCCCGA AACCACGCTA ACGGTTACCG CTGCCGGAGC CATACCCTAT TATGCAGAAC GCACAACGAT TGATGCTCAT GGCTTGACCG ATCTGCATAT TGCGCATCTG CCAATTGATC CGAGCAAGGC GGGCAAACCT GGCCACGAAA AGCAAGACCC CGATTATGTG CTCCGTGATC GAAAACCAAG CTTAATTCCA TGGGTTGCTG CACCAATGTT TACTAGCCAT CCATTATTTG ATGCTAACTA TCGGCTGATC GAAGCCAGCG GCGTTGAAGG CCGAGGCATT CGGATGTTTG TCCGCCGTGA TAGTGGCCTA TTTGAGCAAG CACAATGA
|
Protein sequence | MQRFKRFLPL ALIALSAIIF LIPPARPSID VVMGSEGEFL VEGQTQNFRN FYGFKEVERT EQGVFRWTSG RGSIAFPYWE NVAAPLQLNL LLCGCRTDGQ TMPLTVTVNQ QALLKLNLSN QWQWYHVQLP AGLTHPDHDI LLDLTSAEWR SPDQRTLGIV VQRIQIEALA AQPTQSFVAW LGLALGLGLM VLWQIPVGFA GIVFGQWLLG VYGYQPQFLP KDFLLISLVF ANVIGWQCMR NLPQLWRWVS LPLSSFWLIS SPQILGSWIL DDAFISFRYA ANLANGQGLV FNLDERVEGF TNFLWTLLSS FSIKLGADPI LVTHATNLLL GMVIVVLSLQ LAHRLHQSSW MTLAIPLMLL NLPLLLYTSL GSGMETALFC AGILATVLLA INQVWGYAAI ATTLTIMTRP DGMLLAGIIG LLAIWQSWQT KHWQPLIRYA GILSLTFVPY WLARWWYYGY PLPNTFYAKV GGTAKQAERG INYFIDFNRS YYLGWLGLGL GALASGLRWY QTKKVPILAI ITWAIVGLYT SYTISVGGDW MPGYRFMVAT VPWFALLASW GISELWHYRR WLGASAIVAS SAILLLLLQP LQQERPLTVG SAAWYETDVV NRYREVGLWI KASTPPETTL TVTAAGAIPY YAERTTIDAH GLTDLHIAHL PIDPSKAGKP GHEKQDPDYV LRDRKPSLIP WVAAPMFTSH PLFDANYRLI EASGVEGRGI RMFVRRDSGL FEQAQ
|
| |