Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2372 |
Symbol | |
ID | 5734253 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3021359 |
End bp | 3023887 |
Gene Length | 2529 bp |
Protein Length | 842 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641279513 |
Product | cellulose-binding family II protein |
Protein accession | YP_001545140 |
Protein GI | 159898893 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTACGAC GAATGTATGC TCGTGGAACA TTGTTGGTAT TGTTGATCGC CTTGATCAGC CTTAGCGTTG CACATCGCAC GGCTACGCCC AGCGCGGCAG CGGCCAGTTG CGTGGTCAGC TATAAGGTGA TCAATCAATG GGCCGATGGG TTTATTGGCG ATGTGACCGT TACCAATAAT TTGGCAGCAA CCACAACTTG GCAGTTAAGC TGGACGTTTG CTGGCAATCA GCGGATTGTT AATCTGTGGA ATGGCGTTTT GACCCAAACT AACGCCGCCG TCAGTGTGCA AAATGCCGCT TGGAATGGCT CGATCACCAG CGGTGGCAGC GTTAATGTTG GCTTTCAAGC AACATTCAGT GGCACGAATA GTATTCCCAC GAGTTTTACC CTGAATGGGG TGGTTTGTGG CGAAACTAAC GTAACGATCA CACCGCCAAC CAGTAACCCA ACCAACACGG CGACGGCGCG GCCCCCAACT AACACCCCAA TGCCCCCAAC TGGCACTCCG CGTCCATCGA ATACCCCAAC GATTGCTCCA ACTACCCCAA CCGCAACGCC CAACAACAAT GGCTGTATTG GCGCAATTAT CTGCGACGAT TTTGAAAACC AAACCGGCAT TGATCCCAGT GGATTTTGGC AAGCGCTCTA TCCTGATTGT ACTGGCCATG GCTCAGTGAT GGTCGATAAC ACCTATGCCA ATAGCGGCAG CAAATCGATC ATGGTTCATG GGCATAGTAA TTTCTGTGAT CATGCCTTTT TCGGCAATAC CACGGCGATC GAAAGCATTG GCAATCTGTT GTATGGCCGC TTTTATGTGC GCCAAGATTT AGCTTTGCCC GACCATCACA TCACATTAAT GGCGCTCAAA GATCGCAACG ACGCTGATAA TGATTTGCGC TTGGGCGGCC AGCAAGGGGT GCTCGATTGG AATCGCGAAT CGGATGATGC AACCGCCCCA TCGATGAGCA GCGCAGGCAT CGCCAAATCG GTCACGATTC CACCTCGTCA GTGGACATGC GTCGAATTCA AGATCGATAG TGCCAATGGC TATTTGCAGG CCTGGGTCAA CGGCAGCGAA GTTGAAGGCA TGCGGGTTGA TGGGGTCGAT ACGCCTGATA TTGACCAAGC TTGGCGTACT CGTGCCAATT GGCAACCACG CTTAGTTGAT ATTCGGTTGG GTTGGGAAAG CTACGGCGGC GATGATATTA CCCATAATAT TTGGTTTGAT GATGTGGCCC TGGCAACTCA ACGTATCGGC TGTAGCGCTA GCAACCCAAC TACGCCGACC GCTACTCCGC GCCCAACCAA CACCACCACG CCCGTGGTTG GCACGCTCAC GCCTAGCCCA ATTCCCAGTG CAACGCCAAC CATCGCGCCC AATGGCATTA CTGCTGCTAC CACGATTGCC GAGCTTTGCC CAACCTACCG CCAACTCTAT ATCGATCGTG ATTTTCTCGA TTATTTGCCG AGCGACGGCA GTGGCGGCCA TAACAACATG CCAGTTGATG GCGGCATGAC TGATGCCCAA AAAGCTAGTT TATACGGCGT GAATATCAGC GCCATTCAAA GCAAAATTGG CAATGGCACA TTAACTTTGG GCGAATTGGG AACCCAAGCC TTGGGCCATG TGCAACGCTT AGTCAATCAA AATTTCCCCA AAAATGCGAT TTGTCAGTTG TTGCCACGCT TGATGCTGCT AGGGCCGGAG ACCGAAACCG CTACCTTCCA CAAAAATAGC AGCAATCCAT GGGCTGAAAC CGCTGGCCCC GTCAATGCGA TTGCCCCCGC TGGATTTATG CAAACCCGCT GGCCAACTGA TGCTCGTACC TATGTGCCAG CCGAAAAAGC CGAACGCGAC CGTTGCCACG ATCAGCCAGT GCATGAAAAT AATCTTGGCT GGACATTTAG CTCGATCGTT GATCCGAGTA TTTTGTATGA TCCAAATAAT CCAGTGTTGA ACGCAATTCG CACGAGTACC CACCCAATCA GTGGCTTGCC GATGGGGCCA GGCTTTAGCA GTAATGCGCC GATGCAGGCA ACGACCAAAC TGCACGAGGA AAATTCGGGT TTCTGGTATC AGGTGATTCA GTTCAAAAAT ACCAGTACTA TTCCCTATTA CCTCGATTGT GCCATGATTT GGTGGGTCGG GCCATCGGGC TTATCGTTTG ATTTACGTAA TGGTCATTAT AATAATGAAC AACGCCCTGG CCGTGGCTAT GGTCACCCAC AGCGCGATAT CATCGAAGTG GTGTACAATC AAACACAAAA GCTCTCGGTG TATAGCATTC GTTTGTCGTT CCACGATGAG CCGTACAACA TGCGCACGGC CTACCCCAAC CAATATTGGT CGTTGGAAGT TGGCACGCCC GCCTTTTTGA ATGGACAGGC GCGTTATACC ACCTCAGCCG AACGCCAAGC CTTGATGGAT TTGATGCTCA ACACATTGCA TGTCGAGCTT GAAACCAACC TTGATCGCAA TATTGAGCTA TTCGATGCCC TGAAGATGCG TAATCGGGTT TCGAATTAG
|
Protein sequence | MLRRMYARGT LLVLLIALIS LSVAHRTATP SAAAASCVVS YKVINQWADG FIGDVTVTNN LAATTTWQLS WTFAGNQRIV NLWNGVLTQT NAAVSVQNAA WNGSITSGGS VNVGFQATFS GTNSIPTSFT LNGVVCGETN VTITPPTSNP TNTATARPPT NTPMPPTGTP RPSNTPTIAP TTPTATPNNN GCIGAIICDD FENQTGIDPS GFWQALYPDC TGHGSVMVDN TYANSGSKSI MVHGHSNFCD HAFFGNTTAI ESIGNLLYGR FYVRQDLALP DHHITLMALK DRNDADNDLR LGGQQGVLDW NRESDDATAP SMSSAGIAKS VTIPPRQWTC VEFKIDSANG YLQAWVNGSE VEGMRVDGVD TPDIDQAWRT RANWQPRLVD IRLGWESYGG DDITHNIWFD DVALATQRIG CSASNPTTPT ATPRPTNTTT PVVGTLTPSP IPSATPTIAP NGITAATTIA ELCPTYRQLY IDRDFLDYLP SDGSGGHNNM PVDGGMTDAQ KASLYGVNIS AIQSKIGNGT LTLGELGTQA LGHVQRLVNQ NFPKNAICQL LPRLMLLGPE TETATFHKNS SNPWAETAGP VNAIAPAGFM QTRWPTDART YVPAEKAERD RCHDQPVHEN NLGWTFSSIV DPSILYDPNN PVLNAIRTST HPISGLPMGP GFSSNAPMQA TTKLHEENSG FWYQVIQFKN TSTIPYYLDC AMIWWVGPSG LSFDLRNGHY NNEQRPGRGY GHPQRDIIEV VYNQTQKLSV YSIRLSFHDE PYNMRTAYPN QYWSLEVGTP AFLNGQARYT TSAERQALMD LMLNTLHVEL ETNLDRNIEL FDALKMRNRV SN
|
| |