Gene Information |
Locus tag | BLD_1003 |
Symbol | |
ID | 6363667 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bifidobacterium longum DJO10A |
Kingdom | Bacteria |
Replicon accession | NC_010816 |
Strand | - |
Start bp | 1175135 |
End bp | 1177828 |
Gene Length | 2694 bp |
Protein Length | 897 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 642680171 |
Product | arabinogalactan endo-1,4-beta-galactosidase |
Protein accession | YP_001954947 |
Protein GI | 189439866 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3867] Arabinogalactan endo-1,4-beta-galactosidase |
TIGRFAM ID | [TIGR01167] LPXTG-motif cell wall anchor domain |
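
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes the stop codon; neither convention is stated in the record itself. The function name `cds_lengths` is hypothetical and used only for illustration.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a gene length that
    includes the terminal stop codon (assumptions, not stated in the record).
    """
    span = end_bp - start_bp + 1
    if span % 3 != 0:
        raise ValueError("CDS span is not a multiple of three")
    # One codon is the stop codon and does not encode an amino acid.
    return span, span // 3 - 1


# Values copied from the record for BLD_1003.
gene_bp, protein_aa = cds_lengths(1175135, 1177828)
print(gene_bp, protein_aa)  # 2694 897
```

Under those assumptions the result agrees with the listed Gene Length (2694 bp) and Protein Length (897 aa).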
| |
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.629221 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCCGAA ATACCGCGCT TACTCGCATT ATGGCATCCG GCGTGGCCGC CATAATGCTC TGCGCAGGAG GCACGTTCAC TGTAAACGCC GCCGAAGAAG AACCAGTTAA AGCCGACGTC TCGGTCAAGG CCATCCAAGG CTTAAGCGAT GATTTCATCG GCGGTATGGA CGTCTCATCC ATGCTCTCGC TGGAAGAAAG CGGCGTCACG TTCAAGAACG CCAATGGCGA GGTCGAGGAT TTGTTCACGT TGCTCAAGGA ATCCGGCGTG AACTATGTGC GTCTGCGTGT GTGGAACGAT CCGTTCACCG CAGACGGCCA AGGATATGGC GGCGGCAACG TCAACGCCGA TCGCGCACTG ACCATGGCCA AGCGAGCCAC GGCAGCAGGG CTGAAAGTGC TGGTCGACTT CCATTACTCC GATTTCTGGG CCGACCCCAG CAAGCAGCAG GTGCCCAAGG CGTGGAAATC CTTTGAAGGC GACGCGGACA AAACCGCCGA CACCGTATAC GACTACACCA AGCAGACTCT GACCACGTTC AAACAAGCCG GCGTGGACGT CGGCATGGTG CAGGTCGGCA ACGAAACCAC TGCGAAAATC GCCGGCATCT CCGGCTGGGA CGGCATGTCC AAAGTATTCT CCGCAGGTTC CAAAGCCATA CGCGAAGTGC TCCCCGAAGC CAAGGTCGTC ATTCATTTCA CCAACCCGGA GAAAGCCGGC ACCTACGCCA CTTACGCCAA ACAGTTGAGC AACCACAATG TGGATTACGA TGTGTTCGCC AGCTCCTACT ACCCGTTCTG GCATGGCACC ACCGAAAACC TCACCAGCGT GCTCAAGAAC GTGGCCTCCA CTTATAAGAA AGATGTGATG GTGGCGGAAA CGTCGTGGGC ATACACCTTG GATGATGGCG ACGACGATTC CAACACAGTG CCCAGTAAAG TAACCGCCGA TAACCTCAAG AAATACGACA TCAGCCCGCA AGGCCAGGCC GATGAGATTC GCGCGGTCGC CGAGGCCGTG AACAATATCG GCGACAATGA TGGCGATGGC GAAAACGACG GCCTGGGTGT GTTCTACTGG GAGCCCGCAT GGGTGCCCGT GGGCACCGGC GGCAAGGATA ACGCCGAACT GGTGGATACG TGGAACAAGT ACGGTGGCGG CTGGGCCACT GAAGCGGCCG GCGAATACGA TCCCAATGAC GCCGGCCTGT ACTGGGGCGG CTCAGGTGTC GATAACCAAG CTTTGTTCGA TTTCGATGGC AAGGCGCTCG CCTCTCTGCC GACGTTCAAG TACATTCACA CCGGTGCCGT CACCGATCAT GTGTTCACCA AGATAGATCC AGTGGAGATT ACCGCGACTG ACTCGGATTC CATCGACGCG ATCAAAGCGC AATTGCCGAG TGAGGTGGCC GCACACTATC AGGATGGGGT GGACGAGACC GAAACCGTGA CCTGGCAATC CGCTGCGCTT GACTGGATTC GTGGCGCAGG CACATACACC ATCACCGGCA CCACCAACGC TGGCCACGAC GTGACCGTCG CCGTCACCGT CACCGCCACT CCCGCCAAGG ATTACGTAAC GGACGGCAGC TTCGAGAACG CTGAGAACGA TAAGAACTGG ACCATTGCCG GCACCGGTGC GTCCATCACC GAAGACAGTG GCAACGCCGC CGACGGCAAG CGCGCGCTGA AGTTTTGGGC ATCCGATGCC TACAGTTTCT CCGCCACGCA GACCATCACC GGACTCGAGC CTGGCGAATA CGTGCTGACC GCCATGAGCC AAGGCGCGGC AGCGGACAAC 
GCCGCTATTA CCGATGGTGT GGCCCTCTCC GCCACAACGG GTGGCAAGAC CACGTCCGAT GCGCTGGAAC TCAACGGTTG GGTTAAGTTC GATACCGCCA CCGTGCCTGT CACCGTGGGT GCTGATGGCA CGGCGACCAT CACCATCACG GGCAACCTGC CTGCCGACGC ATGGGGCAAT GTGGACAAGG TCTCACTCGT CAAAAAGACC GAAACTCCGG TGAAGCCATC CACGGAGAAC CTGGACAAGG CCGTGGCCGA AGCCGGGAAA ATCAATCGCG ATGAGTACAC CAACGAGTCG CTCGCCAAAC TCGATCAGGC GCTTGCCGCC GCTGACGTGC TGCTTGCCGG CAGCACCTAC ACCGAGCAGG ACGTGAACGA TGTGATCAAG CTTGTTGCCG ATGCCATTGC CGGCCTCGCC CAAAAGGAGG TCTCCAGCCT GACCGTCACG CCAAGCAAAA CCACCTATCA GGTGGGCGAT GCCATCGATG CCGACCATGA TCTCAAGGTC GTGGGCAACT ATTCCGCAGG CATGGGCAAC GTGACGCTTT CCGCCGACCA GTTCACGCTC GATTACGATT TCTCCGCTCC CGCTGATGCG GCGAAGGTTA CCGTAACGCT CAAGTCGAAT CCGAATGTCA CCGAAACCTA TACGGTTGCG GTTACCGCTC GGGCCGAAGG TGGTTCCGGC AACGGCTCGG ACGGTGCGGG TAATGGCGGG GCTACTATCA ATCCCGATAC GGGCGAGGGC GATAAGACCA ACGGTGCGAA TGGCGACAAG ATCACTGGTG TGCTGAGCAA TACCGGTAGC GCGGTGACTG CCGTGGGTCT TGCGGTTGTC GTTCTCGGCG TGGCCGGCGG GGTCTCGCTT GCTTTGCGTC GTAAGCGCTC CTGA
Protein sequence | MRRNTALTRI MASGVAAIML CAGGTFTVNA AEEEPVKADV SVKAIQGLSD DFIGGMDVSS MLSLEESGVT FKNANGEVED LFTLLKESGV NYVRLRVWND PFTADGQGYG GGNVNADRAL TMAKRATAAG LKVLVDFHYS DFWADPSKQQ VPKAWKSFEG DADKTADTVY DYTKQTLTTF KQAGVDVGMV QVGNETTAKI AGISGWDGMS KVFSAGSKAI REVLPEAKVV IHFTNPEKAG TYATYAKQLS NHNVDYDVFA SSYYPFWHGT TENLTSVLKN VASTYKKDVM VAETSWAYTL DDGDDDSNTV PSKVTADNLK KYDISPQGQA DEIRAVAEAV NNIGDNDGDG ENDGLGVFYW EPAWVPVGTG GKDNAELVDT WNKYGGGWAT EAAGEYDPND AGLYWGGSGV DNQALFDFDG KALASLPTFK YIHTGAVTDH VFTKIDPVEI TATDSDSIDA IKAQLPSEVA AHYQDGVDET ETVTWQSAAL DWIRGAGTYT ITGTTNAGHD VTVAVTVTAT PAKDYVTDGS FENAENDKNW TIAGTGASIT EDSGNAADGK RALKFWASDA YSFSATQTIT GLEPGEYVLT AMSQGAAADN AAITDGVALS ATTGGKTTSD ALELNGWVKF DTATVPVTVG ADGTATITIT GNLPADAWGN VDKVSLVKKT ETPVKPSTEN LDKAVAEAGK INRDEYTNES LAKLDQALAA ADVLLAGSTY TEQDVNDVIK LVADAIAGLA QKEVSSLTVT PSKTTYQVGD AIDADHDLKV VGNYSAGMGN VTLSADQFTL DYDFSAPADA AKVTVTLKSN PNVTETYTVA VTARAEGGSG NGSDGAGNGG ATINPDTGEG DKTNGANGDK ITGVLSNTGS AVTAVGLAVV VLGVAGGVSL ALRRKRS
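
The relationship between the gene sequence and the protein sequence can be sketched in a few lines. This is an illustrative sketch, not the tool the database used: it assumes the gene sequence is shown on the coding strand (it begins with ATG even though the gene lies on the minus strand of the replicon), and it uses the amino acid assignments of translation table 11, which match the standard genetic code and differ from it only in the set of permitted start codons. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; the amino acid assignments are those of the
# standard code, which translation table 11 shares.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string up to the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence in the record.
print(translate("ATGCGCCGAA ATACCGCGCT TACTCGCATT"))  # MRRNTALTRI
```

The first ten residues match the start of the listed protein sequence, and the final codons of the gene (CGT AAG CGC TCC TGA) translate to RKRS followed by a stop, matching its end.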