Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPS_2369 |
Symbol | |
ID | 3520099 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Colwellia psychrerythraea 34H |
Kingdom | Bacteria |
Replicon accession | NC_003910 |
Strand | + |
Start bp | 2468407 |
End bp | 2471451 |
Gene Length | 3045 bp |
Protein Length | 1014 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 637284826 |
Product | glycosy hydrolase family protein |
Protein accession | YP_269087 |
Protein GI | 71279640 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3693] Beta-1,4-xylanase |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.195451 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAACT ATAAATGTAC CTTTGCCCTT ATGCTTATTT TCTGTTTGGG AGTACAAGGT TGTGGCTCTG AAGAGCAAGT AATAGAAGAA CTACCAATAC TTGAAATTCC AGTTGAGGAT GCTCCAGTAG AGGATGTTCC AGTTGAGGAA ATTCCAGTTG AGGAAGCTCC AGCTGAGGAA GTTCCAGTTG AGGAAGATCC AGTTGAGGAA GTTCCAGTTG AGGAAGCTCC AGCTGAGGAA GATGTTGAAC TTTCTGCATT TGAACTAGCG ATGGCAAAAT GTAATAACCC TTGGAATGAG CTAGGTGTAG CGTTAAATGA TAATAGTGAG TTGCTAAATG ACCCATTATT ATCGACATGT TATCGTGTGT CAGGGGTTGT TGATGGCTTA ATTGATGACA ATATTGACTT GTCAATTAAT GGTGCTGAAA CGCTAGCGGT GGCAAGTGAG GGGCAATTTG AATTTGTTAC TCCTTTTATC AGTGATGAAA CGTTTATATT GAGCGTAAAA ACTGCAGCAA CAAAACATCA ATGTAATTTG ATTAATAACA GCGGCACAAT TAATGATGAG CAAGCCAATA ACCTTATTGT TCATTGTATA GCTACTACCA GCCAATGTGA TAAAACCACA GAAAATATTA CTGATGATAA AAACTTCTCA GGTAATAATA TTGAAGGAAT TGAGAGTCTT GTTAGTGATA TTGAATGTGG TGTAGAACCA TGGCGAGAAG CAGCTAACGA ACGTATCAAT AACATTCGTA AAACGTCAGG TACTATTACT ATTGTTGATA AAAATGGTGA GCCAGTAACT AACGCTAAAG TGAACTTAAC CCTTAACCGT CATAACTTTA AGTTTGGGGG AATTGCTCAG GCAAAATTAT GGCATGGTGA AGCCGATGAT GTTGCAGATC TATATAAAGC GGCTTATTTA GATTTTGGTT TTAATAAAGG TGGTTTTCAA AACGCTTTAA AATATAAATT AAGAGCAGGT TTAGAGCCTT TAGTACCAGC TATGCTTACT TGGTTTAAAG CGCATGATAT CCCTGTGCGT GGGCATGCGT TGATCTGGCC AAAATGGACG AATATGGAAA CAACAGTATC AGCACAAGAT GCATTAGATA TGGGCATTAC CCAAGGAGAT GTTGCGAACC TTCCCAGTGA TGAACTGAAA ATTTATGTTG ATACCACGAT TAGAAATTGG GCGAGTAAGT GGGATGTTGT TGAGTGGGAT GTTGCTAACG AATTACGAGG TCACTACGAT GTGCAAGATA TTCTAGGCTA TCAAGAAGAA GCACATTGGT ACAAATTAGC TAAGGCCAAT GTTCAAAATT CGGCGACATT ATTTATTAAT GATAATCGAA TTATATCAGA TAGCAGTGAA ACTGTGGTTT CAGATAAAGT TGCAGGCTAT AAAAGTAATG TTGAAAGTAT TCTTGCAGAC GATACGTTAA ATGAAGGTCA TGTTGAAGCG CTTGGATTTC AAAGTCGCTT TGGTTCGATG TTATCTGCGG ATACTATTTA TCAACGTTTA TCCTATTTTG ATGACTTAAA TCTGCCAATT TCAGCGACGG AATTTGAGAT TAAAGATGAC TTGATAACCA CAGAGATTGA CCGCGCTGTT TTAACTGAAC GAGTGATGAC TGTTTACTTT AGTAAAGAAA GTGTCAGTGA TATTTTAGTC TGGACATTTT TTGAAAGCTC TAGCCGAAGT GATGCTCGCC ATTTAGTCGA CTTAGAAGGC AATGCTAATT TACGAGGAAA AACGTGGTTG TATTTAGTTA AGAAACATTG GAATACCGAT GTTACCACTT GGTTAGATAG GCAGGGGGAA ACACAGCTAA ACGGCTTTAA AGGTGAATAT ACTGCAACGG TATCTTTTAC AAATTACCCT GATGAGCAGG TTGACTTCTC TTGGATTGAT GGCACAAAGG GTAAAACAAT TCAGTTACTT AATTACGCCA ATGGTAGCGA CAGTGCAACA CCTGCTAGTT TCAGTATTGA TAGTTTTGAA AATACTGAAG TAGAGGAAGG CGTTATGTTT ACTAGTACAG TACCGACCCT GTCAGGTGAT GATAATATTG GCGCTATTAC TTGGACTGTA ACCGGTGATG ATGCCAACTT ATTTAACCTT AACTCTATAA CTGGCGTTTT AAGCCTTACA GCACAAGACT TTGAATCACC AGCTGATACT GATATTGATA ATAACTATGC GGTTATTTTA ACTGCGACGG ATGCTGTTGG TAATTTTGCA GAATTAGCTT TAGAGCTCGT TGTAACTGAT AATCTTGCCG ATAACCAAGT ACTAGTAAAC TTCACTATTG ATGAATTCAT TACCACAACA ATTGCTGAAA ATGTAATTTA TAGCAGTGAG CTGCCAAACC TTTCTGGTGA TGAAGCAGTT GGTAGTGTTA TTTGGACTGT TGAAGGCGAT GATGCTAATT TATTTGTTGT CGATTCATCG ACAGGTGCAT TAACTTTAGC GGCTCAAGAT TTTGAAAATG CTAATGATTT AGCTGGAGAT AATGACTATC TAGTAACACT TGTCGCGACT GATTCAGAAG ATAATTTCGC CCAACTAGCA TTAGTTATTT CAGTAACCAA TGTTGATGAA GTTGTTGTTT ATTCACCACC TGAGATTAGT GGTGACAATG GTGATATTTC GAGTGTTATC AATGCTGGTG ATTTAGTTTT TACTCGTCCA GCCCTGGCGA CGGAAACGAC ACTTGATGGT GGCGCTGCCA CAGTTGAAGG TAATAAATGG AAGTTATTCA ATTGGAGTGA AGCCGAAGCT TACTGTAGTG ATATTGGTGC AAGATTACCA ACTAAAACAG AGTTATCAGA TAACTTATTA ACGTTAGTTA ATGATGCCGA CTTGGTTAGC AATGGCAGTT TCTCCGCTAC CGAGCATTGG CCTGTAAATA AAGGTTACTG GGCTAGCACT TTTCCTGAGG ATGGTAAACA TCACTTAATG AAAACAAGCA TAGATCCGGC AAAAATGTCA GCTTTAGCTG ATACCAACCG TCAATATGTA ACTTGTGTTA GATAA
|
Protein sequence | MKNYKCTFAL MLIFCLGVQG CGSEEQVIEE LPILEIPVED APVEDVPVEE IPVEEAPAEE VPVEEDPVEE VPVEEAPAEE DVELSAFELA MAKCNNPWNE LGVALNDNSE LLNDPLLSTC YRVSGVVDGL IDDNIDLSIN GAETLAVASE GQFEFVTPFI SDETFILSVK TAATKHQCNL INNSGTINDE QANNLIVHCI ATTSQCDKTT ENITDDKNFS GNNIEGIESL VSDIECGVEP WREAANERIN NIRKTSGTIT IVDKNGEPVT NAKVNLTLNR HNFKFGGIAQ AKLWHGEADD VADLYKAAYL DFGFNKGGFQ NALKYKLRAG LEPLVPAMLT WFKAHDIPVR GHALIWPKWT NMETTVSAQD ALDMGITQGD VANLPSDELK IYVDTTIRNW ASKWDVVEWD VANELRGHYD VQDILGYQEE AHWYKLAKAN VQNSATLFIN DNRIISDSSE TVVSDKVAGY KSNVESILAD DTLNEGHVEA LGFQSRFGSM LSADTIYQRL SYFDDLNLPI SATEFEIKDD LITTEIDRAV LTERVMTVYF SKESVSDILV WTFFESSSRS DARHLVDLEG NANLRGKTWL YLVKKHWNTD VTTWLDRQGE TQLNGFKGEY TATVSFTNYP DEQVDFSWID GTKGKTIQLL NYANGSDSAT PASFSIDSFE NTEVEEGVMF TSTVPTLSGD DNIGAITWTV TGDDANLFNL NSITGVLSLT AQDFESPADT DIDNNYAVIL TATDAVGNFA ELALELVVTD NLADNQVLVN FTIDEFITTT IAENVIYSSE LPNLSGDEAV GSVIWTVEGD DANLFVVDSS TGALTLAAQD FENANDLAGD NDYLVTLVAT DSEDNFAQLA LVISVTNVDE VVVYSPPEIS GDNGDISSVI NAGDLVFTRP ALATETTLDG GAATVEGNKW KLFNWSEAEA YCSDIGARLP TKTELSDNLL TLVNDADLVS NGSFSATEHW PVNKGYWAST FPEDGKHHLM KTSIDPAKMS ALADTNRQYV TCVR
|
| |