Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpin_2861 |
Symbol | |
ID | 8359022 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chitinophaga pinensis DSM 2588 |
Kingdom | Bacteria |
Replicon accession | NC_013132 |
Strand | + |
Start bp | 3538706 |
End bp | 3541660 |
Gene Length | 2955 bp |
Protein Length | 984 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 644965041 |
Product | coagulation factor 5/8 type domain protein |
Protein accession | YP_003122541 |
Protein GI | 256421888 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00229986 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGGAA TGAACCATTT ATTAGGCAGG GCGCTGGTTG TAATGGCTGC CCTCGGATTT CCAACAGTAG TGGCTGCACA GGTAAATACT GTCAGTCTGG GTAGTTCAGA CCAGCAGATC TGGTATGTAA AATCAGCTGC CGAAACCGGT ACAAATGCAA AGGATTCGCA ATTACAAACA TCGGATTGGA CGAAAGCAAT CGTGCCGGGT ACTACCTTTT CTTCTTTTGT TGCCGCGGGC AAGGAAGAAG ATCCCAATTT CGGCGATAAT ATCTATAAGG TAGATCGCAG TAAATATGAC AGGGATTTCT GGTATAGAAC GACGTTCAAA ATACCAGCAG CCTATGCACA AAAAAGAGTC TGGCTCAACT TTAATGGTGT CAACAGAAAG GCATCCGTTT ATCTCAATGG AAAACTGCTT GGTAACCTGG ATGGGTTTAT GGATCGTGGT CATTTTGACA TTACTGATGA TGCTGTGACC AATGGAGATA ATGTACTGGC AGTACTTGTA AGCATCCCTA TACAGCCACT GGCGAATGTG GGTAGCCCCA ATTATGTGGC GAGTGCAGGA TGGGACTGGA TCCCATATGT ACCCGGATTA AATTCAGGTA TTACAGATAA AGTGTTCCTT AGCGCCAGCG GCGATCTCTC CATACAGGAT CCATGGGTAC GTACAGACCT GCCGACCAAT GCAAAGGCTT ATGTGTCCTT ATCCATGCAG GTTAAAAACA GCGCTTCGAA GGATGTGGAT GGCACCTTAC AGGGTACTAT CATGCCTGGT AACATCCAGT TTTCACAGAA AGTGCGTGTT AATCGCAACA GCACCAAAGA TATCAGCTTT ACCAAACGCG ATTTTGAACA ACTGATATTG AACAATCCCC GTTTATGGTG GCCCAATGGT TATGGCGCGC CTAATCTCTA TGAGCTGGAT CTGAAACTGA CGGTAGGGGA TAATATCTCC GATGAAAAAC AGGTCAAGTT TGGTGTTAAA CGCTATAGCT ATGATACTAC CGGTGATGTG TTGCATATTG CAATCAATGG TACGCCGGTG TTTATCAAAG GAGGTAACTG GGGTATGTCG GAGTACATGT TGCGTTGCCG TGGGGCTGAA TACGATACGA AGGTTCGTCT GCACAAGGAA ATGAACTTTA ATATGATCCG TAACTGGATC GGTTTGACGA CTGATGAAGA GTTCTACGAA GCCTGCGATA AGTATGGTAT TATGGTATGG GATGAATTCT GGTTGAATTC TAATCCGAAT CTGCCTGCTG ATCTACAGGC TTTCAATGCC AACGCCATCG AAAAAATCAA ACGTGTCAGG AATCATCCTG CTGTGGCGAT CTGGTGTGCA GATAATGAAG GCTGGCCAGA ACCACCATTG AATAACTGGT TAAGGGAAGA TGTGCGTGTT TTTGATAAAG GAGACCGCTT TTATCAACCT AATTCTCATG CGGAAGGACT GACCGGTAGT GGTCCCTGGA TGGCAAAAGA TCCAAGGTAT TACTTTACAG CTTATCCGAC AGGTCTTGGT GGCAACAAAG GTTGGGGAAT GCGTAGTGAG ATCGGCACCG CTGTTTTTGT GAACATAGAG AGCTTTAAGA AGTTTATGCC ACAGGAGAAG TGGTGGCCGC GTAATGAAAT GTGGAATCAG CACTTTTTTG GTCCGAATGC CTTCAATGCC GGTCCGGATG AATACGACCA GATGATCTCC CGTGGATATG GAAAACCGGA AGGTATTGAG GATTATTGCC GTAAGGCGCA GTTTGTCAAT CTCGAAAGTA ATAAAGCGAT GTATGAAGGC TGGCTGGATC ACATCGGAGA AGATGCCGCT GGTGTAATGA CCTGGATGAG CCAGTCCGCA TATCCATCTA TGGTATGGCA GACCTACGAT TATTATTACG ACCTCACGGG GGCTTATTTT GGAGCTAAAA AGGCATGTGA GCCATTACAC ATCCAATGGA ACCCGGTTAC CAATGCCATT AAGGTCGTAA ACACCACGCG TAGCGAGGTG ACCGATCTTA CTGCCAGTGC AGAAATTTAT AACCTGGATG GGCGTATTGT TAAACAATAC AGTAAATCTG CCCAAATTTC TTCTCCTGCT TATAGTGCGG GTGAAGCCTT TAAACTCGAT CTGACACCAG ATCAGACGGA TCTCGCAAGA GGTAAAAAGA TGTTTGCTTC TTCTTCCCAG GACGGAGATC CATCTGCTAC CAATGATGGC AATGCACAGA CGCGCTGGGC CAGCAGGTAT AACGATGATG AATGGATCTG TGTGGATCTG GGAAAAACAG CAGTAGTAAA TGGCGTCGGT TTGAACTGGG AGGAGGCTTA TGCGAAATCC TTTAAAATAG AGGTCTCTGA TGATAATTCA AGATGGCAGC AGGTATACCG TACTGATGAA GGACGTGTTG GGCAGCAGAA GATTGTTTTC CCTGAAGTGT CTGCACGTTA TGTAAGAATG CATGGTATAG AGCGCGGTAG CTGGTGGGGG TATTCGCTGT TTGATTTTGA AGTATATCAG GGGGATGTGG CCAGCGCCGG ACTGAGTGAC GTACATTTCA TCAAACTGAA GCTTGCCGGT AAGGATGGTC GGCCTATCTC TGAAAACTTT TACTGGCGTG GTAATAAGCG CAAGGACTAT AGCGCTTTAA ATACCCTTGG AAAGGCGGAG CTGAAAGTAC AATACAAGAC CACTAAAGCA GATGGCAAAA CATATGTTAC CGCTACAATC AATAACCCGG TATCATCGGC AAGCGCCGCT TTTGGAATCA GACTGCTGCT GACCGGTGCT ACAGATGGTA AGCAGATTCT TCCTGCCATT TTTAGTGATA ACTATTTCTC CCTGATGAAT GGTGAGACAA AGACAGTAAC GATTGAGTTC GATAGCAATG CAGTTGGTAA AGATGGGTTC AAGCTCACTG CAGAACCATT TAATAACCAT ATCGTACGGA AATGA
|
Protein sequence | MTGMNHLLGR ALVVMAALGF PTVVAAQVNT VSLGSSDQQI WYVKSAAETG TNAKDSQLQT SDWTKAIVPG TTFSSFVAAG KEEDPNFGDN IYKVDRSKYD RDFWYRTTFK IPAAYAQKRV WLNFNGVNRK ASVYLNGKLL GNLDGFMDRG HFDITDDAVT NGDNVLAVLV SIPIQPLANV GSPNYVASAG WDWIPYVPGL NSGITDKVFL SASGDLSIQD PWVRTDLPTN AKAYVSLSMQ VKNSASKDVD GTLQGTIMPG NIQFSQKVRV NRNSTKDISF TKRDFEQLIL NNPRLWWPNG YGAPNLYELD LKLTVGDNIS DEKQVKFGVK RYSYDTTGDV LHIAINGTPV FIKGGNWGMS EYMLRCRGAE YDTKVRLHKE MNFNMIRNWI GLTTDEEFYE ACDKYGIMVW DEFWLNSNPN LPADLQAFNA NAIEKIKRVR NHPAVAIWCA DNEGWPEPPL NNWLREDVRV FDKGDRFYQP NSHAEGLTGS GPWMAKDPRY YFTAYPTGLG GNKGWGMRSE IGTAVFVNIE SFKKFMPQEK WWPRNEMWNQ HFFGPNAFNA GPDEYDQMIS RGYGKPEGIE DYCRKAQFVN LESNKAMYEG WLDHIGEDAA GVMTWMSQSA YPSMVWQTYD YYYDLTGAYF GAKKACEPLH IQWNPVTNAI KVVNTTRSEV TDLTASAEIY NLDGRIVKQY SKSAQISSPA YSAGEAFKLD LTPDQTDLAR GKKMFASSSQ DGDPSATNDG NAQTRWASRY NDDEWICVDL GKTAVVNGVG LNWEEAYAKS FKIEVSDDNS RWQQVYRTDE GRVGQQKIVF PEVSARYVRM HGIERGSWWG YSLFDFEVYQ GDVASAGLSD VHFIKLKLAG KDGRPISENF YWRGNKRKDY SALNTLGKAE LKVQYKTTKA DGKTYVTATI NNPVSSASAA FGIRLLLTGA TDGKQILPAI FSDNYFSLMN GETKTVTIEF DSNAVGKDGF KLTAEPFNNH IVRK
|
| |