Gene Information |
Locus tag | Cpin_4811 |
Symbol | |
ID | 8360987 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chitinophaga pinensis DSM 2588 |
Kingdom | Bacteria |
Replicon accession | NC_013132 |
Strand | - |
Start bp | 5999523 |
End bp | 6002438 |
Gene Length | 2916 bp |
Protein Length | 971 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 644966961 |
Product | coagulation factor 5/8 type domain protein |
Protein accession | YP_003124446 |
Protein GI | 256423793 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
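The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
# Values copied from the Gene Information rows above.
start_bp = 5999523
end_bp = 6002438
gene_length_bp = 2916
protein_length_aa = 971


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions both checks agree with the record: 6002438 - 5999523 + 1 = 2916 bp, and 2916 / 3 - 1 = 971 aa.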
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000406891 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.000000000000101535 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGTTGAGAA ACAAACGATG GGCGCTTTCA GCGGTCTTAG GGATATTATG CCTGGCAGCA AACGGACAGG AAAAGATCAT GACCTTAAAC AGCAGCAATG CGGCCGTAAG CTGGAAAGTG AAAGCGGCAG CGGAGCTGGG ACAGACAACG GATATTCATG CAACAGCTTA CAACGACCAG CAATGGGTGA AAGGGATTGT ACCCGGGACG GTGTTTGGCT CATTTGTGGC CGCCGGATTA GAGAAAGATC CTAACTATGC GGATAACATT TACCAGGTAG ATAAAGCGAA GTATGACCGG GATTTCTGGT ATCGCAGCAC GTTTAAGTTT TCCCGCCGGA AGGCAGGAGA GCAGCAATGG CTCAACTTTG AAGGCGTGAA CCGGAAAGCG GAAGTATTCC TGAATGGCCA TCGGCTTGGT TTACTGGACG GCTTTATGGA CCGGGGAAAG TTTGACGTGA CGAACCTGTT ACGGTATGAT CAGCCGAATG TACTGGCCTT GCTGGTGAGC TGGCCAGGTA CGCCTATCGT GAACTATTCA AGTCCGACGT ATATTTCCAG CGCCAGCTGG GACTGGATGC CCTATGTACC CGGACTGAAC ATGGGTATTA CGGATGATGT ATACATTACC GGTTCCGGCG CAATCACTAT CCAGGACCCA TGGGTACGTA CCAGTGCCGC GGATACCTCA CTCGCTAAAC TGAGTATTTC GATGGAGTTG GATAATCATT CTGCACAGGC ACAGGAGGGT ACCATTTCCG GCACGATTCA GCCGGGTAAT ATCCGATTCT CAAAGAATGT AAAACTATCA GCCGGACAAA CGGAACAAGT CTCTTTTCAG CCGGAAATAG CCCATCCTGC GCTTTGGTGG CCGAACGGGT ATGGCAGTCA GCCCTTATAT ACCTGTGACC TGCAATTCAC CGTAAAAGAC AGCGTGTCTG ACAGCCATAA CGTGACTTTC GGTGTGAGAC GTTTCAGTTA TGACACCACA GGTGGTGTCT TGCATATCTA TATCAACGGG CAAAAGATCT TCATCAAGGG CGGTAACTGG GGGATGTCGG AGTACCTGTT ACGTTGTCGT GGCAGTGAAT ATGATACGAA GCTAAAGCTG CATCGTGAGA TGAATTTTAA TATGGTGCGC AACTGGATTG GAAGTACGAC AGATGAAGAA TTCTATACTG CCTGTGATAG ATATGGCTTG CTGGTATGGG ATGATTTCTG GTTGAACTCA CATCCCAATC TGCCTAAAGA TATCTTTGCT TTCAACAGGA ATGCGGTGGA GAAGATCAAG CGGCTTCGTA ATCATGCCAG TATCGCGGTA TGGTGTGGCG ATAATGAAGG TTATCCTTTA CCACCTTTGA ATAATTGGTT GAAAGAGGAC GTCAGCACAT TTGATGGGAA TGACCGTTTG TATCAGGCGA ATTCTCATGC TGATGGTCTG ACGGGTAGTG GTCCGTGGAC GAACTTTGCA CCCGCCTGGT ATTTTACCAG ATTTCCCGGT GGATTTGGCG GTACGCCCGG ATGGGGGCTA CGTACTGAGA TCGGTACGGC GGTGTTTCCT TCTTTTGAAA GCTTTAAACA GTTTATGCCG GACAGCAGCT GGTGGCCACG TAATAAAATG TGGGACCTGC ATTTCTTTGG TCCGTCGGCG GCTAATGCTG GTCCGGACAG ATATGACGAA GCGATCAACA AAGGTTACGG TACTGCCAGC GGTATTGAAG ACTATTGCCG GAAGGCGCAG CTGGTGAATA TTGAAGTCAA CAAGGCGATG TATGAGGGTT GGTTACATAA TATGTGGAAG 
GATGCATCGG GTATTATGAC CTGGATGAGT CAATCTGCTT ATCCGAGTAT GGTATGGCAG ACTTACGACT ATTACTATGA CCTGACAGGC GCTTACTGGG GCGTGAAAAA AGCCTGCGAG CCTTTGCACA TTCAATGGAG TGCGGCGGAT AATTCCGTGA AGGTGGTGAA TACTACTTTA CAGGATTATA GTAACCTGAA GGCGGAAGCG ATTGTATACA ATATGGATGG AACGATTGCT AAACAGATTG GTCAGACAGC GACTGTCCGT GCCGCTGCGA ATAATACAAC GCCTTGTTTT GATCTGAATT TTAATGCAGA CAATCTGGCA TTCAGGAAGA CAGTGGTAGC TTCCTCTTCT TCTCCGGAGA GTGCAGGCAC TGCTGCCGCA GCAGATGGCA GTGTAGGTTC CCGCTGGAGC AGTAATTACA ACGACAATGA ATGGATCTAT GTGGATCTGG GTGTTGCGCA GGAGATCAGT AATGTCGTAT TGATCTGGGA AGACGCACAT GCGGCAGCTT ATAATTTACA GGTCTCTGAT GATGCGCAGT CCTGGACAGA TGTCTATAAG ACAGAGACCA GTAAAGGCGG TACTGAAACC ATTGCTTTAC AGGCGGTGAA GGCGCGTTAT GTAAGAATGC TCGGACGTAA GCGGGCTTCA CAATGGGGGT ATTCTTTGTA TGAGTTGGAG GTATATGGGA AACGTAGTGC GACCCTTTCA GAAGTGCAAT TTATACGCTT GCGCCTGAGT GATGCCAAAG GTAGTCTGCA ATCTGATAAT TTCTATTGGA GAGGCAACCG GAATGGTGAT TATACGGCCT TGAATCAATT ACCGGCAGTA CAGCTGAAGG TGGGTTCGAA GGCGGTGCAG GTGAGTGATA GTACACGTAT TACAGCGACG GTGAGCAATC CTTCCAATGC TGCCGGACCT GCATTTGCGG TATGTGTGCA GGTAGTGAGA GCGGATAACA ATGAGCGGGT ATTACCGCTT GTGATGAGTG ACAACTATTT CACTTTGCTG AAAGGTGAAA GCAAACAGCT GGAGATCTCT TTTGAGAAGC GATTGCTGGA GAGTGGTAAG TACAAATTGA TCGTTACACC TTACAATCAT AAATAG
|
Protein sequence | MLRNKRWALS AVLGILCLAA NGQEKIMTLN SSNAAVSWKV KAAAELGQTT DIHATAYNDQ QWVKGIVPGT VFGSFVAAGL EKDPNYADNI YQVDKAKYDR DFWYRSTFKF SRRKAGEQQW LNFEGVNRKA EVFLNGHRLG LLDGFMDRGK FDVTNLLRYD QPNVLALLVS WPGTPIVNYS SPTYISSASW DWMPYVPGLN MGITDDVYIT GSGAITIQDP WVRTSAADTS LAKLSISMEL DNHSAQAQEG TISGTIQPGN IRFSKNVKLS AGQTEQVSFQ PEIAHPALWW PNGYGSQPLY TCDLQFTVKD SVSDSHNVTF GVRRFSYDTT GGVLHIYING QKIFIKGGNW GMSEYLLRCR GSEYDTKLKL HREMNFNMVR NWIGSTTDEE FYTACDRYGL LVWDDFWLNS HPNLPKDIFA FNRNAVEKIK RLRNHASIAV WCGDNEGYPL PPLNNWLKED VSTFDGNDRL YQANSHADGL TGSGPWTNFA PAWYFTRFPG GFGGTPGWGL RTEIGTAVFP SFESFKQFMP DSSWWPRNKM WDLHFFGPSA ANAGPDRYDE AINKGYGTAS GIEDYCRKAQ LVNIEVNKAM YEGWLHNMWK DASGIMTWMS QSAYPSMVWQ TYDYYYDLTG AYWGVKKACE PLHIQWSAAD NSVKVVNTTL QDYSNLKAEA IVYNMDGTIA KQIGQTATVR AAANNTTPCF DLNFNADNLA FRKTVVASSS SPESAGTAAA ADGSVGSRWS SNYNDNEWIY VDLGVAQEIS NVVLIWEDAH AAAYNLQVSD DAQSWTDVYK TETSKGGTET IALQAVKARY VRMLGRKRAS QWGYSLYELE VYGKRSATLS EVQFIRLRLS DAKGSLQSDN FYWRGNRNGD YTALNQLPAV QLKVGSKAVQ VSDSTRITAT VSNPSNAAGP AFAVCVQVVR ADNNERVLPL VMSDNYFTLL KGESKQLEIS FEKRLLESGK YKLIVTPYNH K
|
| |
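The gene and protein sequences above are displayed in space-separated blocks of ten characters. As a minimal sketch of how the two relate, the code below strips that spacing and translates the first three blocks of the gene sequence. It assumes the gene sequence is shown 5' to 3' on the coding strand (it begins with ATG and ends with TAG, so no reverse complement is applied), and it uses the standard codon assignments, which translation table 11 shares for amino acids. The names `clean` and `translate` are hypothetical helpers.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence and first block of the protein.
gene_prefix = "ATGTTGAGAA ACAAACGATG GGCGCTTTCA"
protein_prefix = "MLRNKRWALS"

print(translate(gene_prefix) == protein_prefix)
```

The sketch does not handle alternative start codons, which table 11 permits and which are translated as M at the first position. This record starts with ATG, so that case does not arise here.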