Gene Information |
Locus tag | Cpin_4271 |
Symbol | |
ID | 8360444 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chitinophaga pinensis DSM 2588 |
Kingdom | Bacteria |
Replicon accession | NC_013132 |
Strand | + |
Start bp | 5331989 |
End bp | 5335285 |
Gene Length | 3297 bp |
Protein Length | 1098 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 644966434 |
Product | hypothetical protein |
Protein accession | YP_003123922 |
Protein GI | 256423269 |
COG category | |
COG ID | |
TIGRFAM ID | |
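
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and are not part of the source record.

```python
# Values copied from the Gene Information fields above.
start_bp = 5331989
end_bp = 5335285

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values match the Gene Length (3297 bp) and Protein Length (1098 aa) fields.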
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.274575 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCTAACTA ATAAAAACCC TCAACACGTC AATACGACCA CGTCAAATGC TACATCTGCC ACCAGCTATT CAGGTATATC AATGCCTGCT GTTCTCCCTT ATCAGAAAAC GGCAATTGAG CAACACGAGA AAAATGAAAG CCCGCTTAAA ATGTCGGAAA ACCCAGATAT GGGTGCCATT GCACCATTTC AGCTGAAAAC AGATAGAATA GATACCACTT CGAGCCAGCA AAATCATCCT TCACAAGAAG CCTGGCATGT CGTACAGCTG AAGCAGAATA AAATAGGGAC TGTTCCTGTT GTACAGCTAA AAGAACCCTG GGATTCATTG GCAAAGGAAA AGTGGAACAG CACTATTGAG AATATAGAAA CATACGCCCA TCACATAGGA ATTTACGACA TAGAGGAATC CGATATCATT GATTACATCA GAAATATAGA CAACATAGAA CCTCCGAACG GACAGAGTTT TAAAGAGCTG GATTACGGGA GCTATTGCAA CGAGCTGGAT AATGTAGACG CTAGAAAAGC CGTCAAATAC GCGGTAGAAG TTCATCGGCA TCGTGAATGG CTGAATTTTG AAGTCGAAGA TTCTGAATCC AATGATGATG AGGAACGCTA TTATAGCAGC GAAGAAGATG AAACGCCGGG AAATGAAGTA GAATTTAATC TCATTATTGC TGTTATTATA GCATTGAACA AATTGCTCAG TGATTACATT TCTGATTTAA ATCCCGATAA AAAATTTGAT AAAATTGCAA TCAATGTAAT TGATAATATC CGAAAGCATA AGCAAACTTC GGCATATAAA AGAACAAAAA AATTCCCCAA TTCTTTTAAT AAAGTTACTA TTGGTGGTAT GGAAGATTGG CTAAGTTTTG CCAATGAATT CAGATACGCC AAAAGTGGCA GGAGTAAAGA GAAACCAACC GTTTATAACA TTGAGTTCGG ATTATGGAAT AAAGCTAAAC ATAAAGAAAT TCAGTTTCGC GGCCGGCAGC TGAAGGATGC GGAAGGTAAT CCAGGCAGAT CCTGGGCAGC GATAGCACTT TCTTTGCATA ACCTTGCTGC TGCATTGGGC GCTCATTTGC TCAAAGCGTC AGATAAGAAG ATGCAGAAGG GAAAGGGAAA CCAGATTCTT GCTGCTGCTA TGTTGGCCAT TCTGAGCGGT CGGGGCAATG AAGTATATGT CGTACTTCAG CAATATAAAG TAGATCATAA AGTCATTCAG AATTTTGAGT CGCTTGTGCT CGAGTTTTTC GCACTGACAA TGGGCATTGA GAACATAAGG CTTGAGTTCA CACAGCTGGA TACTATCTTT CACCTGAACA ATATAGCCCG TGGGGCGACA TATGGACAGT ATAACAAGAA ATATAACTTC CTCAATGTAT TTTCTTCCGT AAGGGAAACC CCGAATGGAG ATATGGTGCA CGTTTATGAC CGGGCACGGT ATCTATGGAA AAGGAAAAGG GATGACGATA AGTACAGCGA AATGAATGAC ATCGAAATTG CGGAGGAAAT AGAAGATGTA GAGAACGATT TAGAAGTAAC CAAAGAACTC ACTGAATATA GGGGACACAG AATGCGTTCC AAAAAACAAA GCCTGAGAAA GCCACGGGAA CGCGGATCTA TCAAGAGTAG TAAGTCAATC GGGAAAAGTA TCTATTCACA TAGTGCCGGA CACAAAAAAA CAACAAACCT GCTATTCGGT GTAGAGTCTG AAATTGTTAA GTTATGGGCG CAGGTGCGGC GGGTTGAAAA TGGGGAAGAT AGACGCGAAC TTCACCAGAG CAGGCTTTTC 
AGCATGGGGC CTATACGTTT GTATAATCTG TTCGCTAATC TGATATTAAG GAACAGGGAT AGGCTTACTG ATTTGTCCGC GGAACAGATT GCCGGATTCA TTATAGAGCT GTATATCAAT TTCGTGGAAA ACAGAATACC CAGCAGTGGA AATCCGTTTC ATGATGAGAT CGAAGTGATC AGTGGTTCTG ATTTTGGTGC GACAGTTGAA AAGATCCTAC AGGGAGCGGG TACGCTCAAA CCTGATAAAG CGATTGTCAG AATGTTGGTG GGGGCTTTGA ATGGCAATAA AGAGGCTTTA GGATTGATAA GTACTATAGA TAAAGTACAA CTTAAAATTG CTCTGATTTA TATACAGCGG AAAGCCGGAA GCGTTACTGA CAAGTTACTG CATATGGCGG TCAGTTCGAA CGATCCTGCA TTGGTTGCAG CGATGTTATC AATTGGAGTG AGTTTTGACG CTGAAGATAA TGATGGCAGG ACTCCTTTGG CACTTGCTCT TGAATTGGAT AAGAAAGATA GTAATAGTAA AAAGATAATT GATTTACTTG TTGGTGCTCA GGATAAGAAA AAGAAAAAGA AAAAAACACA ATTACCCAGG GGATGGAGTC ACTACGGCAG CTCCGATATT GGTGAAAATT GGTTAACCGA TGAAGAAGTG GATACCGGAC TTAGTGCCGC GAATCTTCCC AACACGCACG TAGCCCCCTC AGTAGATATC GCCAATAACC CTGATTTCTT AGCTGAATTT ATCCGGGATA ACTATCTTGA CCAGATGGGC CAGGGTGCTA TCGCAATGAA TACGGTGATT CCCATTAATT TTAATAATGC GCATTGGGCG ACATTGGTGA TCAGGCAAAA CCAGGATCGC AGATCCGCAC CTAAGGTATA TTTTTTTGAC TCAATGGGAG AAGATGAAGA TAAAATAGCA CTGATTAAGC TGATGCTTGA GATGACCGGT GTTTATACCA GCGTTGATAA TATTGTTGAC CTGTCTGAGC ACATGCAGCG AGACGGCTAC ACATGTGGCA CATGGATGAT TGAATCTGCG AGCAAGGTCG TGAACATTTT GGAAAACGGC GGAAGTGTAG CGGAGATAAA AGATGCACTG CATGTAATAA GCGCTGTCGT TAAAGAATTA CACAGACAAA ACTTACAATT GGCAAAACCT GGCCGTGAGG AAGTATCAGG GTCTGAAGAG GGTATGGAAT TGGAAAAGAA AGGGTATACC TATCATACTT TTGGAAATGG TATTCAGGAA TATAGTGAAC GTTGGATAGC CCATAGCTAT AATGAATCTT TCGGATTTCC TACTCAACTG TCGGAATTGT CACAGATGTT GATGGTAAAT TATGAGGATG TAGATAGGAA AATGCAGATA ATCTACCGGG TATATATTTG TAGAATATAT GGTGAGTTGA ATAAAACAAA AAAGCTGATG AGGGCGTATA ATGAAGAAAT AAAAGAAATC AGGAAATTTT TAAAGGAGAA GAAGTGA
|
Protein sequence | MLTNKNPQHV NTTTSNATSA TSYSGISMPA VLPYQKTAIE QHEKNESPLK MSENPDMGAI APFQLKTDRI DTTSSQQNHP SQEAWHVVQL KQNKIGTVPV VQLKEPWDSL AKEKWNSTIE NIETYAHHIG IYDIEESDII DYIRNIDNIE PPNGQSFKEL DYGSYCNELD NVDARKAVKY AVEVHRHREW LNFEVEDSES NDDEERYYSS EEDETPGNEV EFNLIIAVII ALNKLLSDYI SDLNPDKKFD KIAINVIDNI RKHKQTSAYK RTKKFPNSFN KVTIGGMEDW LSFANEFRYA KSGRSKEKPT VYNIEFGLWN KAKHKEIQFR GRQLKDAEGN PGRSWAAIAL SLHNLAAALG AHLLKASDKK MQKGKGNQIL AAAMLAILSG RGNEVYVVLQ QYKVDHKVIQ NFESLVLEFF ALTMGIENIR LEFTQLDTIF HLNNIARGAT YGQYNKKYNF LNVFSSVRET PNGDMVHVYD RARYLWKRKR DDDKYSEMND IEIAEEIEDV ENDLEVTKEL TEYRGHRMRS KKQSLRKPRE RGSIKSSKSI GKSIYSHSAG HKKTTNLLFG VESEIVKLWA QVRRVENGED RRELHQSRLF SMGPIRLYNL FANLILRNRD RLTDLSAEQI AGFIIELYIN FVENRIPSSG NPFHDEIEVI SGSDFGATVE KILQGAGTLK PDKAIVRMLV GALNGNKEAL GLISTIDKVQ LKIALIYIQR KAGSVTDKLL HMAVSSNDPA LVAAMLSIGV SFDAEDNDGR TPLALALELD KKDSNSKKII DLLVGAQDKK KKKKKTQLPR GWSHYGSSDI GENWLTDEEV DTGLSAANLP NTHVAPSVDI ANNPDFLAEF IRDNYLDQMG QGAIAMNTVI PINFNNAHWA TLVIRQNQDR RSAPKVYFFD SMGEDEDKIA LIKLMLEMTG VYTSVDNIVD LSEHMQRDGY TCGTWMIESA SKVVNILENG GSVAEIKDAL HVISAVVKEL HRQNLQLAKP GREEVSGSEE GMELEKKGYT YHTFGNGIQE YSERWIAHSY NESFGFPTQL SELSQMLMVN YEDVDRKMQI IYRVYICRIY GELNKTKKLM RAYNEEIKEI RKFLKEKK
|
| |
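
The gene sequence above is printed in space-separated groups of 10 bases, so it needs light cleaning before any computation. As a minimal sketch, the hypothetical helper `gc_fraction` below (not part of the source record) strips whitespace and returns the G+C fraction. Run on the full gene sequence, the result can be compared against the GC content field.

```python
def gc_fraction(grouped_sequence: str) -> float:
    """Return the G+C fraction of a DNA sequence, ignoring whitespace."""
    # Drop the spaces and newlines that separate the 10-base groups.
    sequence = "".join(grouped_sequence.split()).upper()
    if not sequence:
        raise ValueError("empty sequence")
    gc_count = sequence.count("G") + sequence.count("C")
    return gc_count / len(sequence)


# First two 10-base groups of the gene sequence, used only as a small demo.
demo = "ATGCTAACTA ATAAAAACCC"
print(f"{gc_fraction(demo):.2f}")
```

The demo covers only the first 20 bases, so its value is not expected to match the GC content of the whole gene.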