Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpin_4814 |
Symbol | |
ID | 8360990 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chitinophaga pinensis DSM 2588 |
Kingdom | Bacteria |
Replicon accession | NC_013132 |
Strand | - |
Start bp | 6007288 |
End bp | 6010365 |
Gene Length | 3078 bp |
Protein Length | 1025 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 644966964 |
Product | TonB-dependent receptor |
Protein accession | YP_003124449 |
Protein GI | 256423796 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0976743 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.000000000000101535 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGGCACTCA ATTATATTCA ACGATATATC TCTGCACTTA GCTGTCTGCT ATGCCTGCTT TGCGGCAGCG TATACGGGCA GGAGTCTGCT GGTATTAGCG GCAGACTGGT AGATGCTTAT GGTCAGCCGC TGGCAAAAGC ACTTGTTTAC GTTAAGGGAA CAGGCGACAC CACATTGACT GATGCAGCAG GTAAATTTAC ATTAGCAGCC GGGCGTAACA GCGTTCTCGT AATGCGATAT CCCGGCTATA AACAGCTGCA ACATGTTGTG AAAGAAAATA CTGTCGTGTT GCGTATGGAG GAAGTTTTTA TTCCTAACCC GGAGCACATC CCTGTATTAT ACGGAGAAGC GGATCGTCGC AGCAGTGTGG GCGCTATAGG CACGGTATAT ACCAGCCAGT TGTCAGCTAC ACCGGCTACT TTATATGCCT ATGCATTACC TGGACGTCTT GCAGGATTGT ATACCCAGCA AACAAGCGGT TTCCGTAATC CGGGTACCGG GAATAACTTT GATGTGGATC AGTTTGTAGG AAACATCCCG AGATCGGGTG CACTGGAAGC GAATGACAAC TCGGAGATCA ATCTGTGGTT GCGCGGACAA ACGCCTGTCA CCATCGTAGA TGGTGTACAA CGGGATGTAT ATTCTATAGA TCCGGAGAAT ATTGAATCTA TCTCGGTACT GAAAGATGGA CTGTCTACTA TTTTATTGGG ACAGCGTAGT TCGCGGGGCG TATTACTGGT GACCACCAAA CGTGCCCGTG CCGGTAAACC GCGTCTGTCA TTTACCGCAC AGACAGCGAT ACAGCAGTCT TTAAGTATGC CGAAGGCATT ACCCGCCTGG CAGTATGCAT ACCTGTTAAA TGAAGCTTTG CAGAATGACG GCAAAGCGCC TTTATACAAG GAAGCAGACA TCCGTGCATT CCGTGATCAT TCAGATCCTT ATGGTCACCC TGATGTGAAC TGGTATGACC AGGTATTGAG AGATAATGCG CCTTTGTCAA GATATAATCT GAACATCAAC GGCGGTGGCG ATGTGGCCCG TTATTCCGTA TCACTGAACT ATACCGGTCA ACAGGGTATC TTCAAATCAT CACCAGAGAA TAGTTATACG ACGAATGCCG GTCTGAAAAG ATACCTGATC AACACAGACG TCAATATCGA CGTCACCAGT AACCTGAATG TCGGTATGCA GTTATTCGGC CGTTTACAGG AAGGTACACA ACCAGGAGCG GGTACCGGTA AAATACTGAA TGATCTGCTT TCTACTTCCA ATGCAGCCTA TCCTATCACC AATATCAACG GCAGCTGGGG TGGTACAGGT AACCTGACCA CCAATCTGCT GTCATCTACG GTCAACAGTG GTTATATCCA GGACAATAGC AAAGACGTGA TGGCGAATGT GGACCTCCGT TATGACCTGG GTGACTGGCT GCCTGGTTTG TCGCTGAAAG GCAAGGGCAA CCTGTCGATC CAGTCCGCGA ATGCAATCAA CCGTAGTAAG CAGGATCTTG TCTATAAAAT GAACATCACA CAGGGCGATA CTACTTATGC CAGGTTTGGA CAGGCGGTGA CCCAGCACAA TGACTTTATC TCTGTTTTCA ATGCACAGTA TTTCTTCGGA CAGGTCTCTC TCAATTATGA TCGTCAGTTT GGTGCGCATG GACTGAGTGC CATTGTACTG GCGGACAGAA GACAGACGAT CTTCAACTAT GACCTTCCCG GTAAAGCCAC GAACCTCTCA GCGAAGGCTA CCTACAACTA TGGCAACCGA TACTTTGCGG AGGCTGCCAT CAACAGAAGC GGTTATAACC GTTATATGCC CGGCAGACAA TACGGTACTT TCTATGCAGG CGGCATCGGC TGGGATATTG CCAGGGAAGC TTTTATGCAG GACCAGGCAG GTTGGCTGAA TCAGTTGAAG CTGCGGGCCA CCTATGCGCA GACAGGCAAT GGAATTGATA ATTCGGGCTA CTATATCTGG CGTCAGGATT TCAGTGAGAA CAATGGTATA GGCGGTGGTA TTTATGAACA AGGTACTGCC CGTTCACCAG GATCAGGTTT CCAGGAAAAC GGATTGGCGA ATACAAACAT TTCATGGGAG ACAGCCCGTA AGATAGACGT AGGTGTTGAC ATCTCCCTGT TCAATAACCG TTTGCAGGTA ACGGTAGATT ATTACCAGGA CCGTTACGCG AATCTTTTAC AGATAAGAGG TAAGAGTATT GTTTTATCCG GAGCTGCCTA TCCACCGGAG AACATCGGCA TCAATCTTTA CAAAGGAGGA GAGCTGACGG TGACCTGGCA GGCTGATATC AACGACTTTC ATTATTTCAT CACTGCGAAT GGCAGCATGG AGCAGAGCAG GGTGATTTTC ATGGATGAAC AACGCCGCGA TTATGAATGG AATAAACGAA CAGGACAGCC GGTAGGTATG CGTTTCGGAT ATATTGCAGA TGGCTTTTTG CAGACAGCCG AAGAAGCTGC CGCTGCGCCT GTCATTACCG GTTACCAGCC ATTACCCGGC GATATCAAAT ATAAAGACCT GAATGAAGAC GGTGCGATCA ATCAGTTTGA TGAAGCGCCT ATTGGTAAAT CCAGTCCACG TATTTACTAT GGCATCAGCG CCGGTATTTC CTGGAAGGGA CTGGAATTAA GTGCCTTGTT GCAGGGCGTT AGTAACCGCA CGAACTACGT AGCCAATTTT GCTACAGAGT TAGGGTTCCA GTTCCTCAAC TTCACTTACG GACAGGCTTA CGAACAGATC ACTGGCCGCT GGACACCTGA AACAGCCGGT ACTGCCACCT ATCCTCGTTT GTCGGCAGAT GCAAACTACA ATTACAATAA AGCCAGTTCT ACTTTCTGGG TGCGTAACGG CAATTACTTC CGTCTGAAGA ATGTGAGCAT TGCCTATAAC CTGCCTTATG TATGGGTGCA GCGACTGAAA CTGGGCGGGG TGAAAATCTT TGCCAATGGG TTAAACCTGT TCACACATGC GCCTTATGAT TTTGTAGATC CAGAGGTAGG AGTAGGCGCC TATCCGATTC AACGGGTATT GAATACAGGT CTGAATATTA AGTTCTAG
|
Protein sequence | MALNYIQRYI SALSCLLCLL CGSVYGQESA GISGRLVDAY GQPLAKALVY VKGTGDTTLT DAAGKFTLAA GRNSVLVMRY PGYKQLQHVV KENTVVLRME EVFIPNPEHI PVLYGEADRR SSVGAIGTVY TSQLSATPAT LYAYALPGRL AGLYTQQTSG FRNPGTGNNF DVDQFVGNIP RSGALEANDN SEINLWLRGQ TPVTIVDGVQ RDVYSIDPEN IESISVLKDG LSTILLGQRS SRGVLLVTTK RARAGKPRLS FTAQTAIQQS LSMPKALPAW QYAYLLNEAL QNDGKAPLYK EADIRAFRDH SDPYGHPDVN WYDQVLRDNA PLSRYNLNIN GGGDVARYSV SLNYTGQQGI FKSSPENSYT TNAGLKRYLI NTDVNIDVTS NLNVGMQLFG RLQEGTQPGA GTGKILNDLL STSNAAYPIT NINGSWGGTG NLTTNLLSST VNSGYIQDNS KDVMANVDLR YDLGDWLPGL SLKGKGNLSI QSANAINRSK QDLVYKMNIT QGDTTYARFG QAVTQHNDFI SVFNAQYFFG QVSLNYDRQF GAHGLSAIVL ADRRQTIFNY DLPGKATNLS AKATYNYGNR YFAEAAINRS GYNRYMPGRQ YGTFYAGGIG WDIAREAFMQ DQAGWLNQLK LRATYAQTGN GIDNSGYYIW RQDFSENNGI GGGIYEQGTA RSPGSGFQEN GLANTNISWE TARKIDVGVD ISLFNNRLQV TVDYYQDRYA NLLQIRGKSI VLSGAAYPPE NIGINLYKGG ELTVTWQADI NDFHYFITAN GSMEQSRVIF MDEQRRDYEW NKRTGQPVGM RFGYIADGFL QTAEEAAAAP VITGYQPLPG DIKYKDLNED GAINQFDEAP IGKSSPRIYY GISAGISWKG LELSALLQGV SNRTNYVANF ATELGFQFLN FTYGQAYEQI TGRWTPETAG TATYPRLSAD ANYNYNKASS TFWVRNGNYF RLKNVSIAYN LPYVWVQRLK LGGVKIFANG LNLFTHAPYD FVDPEVGVGA YPIQRVLNTG LNIKF
|
| |