Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Vapar_0203 |
Symbol | |
ID | 7971412 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Variovorax paradoxus S110 |
Kingdom | Bacteria |
Replicon accession | NC_012791 |
Strand | + |
Start bp | 213347 |
End bp | 216007 |
Gene Length | 2661 bp |
Protein Length | 886 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644790806 |
Product | pentapeptide repeat protein |
Protein accession | YP_002942132 |
Protein GI | 239813222 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.713128 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAACCA TCAAGCCCCT GCGCCTGAGC ATCCTGACGC GCCCCTACCT GAGGAACGGC GCCCAGTATC TGGCGATGAC CGCCATCGCA ATGACAAACC TGAAGGGTCA GCAGCAGCAG CTTGTGCCGG AACCCGAGAT CTGGAAGACC CTGACCGATG AACTGGGCGG CGACACGGTG TTTGATCTCG GCATGCCCAA GGACAATGCC GAATTCATCG TATCGGGCTT TGCGCACACG GCGCACCAGC CGGACAAGTC TCGCTGTGCA GTGAGCGTGC AAGTCGGGGC GCGCAAGAAA ACGCTCCTGG TGTTCGGCGA CCGATTCTGG ATCGACGGCC GGGCGACCGC GCCCCAGCCA TTCGATGCGA TCCGCCTTGA CTGGCGCAAG GCCTTCGGGG GCCCGGGATT CGCAGAAAAC CCCCTGGGCA TCGGACATGA CGATGACATC GTCGAGGGGT TGAAAGTGCG GCGCGTGCCC AACATCGAAC ACGCGAACGG ACTGCTGTGG CGGCCGGGCC AGGTGCCTGC ACCCGCAGGC TTCGGCCCGA TCGATGTGGC AAGGCCTTCG CGGATGCGCA AGATGGGGCG CGACTACGAC GAGCATTGGA AGCAGAACCT CTATCCCGGT TTCGCGAGGG ACATGGACTG GGGCTACTTC AATGCAGCCT CCGAAGACCA GTGGCTGGCG CTGGGCGAAC GCGGCCTGGC CGGTGCCAGC TACGAGATCC TGAACATGCA TCCGGAGGAT CCGGTGCAAC GCGGCCGCCT GCCCGACTGG CAGGCGCGAG GGTTCATCGT CCGTGAAGCA AAGAACCGGC AACTGGCGTC GGGCATTTTT GACGAAGTCT CGCTGCGGCT CACGACGGCC TGGTTTTTTC CCCATCGAGA GCAGGTCGCG CTGGTGTACC ACGGCAGCAT CGGCATCGAG GAAGACGACG CATCCGACAT CAGCCACGTC ATGGCTGCCA TGGAGGAAGG TGGGTCGGAA CGGCCTTTGG CTTCGTACCG CGATGTATTG ATTCAGCGCT GCGATCCCGA GAACGGCGCA CTCTACGCAT TGCGCGACGA TCAATTGCTG CCGCCGAACT CGATTGGGCA GTGGCTCGAG AACAGCCAGG AAGAAGACAG GGACGCCGAC CCGCTCACCC GCAACATGAA AGTTCGCGCC GAACGTGCCC AAGCCAAACT GGCCGAAGCA TTCAAGGCGA GCGGCGGAGA TCCGGCCAAG TTCGCCCTGC CGCCCCCGCC GATCACCACC CCGCCCAAGC TGCATGAGAT CCCGGCCTAT GTGCAGAAGA TGAAAGCCAT GGTGCAGGAG CAGCGGGAGA AGCTGGCGAA GGGGCGGGAG GCCATCGGCA AGGCCGCCGA TGCCAATGCG GTCCAATCGA GAAAGCTTGG CTTCGATACC TCGACCTTTC TCGCGAAAGC AGAAGCGGCC AAGGGCAAGG GGCCTCCGCG CTTCGACCCC CGGCCTTTCG TAGACGGAAT CTCGGGTGTT TCTGCGGCGG TAGGCGCGCC GCCCGCGCCC CCGGAGAGCC TCGCGGAACT CAGGAGGGTG ACGGCCGAGG CAAAAAGAGG ATTGCTCGCG AGCTATCGCA GCATGGCGCA GTACCAGGAC GCGGCCGACG CCATGCCGCC CGACGAGTCC GACAGGGCGC GGGCCGAAAT CGAGCGCGTG CTCGCGGGTT CGCGCGATTT CTCGGACATG GATCTGACCG GTGCCGACCT GTCGAACATG GACCTTCACG GCACCCGATG GCATCGAGCA CTCCTCGAGG GCGTGGACTT CAGCAATAGC AGGCTCGACG ATGCCGACTT CAGCGAAGCG GTGCTGGCGA GAGCGCGCCT GCATCGCACG TCGATGCGCC GTGCGATTTT CGATCGTGCC AACATGGCAT TGGCACGATG CGAGGATGCG GATTTCTCGG GTGCGCGCTT CATGAAGATG GTGCTCGACA AGATGGTTGC CAAGGGATGC AATTTTTCTG ATGCGACCAT GGAGGAGCTG AATTTCATGA CCGTGCACCT GGAGAGGTGC AGCTTCGAAC GTGCCGCCAT CTCCTATGCC CATGTGCTGG AGAAGTCCGT GATGCATGGC GTGCGCTTCG ACCACGCACG GATTCACAAG ATGTCGTGGA TCGATTGCGA TGTCAACGAA CTGAGCTTCG CGCACGCCGA GCTCGATTCA TGCGGATGGG TCAACACCGA TGGCGCAGGC CGTCTCGATT TCTCGGCCGC ACACTTGACG ACCACCTGCT TCGTGGGAGA GAGCAGCTTG AGGAAGGTCT GCTTTCGAGG CGCCATGCTG CGCGACTGCA GCCTGCGTGG TATTGCATTG GACGAAGCGG ACTTTGCGAA TGCGCGGATC GAGAACAGCG ACTTCTCCGG GGCATCGATG CATCGCACGA ACCTGGAGGG TGCCGACGCC AAGGGTGCGA TGTTCGTGCG TTCGGACCTG ACCCGGGCGT CGCTGCTGGA TGCCGATCTC AGCGGCGCCA TCCTGCAGAA GGCGGTGCTC GTGTCCACGG ACCTGCGTCG TGCCAACCTG TTCAGGGCGG ACCTGTCGCA ATGTCTCATC GATGACAGAA CGCTTTTCGA CGGTGCCTAT GTCGAGCAGG TCAAGACAGT TCCCTTGCGC AAGAAAGAGA AGGTGGAATG A
|
Protein sequence | MKTIKPLRLS ILTRPYLRNG AQYLAMTAIA MTNLKGQQQQ LVPEPEIWKT LTDELGGDTV FDLGMPKDNA EFIVSGFAHT AHQPDKSRCA VSVQVGARKK TLLVFGDRFW IDGRATAPQP FDAIRLDWRK AFGGPGFAEN PLGIGHDDDI VEGLKVRRVP NIEHANGLLW RPGQVPAPAG FGPIDVARPS RMRKMGRDYD EHWKQNLYPG FARDMDWGYF NAASEDQWLA LGERGLAGAS YEILNMHPED PVQRGRLPDW QARGFIVREA KNRQLASGIF DEVSLRLTTA WFFPHREQVA LVYHGSIGIE EDDASDISHV MAAMEEGGSE RPLASYRDVL IQRCDPENGA LYALRDDQLL PPNSIGQWLE NSQEEDRDAD PLTRNMKVRA ERAQAKLAEA FKASGGDPAK FALPPPPITT PPKLHEIPAY VQKMKAMVQE QREKLAKGRE AIGKAADANA VQSRKLGFDT STFLAKAEAA KGKGPPRFDP RPFVDGISGV SAAVGAPPAP PESLAELRRV TAEAKRGLLA SYRSMAQYQD AADAMPPDES DRARAEIERV LAGSRDFSDM DLTGADLSNM DLHGTRWHRA LLEGVDFSNS RLDDADFSEA VLARARLHRT SMRRAIFDRA NMALARCEDA DFSGARFMKM VLDKMVAKGC NFSDATMEEL NFMTVHLERC SFERAAISYA HVLEKSVMHG VRFDHARIHK MSWIDCDVNE LSFAHAELDS CGWVNTDGAG RLDFSAAHLT TTCFVGESSL RKVCFRGAML RDCSLRGIAL DEADFANARI ENSDFSGASM HRTNLEGADA KGAMFVRSDL TRASLLDADL SGAILQKAVL VSTDLRRANL FRADLSQCLI DDRTLFDGAY VEQVKTVPLR KKEKVE
|
| |