Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Vapar_3034 |
Symbol | |
ID | 7973754 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Variovorax paradoxus S110 |
Kingdom | Bacteria |
Replicon accession | NC_012791 |
Strand | - |
Start bp | 3192275 |
End bp | 3194500 |
Gene Length | 2226 bp |
Protein Length | 741 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 644793618 |
Product | TonB-dependent receptor |
Protein accession | YP_002944919 |
Protein GI | 239816009 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0831829 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCGGCC AGGCGCAGGA GGCCGCGCCG CAGGCGGGCA CGCTGCCGGC CGTCGAGGTG GTGGCCACCA CGCCCGTGCC GGGCATCGAG GTGCCGAAGG ACCAGATCCC GTCCAACGTG CAGACCGCCG ACGACCGGCA CCTGCGCCGC GCGCAGAGCC TGAACCTGCC GGACTTCATG GCCACGCAGC TGCCCAGCGT GAACGTGAAC GAGATCCAGG GCAATCCGTT CCAGGTCGAC GTGAACTACC GCGGCTTCAG CGCGAGCCCG GTGCTCGGAA CGCCCCAGGG CCTGTCGGTG TACCAGGACG GCGTGCGCAT CAACGAGCCC TTCGGCGACG TGGTGAACTG GGACCTCATC CCGAAGGCCG CCATCTCGAG CATCACGCTG CTGCCGGGCT CCAACCCGCT GTTCGGCCTC AACACGCTCG GCGGTGCGCT GTCGCTGCAG ACCAAGCGCG GCGACACGCA TCCGGGCACC GAGCTGGAAC TGCAGGCCGG CTCCTTCGGC CGCGTGAGCA CCGAGCTCAC GCACGGCCGC AAGCTGGCCG AAGGCGGGCA CCTGTTCCTT GCGCTCGGCG GCCTCAACGA GGACGGCTGG CGCAACTACT CGCCTTCGCG CGTGCGCCAG CTGTTTGCGA AGGTGGGGCA GGACAGCGGA AAGCTCTCCT GGGACCTGAG CTTCACCCAT GGCGACAACC GGATGATCGG CAACGGCCTG CTGCCCGAGT CGATGCTGAT GCAGAACCGC AAGCAGGTGT ACACGCGGCC CGACCGCACC GAAAACCGCA TGTCGATGCT CACGCTCAAT GCCAGCTACC GCCTGAGCGA CGTGCAGACG ATCTCCATGA CGGCTTACAC GCGGCGCTCG CGCTACAGCA CGCTCAACGG CGACCTCAAC GACGGTTTCA ATCCGCCGGA CAACGAAGCC ACGGGCGTGG AGAACCGCAC CTACACGCGC CAGCGCAGCG AAGGCGTGGC GCTGCAGTCG ACCTATACCG CGGGCATTCA CCAGCTCACT TTCGGCGCCT CCGTGGACCG TGCGCGCACG CACTTCCGCC AGACCGAGGC CGAGGGCATG CTCGACTCCA CGCGCGCGGT GGTGCCGCAG GAAGAAGCCG AAGTCGATGC GCTGCTCGCG GGCAAGAGCC GCACCGCGAG CATCTATTTT TCCGACCTGG TGAGCCTGCA GCCGAACCTG CAGCTGAGCC TTTCGGGCCG CTACAACGAC ACTCGAGTGA GCACCCGCGA CGACGGGCGC GCCTTGCTCG GGCTGTCCAC CCGGCTCGAT GGCGAAGGCC ATTACAAGAA GTTCAATCCT GCCATCGGCC TCACCTGGCA GGCTACGCCG CGGCTCACGG CCTACGCGGG CTGGAGCCAG GGCAGCCGCG CGCCGAGCCC GATCGAGCTC GGCTGCTCCG ATCCGGCCAA CGCCTGCGTG CTGCCCAATG CGCTGCAGTC CGATCCGCCG CTCAAGCAGG TGGTGTCGCA GACCTTCGAA ACCGGGCTGC GCGGCACGCT CGAGCCAGGC ATGCGGTGGA ATGCCTCGGT GTTCCGCACC GTCAACAAGG ACGACCTGCT GTTCGTGAGC AGCGGGCTTT CGCGCGGCTA CTTCAGCAAC TTCGGGCGCA CCCTGCGCCA GGGCGTGGAG CTCGGGCTCT CGCAGCAGAC CGAACGCGTC GACTGGTCGC TGTCGTACAG CTACCTGCGC GCGAGGTACG ACTCGCCGGC CTGCCTGGTG GCCGAAGCCA ACAGCAGCGC CGAGACCAGC CCCGCCTGCA CCGGCGAGGG CGAGATCGCG GTGCGCCGCG GCGACCGCCT GCCGGGACTG CCGGCGCATT CGCTCAAGCT CAATGTCGAC TGGCGCGTGA CGCCCGAGTG GACGCTGGGC GCGCAGTACC GCGTGTATTC GAAGCAGACG GTGCGCGGCA ACGAGAACGG CCTGCACGCG CCCGACGGGG CCGACTTCAG CGGCAGCGGC CGCATCGGCG GCTACGCGCT GCTCGACCTC ACGACGCGCT GGAAGCTCGG GCCCAACGTG GAGCTTTTTG CCAAGGTGGC GAACGTGTTC AACCGGCGCT ACGCCACCGC CGGCCAGCTG GGCCGCAGCG GCTTCGATGC GAGCGGCGCG GTGCTGGCGC CTGATGCATG GCGCAACGTG CAGTTCGTGG CTCCCGGCGC GCCGCGCGCG GTGTGGATCG GCATGCGGGT GCAACTGGGC GTCTGA
|
Protein sequence | MPGQAQEAAP QAGTLPAVEV VATTPVPGIE VPKDQIPSNV QTADDRHLRR AQSLNLPDFM ATQLPSVNVN EIQGNPFQVD VNYRGFSASP VLGTPQGLSV YQDGVRINEP FGDVVNWDLI PKAAISSITL LPGSNPLFGL NTLGGALSLQ TKRGDTHPGT ELELQAGSFG RVSTELTHGR KLAEGGHLFL ALGGLNEDGW RNYSPSRVRQ LFAKVGQDSG KLSWDLSFTH GDNRMIGNGL LPESMLMQNR KQVYTRPDRT ENRMSMLTLN ASYRLSDVQT ISMTAYTRRS RYSTLNGDLN DGFNPPDNEA TGVENRTYTR QRSEGVALQS TYTAGIHQLT FGASVDRART HFRQTEAEGM LDSTRAVVPQ EEAEVDALLA GKSRTASIYF SDLVSLQPNL QLSLSGRYND TRVSTRDDGR ALLGLSTRLD GEGHYKKFNP AIGLTWQATP RLTAYAGWSQ GSRAPSPIEL GCSDPANACV LPNALQSDPP LKQVVSQTFE TGLRGTLEPG MRWNASVFRT VNKDDLLFVS SGLSRGYFSN FGRTLRQGVE LGLSQQTERV DWSLSYSYLR ARYDSPACLV AEANSSAETS PACTGEGEIA VRRGDRLPGL PAHSLKLNVD WRVTPEWTLG AQYRVYSKQT VRGNENGLHA PDGADFSGSG RIGGYALLDL TTRWKLGPNV ELFAKVANVF NRRYATAGQL GRSGFDASGA VLAPDAWRNV QFVAPGAPRA VWIGMRVQLG V
|
| |