Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Veis_4871 |
Symbol | |
ID | 4694617 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Verminephrobacter eiseniae EF01-2 |
Kingdom | Bacteria |
Replicon accession | NC_008786 |
Strand | - |
Start bp | 5384537 |
End bp | 5387416 |
Gene Length | 2880 bp |
Protein Length | 959 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639852609 |
Product | putative transmembrane protein |
Protein accession | YP_999579 |
Protein GI | 121611772 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3170] Tfp pilus assembly protein FimV |
TIGRFAM ID | [TIGR03504] FimV C-terminal domain [TIGR03505] FimV N-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.763378 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.552077 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAGAAA AAGCACCGCT CGTCCGAGCC GTAACAAAAC GTAACAAATC CTCGGCAAAC ATGCGTCGTT TGAAATTCTC TGTCTTGGTG GCCGCAGTTT GCGCCGGTTT TTACTCCCTC GATGCCTCCG CCCTGGCGCT GGGCGCCATC ACCGTGCGAT CGGCCTTGGG CGAGCCATTG CGCGCCGAGA TCGACCTGCC GCAGATCACC CCGGTCGAAG CCGATGAGTT GCGCGCAACG ATCGCCAGCC CCGGAGTGTT CCGCGACCAA GGCATGGAAT ACACCTCGGC GGTGAACGAT CTGCAGATAC AGTTGCAGCG CCGGCCCGAT GGCACGGCCA TGCTGCGCCT GTCGGGTGAG CGCCCGGTGA ACGAGCCTTT CGTCGATCTG GTGCTCGACG CCAATTGGGG GTCCGGCCAC ATCGTGCGCA GCTACACCAT GCTGCTCGAC CCGCCCAACC TGCGCCGGGC TGCGCCTGCC GTCGCGGCGG CGCCCCAGGT TGCGGCCTCG GCGAAAGCGC CGGCCGACGG GGCCCCGTCC GACACAGCGG CTGGCGCTGC TGCGGCCACC CGCCCCGTCC CGCGCCAGGC GCCGGTGGGC GGTGCGGCCA TGGCACCGGC CGACGGCGTG ACCGCGCAGG TCGGCGATAC CGCCAGCCGC ATCGCCAGTG CCAACAAGCC CCAGGGCGTC TCGCTGGACC AGATGCTGGT CGCGCTGCTG CACGCCAATC CCGAGGCATT CGTCCAGAGC AACGTCAATC GCCTGAAGGC GGGCGCCATT TTGCGCATAC CCGACCAGGC CACGGCGCAG TCGATTCCGG CCGCCCAGGC ACGCCGGATG CTGGCCGCAC AAGCGCGCGA CTTCAGCGAG TTCCGGCGCA GGCTCGCCGG CGCAGTGCCG ATGGCCAAGG TCGCCGCCGC GCAGCGCTCG GCTTCGGGCA GCGTCAGCGC CCAGGTCGAA GACAACAGGC CTGCGACTGC GGCCCCGGAC AGGCTGACGC TGTCCAAAGG ATCGGTCAAG GGCTACAAGG CCGCTACCGC CGAGGAGCAA CTGGCCAAGG ACAAGCAGGC CAGCGAAACC GCCACGCGGA TGGCCGAACT GTCCAAGAAC ATCGCTGAAC TGAGCAATTT GGGCACAGGC AGCGCCAAAA GCAACGCTGC GGCGCAAACC ACAGCCCCCG CTGCAGCGCA GCCCGCAGCC CCTGCCGCAA CGGCGCAAAT CGCAGTCCCT GCGGCCGTGA CCATACCAAC CCCCGCAGCA ATACCCGCGC CCCCTGCAGC GCCGCCGGCC CCGGAGGCTT CGCCCGGCGC GGCAAGCGCC GACGCTCCCC CTGCGGAGGT CGCTGCGCAA GCTCCGGCAG CGTCTGCGCC GGCCCCTGCC GCCGCCAAAC CCAAGCCGGT GCCCCCGCCG ATGCCGGTCG AAGAGCCCGA TTTCCTGTCG ACCCTGATGG CCGAGCCGTT GCTGCCCATT GGCGGCGGCG TAGTGCTTGT CGGCCTGCTC GGTTTTGGCG CGTACCGCGT CTGGCAGCGG CGCCGCCAAA GCAGCGACGC GAGCAATGTC TCCCGCGAGG AAAGCCGTAT TCCGCCCGAC TCCTTTTTTG GTTCGAGCGG CGGCCAGCGT GTGGACACGA TCCATAGCGA TGCGGGCAAA GGCAGTTCGA TGATGGCCCC CTACTCGCCC AGTCAACTCG ACGCTGGCGG CGATGTAGAC CCCGTGGCCG AGGCCGAGGT GTATCTGGCC TATGGCCGCA ACCTGCAGGC CGAAGAGATT CTGAAAGAAG CCGTGCGTCA CAACCCCGGC CGCATCTCCG CCCAACTGAA GCTCGGTGAA ATCTATGCCA AGCGCAAGGA CCACAAGGCG CTGGAGGCGG TCGCCACGCA GGTCTTCAAG CTCACCCAGG GCGAGGGGCT CGACTGGTCC CGCATCGTCG AGCTCGGCCG CGACCTGGAT CCGGACAACC GTCTGTACCA ACCCGGCGGC CGTCAGGGCA TCGGTGACGC ACCTGCGTCC CCGGACGCCG GCGCAGGCCG CCCGACGGTA ATGTCCGACC TGGACCTCGA TCTGGACTTG ACCCTGCTCG ATGCGGCTGT GCCCGAGGCG CCAACGGCGC CGGCACCCCA AGCCCAAGCG CCAACGGCGC CGGCACCCCA AGCCCAAGCG CCAACGGCGC CGGCACCCCA AGCCCAAGCG CCAATGGCGT CGGTGCCCAA AGCCCAGGCG CCAACGGCGC CGGTGCCCAA AGTCCAAGCG CCAGCGGCGC CGGCGCCCAA AGGATTGGCG GCTGAGCCAG CCGTCGCGCC GGTCGTTGCC GCCGCTGTCC CGGCTACGGT CGGCAGCGAC CGCGGTCCAG CCAATACCAG TGCCAAGAAT GACAAGAATG CCGGTGTGCC CGAAACGCTG CACCCCGGGT TGGAAATGCC GGCTGCTTCG CCCGCCCGAT CACCTGCGCC GGCCCCGGCC CCCACCACTG CATCCTTGGT GGCACCACCG GTCGCCCCGC CGGCGCAGGA TGTGGTGTAC AACTCCCCGG GCACGACGCT GCGTGCTCCG ATGGACACGG TGCCCATGCC ACTGGCCAGG CCGGGCTTGC CGCGCGCAGA ACCGGGCCTG GCGGTAAGCA CCTCCCCGAT GGAATTCGAT ATGAGCGATT TGTCACTGGA TCTGGATGTG CCGAGCAAAC TGGCGGCGGC GCCAGAGTCC GCCGCCAATG GTCTGGCTGC GGCGGCAGAC GACCCGTTGG CCACCAAACT GGCATTGGCG CAAGAATTCG ATGCCATTGG CGATGCCGAT GGCGCCCGCA CGCTGATCGA AGAGGTGATG GCCGAGGCCA GTGGCGCGCT CAAAGCCCAG GCCCAGCGCA TGCTGGCCAA CTTGGGTTGA
|
Protein sequence | MREKAPLVRA VTKRNKSSAN MRRLKFSVLV AAVCAGFYSL DASALALGAI TVRSALGEPL RAEIDLPQIT PVEADELRAT IASPGVFRDQ GMEYTSAVND LQIQLQRRPD GTAMLRLSGE RPVNEPFVDL VLDANWGSGH IVRSYTMLLD PPNLRRAAPA VAAAPQVAAS AKAPADGAPS DTAAGAAAAT RPVPRQAPVG GAAMAPADGV TAQVGDTASR IASANKPQGV SLDQMLVALL HANPEAFVQS NVNRLKAGAI LRIPDQATAQ SIPAAQARRM LAAQARDFSE FRRRLAGAVP MAKVAAAQRS ASGSVSAQVE DNRPATAAPD RLTLSKGSVK GYKAATAEEQ LAKDKQASET ATRMAELSKN IAELSNLGTG SAKSNAAAQT TAPAAAQPAA PAATAQIAVP AAVTIPTPAA IPAPPAAPPA PEASPGAASA DAPPAEVAAQ APAASAPAPA AAKPKPVPPP MPVEEPDFLS TLMAEPLLPI GGGVVLVGLL GFGAYRVWQR RRQSSDASNV SREESRIPPD SFFGSSGGQR VDTIHSDAGK GSSMMAPYSP SQLDAGGDVD PVAEAEVYLA YGRNLQAEEI LKEAVRHNPG RISAQLKLGE IYAKRKDHKA LEAVATQVFK LTQGEGLDWS RIVELGRDLD PDNRLYQPGG RQGIGDAPAS PDAGAGRPTV MSDLDLDLDL TLLDAAVPEA PTAPAPQAQA PTAPAPQAQA PTAPAPQAQA PMASVPKAQA PTAPVPKVQA PAAPAPKGLA AEPAVAPVVA AAVPATVGSD RGPANTSAKN DKNAGVPETL HPGLEMPAAS PARSPAPAPA PTTASLVAPP VAPPAQDVVY NSPGTTLRAP MDTVPMPLAR PGLPRAEPGL AVSTSPMEFD MSDLSLDLDV PSKLAAAPES AANGLAAAAD DPLATKLALA QEFDAIGDAD GARTLIEEVM AEASGALKAQ AQRMLANLG
|
| |