Gene Veis_4871 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVeis_4871 
Symbol 
ID4694617 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVerminephrobacter eiseniae EF01-2 
KingdomBacteria 
Replicon accessionNC_008786 
Strand
Start bp5384537 
End bp5387416 
Gene Length2880 bp 
Protein Length959 aa 
Translation table11 
GC content69% 
IMG OID639852609 
Productputative transmembrane protein 
Protein accessionYP_999579 
Protein GI121611772 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3170] Tfp pilus assembly protein FimV 
TIGRFAM ID[TIGR03504] FimV C-terminal domain
[TIGR03505] FimV N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.763378 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.552077 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAGAAA AAGCACCGCT CGTCCGAGCC GTAACAAAAC GTAACAAATC CTCGGCAAAC 
ATGCGTCGTT TGAAATTCTC TGTCTTGGTG GCCGCAGTTT GCGCCGGTTT TTACTCCCTC
GATGCCTCCG CCCTGGCGCT GGGCGCCATC ACCGTGCGAT CGGCCTTGGG CGAGCCATTG
CGCGCCGAGA TCGACCTGCC GCAGATCACC CCGGTCGAAG CCGATGAGTT GCGCGCAACG
ATCGCCAGCC CCGGAGTGTT CCGCGACCAA GGCATGGAAT ACACCTCGGC GGTGAACGAT
CTGCAGATAC AGTTGCAGCG CCGGCCCGAT GGCACGGCCA TGCTGCGCCT GTCGGGTGAG
CGCCCGGTGA ACGAGCCTTT CGTCGATCTG GTGCTCGACG CCAATTGGGG GTCCGGCCAC
ATCGTGCGCA GCTACACCAT GCTGCTCGAC CCGCCCAACC TGCGCCGGGC TGCGCCTGCC
GTCGCGGCGG CGCCCCAGGT TGCGGCCTCG GCGAAAGCGC CGGCCGACGG GGCCCCGTCC
GACACAGCGG CTGGCGCTGC TGCGGCCACC CGCCCCGTCC CGCGCCAGGC GCCGGTGGGC
GGTGCGGCCA TGGCACCGGC CGACGGCGTG ACCGCGCAGG TCGGCGATAC CGCCAGCCGC
ATCGCCAGTG CCAACAAGCC CCAGGGCGTC TCGCTGGACC AGATGCTGGT CGCGCTGCTG
CACGCCAATC CCGAGGCATT CGTCCAGAGC AACGTCAATC GCCTGAAGGC GGGCGCCATT
TTGCGCATAC CCGACCAGGC CACGGCGCAG TCGATTCCGG CCGCCCAGGC ACGCCGGATG
CTGGCCGCAC AAGCGCGCGA CTTCAGCGAG TTCCGGCGCA GGCTCGCCGG CGCAGTGCCG
ATGGCCAAGG TCGCCGCCGC GCAGCGCTCG GCTTCGGGCA GCGTCAGCGC CCAGGTCGAA
GACAACAGGC CTGCGACTGC GGCCCCGGAC AGGCTGACGC TGTCCAAAGG ATCGGTCAAG
GGCTACAAGG CCGCTACCGC CGAGGAGCAA CTGGCCAAGG ACAAGCAGGC CAGCGAAACC
GCCACGCGGA TGGCCGAACT GTCCAAGAAC ATCGCTGAAC TGAGCAATTT GGGCACAGGC
AGCGCCAAAA GCAACGCTGC GGCGCAAACC ACAGCCCCCG CTGCAGCGCA GCCCGCAGCC
CCTGCCGCAA CGGCGCAAAT CGCAGTCCCT GCGGCCGTGA CCATACCAAC CCCCGCAGCA
ATACCCGCGC CCCCTGCAGC GCCGCCGGCC CCGGAGGCTT CGCCCGGCGC GGCAAGCGCC
GACGCTCCCC CTGCGGAGGT CGCTGCGCAA GCTCCGGCAG CGTCTGCGCC GGCCCCTGCC
GCCGCCAAAC CCAAGCCGGT GCCCCCGCCG ATGCCGGTCG AAGAGCCCGA TTTCCTGTCG
ACCCTGATGG CCGAGCCGTT GCTGCCCATT GGCGGCGGCG TAGTGCTTGT CGGCCTGCTC
GGTTTTGGCG CGTACCGCGT CTGGCAGCGG CGCCGCCAAA GCAGCGACGC GAGCAATGTC
TCCCGCGAGG AAAGCCGTAT TCCGCCCGAC TCCTTTTTTG GTTCGAGCGG CGGCCAGCGT
GTGGACACGA TCCATAGCGA TGCGGGCAAA GGCAGTTCGA TGATGGCCCC CTACTCGCCC
AGTCAACTCG ACGCTGGCGG CGATGTAGAC CCCGTGGCCG AGGCCGAGGT GTATCTGGCC
TATGGCCGCA ACCTGCAGGC CGAAGAGATT CTGAAAGAAG CCGTGCGTCA CAACCCCGGC
CGCATCTCCG CCCAACTGAA GCTCGGTGAA ATCTATGCCA AGCGCAAGGA CCACAAGGCG
CTGGAGGCGG TCGCCACGCA GGTCTTCAAG CTCACCCAGG GCGAGGGGCT CGACTGGTCC
CGCATCGTCG AGCTCGGCCG CGACCTGGAT CCGGACAACC GTCTGTACCA ACCCGGCGGC
CGTCAGGGCA TCGGTGACGC ACCTGCGTCC CCGGACGCCG GCGCAGGCCG CCCGACGGTA
ATGTCCGACC TGGACCTCGA TCTGGACTTG ACCCTGCTCG ATGCGGCTGT GCCCGAGGCG
CCAACGGCGC CGGCACCCCA AGCCCAAGCG CCAACGGCGC CGGCACCCCA AGCCCAAGCG
CCAACGGCGC CGGCACCCCA AGCCCAAGCG CCAATGGCGT CGGTGCCCAA AGCCCAGGCG
CCAACGGCGC CGGTGCCCAA AGTCCAAGCG CCAGCGGCGC CGGCGCCCAA AGGATTGGCG
GCTGAGCCAG CCGTCGCGCC GGTCGTTGCC GCCGCTGTCC CGGCTACGGT CGGCAGCGAC
CGCGGTCCAG CCAATACCAG TGCCAAGAAT GACAAGAATG CCGGTGTGCC CGAAACGCTG
CACCCCGGGT TGGAAATGCC GGCTGCTTCG CCCGCCCGAT CACCTGCGCC GGCCCCGGCC
CCCACCACTG CATCCTTGGT GGCACCACCG GTCGCCCCGC CGGCGCAGGA TGTGGTGTAC
AACTCCCCGG GCACGACGCT GCGTGCTCCG ATGGACACGG TGCCCATGCC ACTGGCCAGG
CCGGGCTTGC CGCGCGCAGA ACCGGGCCTG GCGGTAAGCA CCTCCCCGAT GGAATTCGAT
ATGAGCGATT TGTCACTGGA TCTGGATGTG CCGAGCAAAC TGGCGGCGGC GCCAGAGTCC
GCCGCCAATG GTCTGGCTGC GGCGGCAGAC GACCCGTTGG CCACCAAACT GGCATTGGCG
CAAGAATTCG ATGCCATTGG CGATGCCGAT GGCGCCCGCA CGCTGATCGA AGAGGTGATG
GCCGAGGCCA GTGGCGCGCT CAAAGCCCAG GCCCAGCGCA TGCTGGCCAA CTTGGGTTGA
 
Protein sequence
MREKAPLVRA VTKRNKSSAN MRRLKFSVLV AAVCAGFYSL DASALALGAI TVRSALGEPL 
RAEIDLPQIT PVEADELRAT IASPGVFRDQ GMEYTSAVND LQIQLQRRPD GTAMLRLSGE
RPVNEPFVDL VLDANWGSGH IVRSYTMLLD PPNLRRAAPA VAAAPQVAAS AKAPADGAPS
DTAAGAAAAT RPVPRQAPVG GAAMAPADGV TAQVGDTASR IASANKPQGV SLDQMLVALL
HANPEAFVQS NVNRLKAGAI LRIPDQATAQ SIPAAQARRM LAAQARDFSE FRRRLAGAVP
MAKVAAAQRS ASGSVSAQVE DNRPATAAPD RLTLSKGSVK GYKAATAEEQ LAKDKQASET
ATRMAELSKN IAELSNLGTG SAKSNAAAQT TAPAAAQPAA PAATAQIAVP AAVTIPTPAA
IPAPPAAPPA PEASPGAASA DAPPAEVAAQ APAASAPAPA AAKPKPVPPP MPVEEPDFLS
TLMAEPLLPI GGGVVLVGLL GFGAYRVWQR RRQSSDASNV SREESRIPPD SFFGSSGGQR
VDTIHSDAGK GSSMMAPYSP SQLDAGGDVD PVAEAEVYLA YGRNLQAEEI LKEAVRHNPG
RISAQLKLGE IYAKRKDHKA LEAVATQVFK LTQGEGLDWS RIVELGRDLD PDNRLYQPGG
RQGIGDAPAS PDAGAGRPTV MSDLDLDLDL TLLDAAVPEA PTAPAPQAQA PTAPAPQAQA
PTAPAPQAQA PMASVPKAQA PTAPVPKVQA PAAPAPKGLA AEPAVAPVVA AAVPATVGSD
RGPANTSAKN DKNAGVPETL HPGLEMPAAS PARSPAPAPA PTTASLVAPP VAPPAQDVVY
NSPGTTLRAP MDTVPMPLAR PGLPRAEPGL AVSTSPMEFD MSDLSLDLDV PSKLAAAPES
AANGLAAAAD DPLATKLALA QEFDAIGDAD GARTLIEEVM AEASGALKAQ AQRMLANLG