Gene Veis_2953 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVeis_2953 
Symbol 
ID4690448 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVerminephrobacter eiseniae EF01-2 
KingdomBacteria 
Replicon accessionNC_008786 
Strand
Start bp3301777 
End bp3305160 
Gene Length3384 bp 
Protein Length1127 aa 
Translation table11 
GC content54% 
IMG OID639850713 
ProductYD repeat-containing protein 
Protein accessionYP_997706 
Protein GI121609899 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3209] Rhs family protein 
TIGRFAM ID[TIGR01643] YD repeat (two copies) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000628016 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGAAGA AAATCATCCG GCGTTTTTGG TTGCTCCTGC TCGCTGGCGC CGGGGCCGCC 
AGTGCGCAGG AAGTCATACC GGATTTCTAC AAAGAGCCCG GCCTCCAGTC GAATCGCGAT
TTCGTCAATC AGAGCCATCA CGAGAATATC GATCCATTCA CCGGATCTTT GCAGCGCCAC
TATGTGGATA TCCGCATTCC GGGCAATGGC GGGTTTGATC TTCAAATCGT TCGATCCTAC
AACAGCAGCA ATGTCGACCC CGCCAATGCA ACTGCGCCAA AGACAACCGC GGGCTGGGGA
TGGACGATCC ATTTTGGCCG CATCCTGAGA GCGCGTGAAA GCCTGCCTTG CGTCAACATG
AACCGGCTCA CCATTGCTGA CAATCCGGTT CTGGAGCTTC CGGACGGCAG TCGACAACTG
CTGACATTCA CCGGACAGGT CTCTCCGATG GCGCTGACGG CGCAGCGCTG GAAAGCAGAA
TGCCCTCCAG GAGGCAACGG TCTGATCATC ACCTCTCCCG ATGGCGTGAT TTATGAAATG
ACACAGGCCC TTGGTGCCAG TTCCAGAATC TCGCCTGTTT ATGAGTGGCA TACCAAAAAG
ATCACGGACC GCAATGGCAA CTATATTTCC ATTGATTATG CAACCACAGC CAGCCCCGAG
ATTCGCAGTA TCGCGACCAA CGATGGGCGC TCGGTATCAT TCAGCTATCA TGATTCGGGA
ACTGCATCCG CAAGAATTCG CAGCATCACC AGTGCCGGGC AAACCTGGCA ATACAATTAT
CAGTCCTCCG GAATTACCGG GGTTCATCAA CTGGCCAGTG TCACGAGACC GGACGGCCAT
ACATGGGCAT ATACCTATAA TGGCAACAAT GGTTTTGCAG CAGGCAGCTA TCTCATGCGG
TTGGCAAAGA CCCCCTACAA CGGAACCATC TACTACGAAT ATGACCACGT ATATTTTGAC
TCCCAATCAA ACCCTGGCAG CCGGTCCAAT GTCGTCCGCA GCAAATCATT GAGCACAGGC
GGAAGCTGGA CTTTCAGCTA TTCACCGGGT GGCGCAGGAT CGCTCGATAC CACCACCGTG
ACCAACGCCG CAGGAACGAC AACCTACAAG CATATCGGGC CAATGTATGC AGGCCCGGGC
AATGTCTGGA AGGTCGGATT GCTGGCAGAA AAAAGGATTG GAAATACACA GATAGAGCAG
TACCAATGGC AGGGACAGCA AATCTCATCG GAAACCTACG TTCGCCCGGG TGCATTTGTC
ACCAAGTTCG ACTACCTGCA AACCAACGCC CCGCTTTTGA CGCAGAAGAC GATCATCCGC
GATGGGGGAA CATACCGCAC CGATTACAGC AATTTTGACA GCCATGGCAA TCCACGCTCG
ATCGTGGAGT CAGGTCCGAA CCGGGGGAAC AGAACGACGA CCGTGAGCTA CTACATCAAC
ACCGCAAAGT GGATCATCCG GCAGGTACAG AACGAGAGCT CCTACGGAAA TTCAGTCACC
CGATCTTTCG ACAGCAGAGG CAATCTGATC ACGGAAACCA GGAATGGCAT CACCGTCAGC
CGCAGCCATG ACACGCAAGG AAATGTGAAC AGCATCACCT TCCCCAGGTC GTTGACCTAT
CGCTATTCAA ATCACATGCG TGGCATTCCA CAAAACGAAA GCCAGCCCGA GGGTATCTAC
GTCTATCGGC AAGTCAGCGA TGCAGGGAAT GTCGTCTCTG AAACCGATGG CAGTGGTCAC
ACGACCCGCT ACGACTACGA TGGCTTGAAT CGAATCACGT CAATCGATCT TCCCGTCGGG
AACGGGGTGA GCATCAGCTA CGGAACCAAT ACCCGCACCG TCACACGCGG ATCACTTGTC
GAGACCAGTA CCTACGATGG ATCAGGTTAC GTCACCCATG TGAATCACGG TGGAATTTCA
ACCTCATACA CGGTCGATGC GCTCGGACGA ATCACGTTCG TGTCTGACCC CAACAATGAC
TCCGGGGGTA CCTATTCTTA CGACGGTCTG CACCGGGTGA TCCAAATCCG GAATGCCGAT
GGCTCCGCCA AAACCATCAG CCATGACGCT GGCTCCAAAA CCATCCGGGA TGAACGCGGC
AACGCGACGA CCTACAACTA TCGGGCTTAT GGCGATCCCG ACAAGGTATT TTTGCTGGAA
GTGAACACGC CCGAGAGCAG CGCCAGTGTC CGCTACAGTC GCAACAGCAA GGATTTGATC
ACTTCTGCAG TACAAAATGG AGTGACCAGA AATTACACAT ACAACGATGC CGGCCATTTG
ATCTCTGCTT TCCATCCCGA GTCGGGAACT ACCAGCTATG GCCGTGACGC GGCAGGAAAC
ATGACATCAA AAAGCACGGG ATCTGCCGGA ACCATCCGGT ATGCATATGA CCAACAGAAC
CGGTTGCAAA ATATTGTTTA TCCCGATTCC ACACCTTCCG TTGCTCACTC ATATGATGGT
GCGCATCGGC TGTTGAGCAC CACCTCTTCG GATGCGGCAC GTTACTTTAC CTATGATGGC
AATGGCAACC TCAAGAGCGA GCAGCTTTCT TCGGACGGAA TATCCTGGAC TGCACGATAT
GCATACAACG GAAACGACCA ATTGACATCC ATCGTATATC CCCACTCTGG CGATACGGTC
TGGTATGGGG TCGACAACCT CGGGCGCCCT ACTTCGATCA CAGGGTATGT CGATGCCGTC
AGTTATTGGC CATCCGGACA GATTCGGAGA ATATCGTATG CGAACGGTAC CCAATCTGAA
TATGGACAAA ACTCCCGGTT GTGGCCCAGC TCATTCATCA CCTCGCACGC TGGCAGCTAC
TATGGACGCT CCATTTTCAG CTACGATGCG GTGGGCAACC TGACTGCGAT CAATGATTCC
ACAGATGCCC AGTACAACCG CACATTCGGC TTTGACAGCA TGGACCGACT GACCAGCGTC
ACAACTCCGC AGGGAACGGG GTATATATAT TACGACGGAG TCGGAAACAT CACGGCCCAG
AATCTCCCCG GAACACGACT GACCTACCAT TACGATGGCA ATAATCGATT GAGCGGTTTG
AGCGCATCGC CGGCGACACT GTCAACCGCC TATGCCTACG ATGCGCTGGG CAACATCACC
AGCGGCGGAG GACGGACGTT CACCTATGAT GCAGTGCCCA ATCTCAGATG CGTCAATTGT
GCAAACCCTG CGAACAGTGT GCAGTACATG TACGACGGAA ATCAGAAGCG GACGAGTGTC
ACTCAAGGCG GTATCAAGAC CCATGAGTTC CATGGGGTGC ATGGAAACCT GCTTGCCGAA
TACACCCCGG GATTCCCTGG AAAATTGATT CAGTACATCT ATCTGAACGG CAAGCGTATT
GCCCAGAAGG AGTCCGCACA ATGA
 
Protein sequence
MLKKIIRRFW LLLLAGAGAA SAQEVIPDFY KEPGLQSNRD FVNQSHHENI DPFTGSLQRH 
YVDIRIPGNG GFDLQIVRSY NSSNVDPANA TAPKTTAGWG WTIHFGRILR ARESLPCVNM
NRLTIADNPV LELPDGSRQL LTFTGQVSPM ALTAQRWKAE CPPGGNGLII TSPDGVIYEM
TQALGASSRI SPVYEWHTKK ITDRNGNYIS IDYATTASPE IRSIATNDGR SVSFSYHDSG
TASARIRSIT SAGQTWQYNY QSSGITGVHQ LASVTRPDGH TWAYTYNGNN GFAAGSYLMR
LAKTPYNGTI YYEYDHVYFD SQSNPGSRSN VVRSKSLSTG GSWTFSYSPG GAGSLDTTTV
TNAAGTTTYK HIGPMYAGPG NVWKVGLLAE KRIGNTQIEQ YQWQGQQISS ETYVRPGAFV
TKFDYLQTNA PLLTQKTIIR DGGTYRTDYS NFDSHGNPRS IVESGPNRGN RTTTVSYYIN
TAKWIIRQVQ NESSYGNSVT RSFDSRGNLI TETRNGITVS RSHDTQGNVN SITFPRSLTY
RYSNHMRGIP QNESQPEGIY VYRQVSDAGN VVSETDGSGH TTRYDYDGLN RITSIDLPVG
NGVSISYGTN TRTVTRGSLV ETSTYDGSGY VTHVNHGGIS TSYTVDALGR ITFVSDPNND
SGGTYSYDGL HRVIQIRNAD GSAKTISHDA GSKTIRDERG NATTYNYRAY GDPDKVFLLE
VNTPESSASV RYSRNSKDLI TSAVQNGVTR NYTYNDAGHL ISAFHPESGT TSYGRDAAGN
MTSKSTGSAG TIRYAYDQQN RLQNIVYPDS TPSVAHSYDG AHRLLSTTSS DAARYFTYDG
NGNLKSEQLS SDGISWTARY AYNGNDQLTS IVYPHSGDTV WYGVDNLGRP TSITGYVDAV
SYWPSGQIRR ISYANGTQSE YGQNSRLWPS SFITSHAGSY YGRSIFSYDA VGNLTAINDS
TDAQYNRTFG FDSMDRLTSV TTPQGTGYIY YDGVGNITAQ NLPGTRLTYH YDGNNRLSGL
SASPATLSTA YAYDALGNIT SGGGRTFTYD AVPNLRCVNC ANPANSVQYM YDGNQKRTSV
TQGGIKTHEF HGVHGNLLAE YTPGFPGKLI QYIYLNGKRI AQKESAQ