Gene Veis_4172 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVeis_4172 
Symbol 
ID4694620 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVerminephrobacter eiseniae EF01-2 
KingdomBacteria 
Replicon accessionNC_008786 
Strand
Start bp4589243 
End bp4592161 
Gene Length2919 bp 
Protein Length972 aa 
Translation table11 
GC content61% 
IMG OID639851919 
ProductDNA methylase N-4/N-6 domain-containing protein 
Protein accessionYP_998895 
Protein GI121611088 
COG category[L] Replication, recombination and repair 
COG ID[COG2189] Adenine specific DNA methylase Mod 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCACCA GAAACCCCAA AGCCCCCCTC GTCGTCGACA CCCTCAAACA CGAGGGCGCC 
ACGCGCAGGA ACATCCCCAC GGCCGAGTAC CAGTCGGTCA TGGCCAAGGA CGAGCAAAGC
CCCAGGCCCG TGGCCTACCC GCGCGCCAAC ACGCAGTGGC TCAGCGACCT CGCGGCGCTG
CACGACCTGG GCAAATCGTC GGACGCATTC CAGCAGCGCC TGAACCGCGA CCTGGACCCC
CAACTGCTGT GGCGCGGCAA GGACCAGCAG GACTGGAGTG ACCTCGTCGT CAACGCCCCG
CCGCTCTACA TCCAGGAGAA GGTGCATCCC AAGGTGTTGA TTGACGACCT GCGGCGGGAG
ACGGAGCGTC AGCGCGAAGC CTCTACACGC GGCAGCGGCC AGCCGACCGA GCAGACCGAC
CTGTTCGCCG ACTTCAACGG CCTTCCCAGC GACAACGCGC GCACCGAGTT CTACCAGCAC
GATGGCCACT GGGCCAACCG CATGATCCTG GGCGACAGCC TGCAGGTAAT GGCCTCCCTC
GCCGAGCGCG AGGGCCTGCG CGGCAAGGTG CAATGCATCT ACTTCGACCC GCCCTACGGC
ATCAAATTCA ACAGCAACTT CCAGTGGAGC ACCACCAGCC GCGATGTGAA GGACGGCAAT
GCCCAGCACA TCACGCGCGA GCCGGAGCAG GTGAAGGCGT TCCGCGACAC CTGGCGCGAC
GGCATTCATT CTTACCTGAC CTATCTGCGC GACCGGCTGA CGGTCGCGCG GGATCTATTG
ACCGAGTCGG GGTCGATCTT TGTGCAGATT GGCGAAGAAA ACGTTCATCG AGTGCGTACC
GTTCTCGATG AAGTGTTTGG CGATGCGAAT TTCGTTAGTC AAATCAACTT CAAGACCACC
GGAGGTGCAG GATCTCCAAC AGGGGGAACC GAAACTCTGG CTTCGGTCAA TAACTTTATT
CTTTGGTATG CAAAGAACGG CGCGGCAATC AAGTATCGGC AGCCATATCG AGTCAAAGGC
GATCTATCTG GAGGGGCTAC GGCCTACAAC AAGCTGGATT TCTTCGACGC GAAAGAGCGT
CGTCCTGCCA CAGATGCTGA TAGAGAAACA GCACCTGACG GTTCACGCTT GTTTCGCTGG
GATAACCTGA CCAGCCAAAG TTCAGGTGGT CCACAGTTCT TTAATGTGGA ACTGGATGGG
AAAACTATTC CAGTTGGAAA AAGTGGTTGG AAAACAACCA CCACAGGCAT GGATCGCTTG
AAGAAAGCGA AGCGGGTTGG ATTGGCTGGA AAAACCCTAA GCTATGTCCG CTACATCCTG
GACTTTCCCG TTTACCCCAT GAACAACTCT TGGGATGACA CAGTGACGGC GGGTTTTGCA
TCAGACAAGT TATATGTCGT TCAAACAAAT CCGAAGGTCA TTGAGCGCTG CATCCTAATG
ACCACCGATC CCGGCGACCT CGTCCTAGAT CCCACCTGCG GTTCTGGCAC TACCGCCTAC
GCAGCCGAAC AATGGGGCCG CCGCTGGATC ACCATCGATA CTTCGCGCGT CGCACTCGCC
CTGGCCCGCG CCCGCATCAT GGGCGCGCGC TATCCCTACT ACCTGCTGGC CGACAGCCGT
GAAGGCCAGT TGAAGGAAGC CGAGGTCACG CGCAGCGCGC CCTCTACCGC GCCGGTACAT
GGCGACGTGC GCCATGGTTT CGTCTACGAG CGCGTGCCGC ACATCACGCT CAAGTCCATT
GCAAACAACG CGGAGATCGA CGTGATCTGG GAGCAGCACC AGCAGGTGCT GGAACCCTTG
CGCGCATCGC TGAACACCGC GCTCGGCAAG GCTTGGCAGG AATGGGAGAT CCCGCGCAAT
ACCGATGCCA AGTGGCCTGG GGAGGCCCAG CGCCTGCACG CCCAATGGTG GCAGCAGCGC
ATCGCGCGGC AGAAGGAGAT CGACGCTTCC ATTGCTGCCA AGGCCGAGTT CGAATACCTC
TACGACAAGC CCTACCCCGA CAAGAACAAG GTGCGCGTGG CCGGCCCCTT CACGGTGGAA
AGCCTCTCCC CCCACCGCGT GCTGGCCGTG GGCGCCGATG ACGAACTGAT CGACCCGGCC
AGCCCGCATG TGGCCGAGCG GCAGGCCGAA TACAACGCCG AGCGCAACTT CGTCCAGATC
ATCCTGGAGA ACCTCAAGAC CGCTGGCGTG CAGCAGGCGC ACAAACAGGA CAGGATCAGT
TTCACCTCGC TCACGCCCTG GCCGGGCGAA CTGGTCTGTG CCGAAGGTCG CTACGCCGAG
GGCGAGACTG AGAAACGCGC TGCCATCTTC ATCGGCCCCG AGTTCGGCAC CGTAGCCCGG
CCCGACCTGG TGGCCGCCGC GCGCGAGGCC GGCGACGCGG GCTTCGACGT GCTGATCGCC
TGCGCCTTCA ACTACGACGC GCATTCCAGC GAGTTCGACA AGCTCGGCAG AATCCCCGTG
CTCAAGGCCC GCATGAACGC CGACCTGCAC ATGGCCGACG ACCTGAAGAA CACCGGCAAG
GGCAACCTGT TCGTGATCTT CGGTGAGCCG GACATCGACA TCCTCGACGC CGGCCCTCAT
GGCGACGAGG GTCAGGTGCG GGTAAAGGTC AATGGCGTGG ACGTGTTCCA CCCCAACACC
GGCGAGGTGC GCAGCGACGG CGCCGAAGGC ATCGCCTGCT GGTTCATCGA CACCGACTAC
AACGAAGAAA GCTTCTTCGT CCGCCACGCC TACTTCCTGG GCGCGAACGA CCCCTACAAA
TCCCTCAAGA CCACGCTCAA GGCAGAAGTT GATGCTGACG CCTGGGCAAC GCTCAACAGC
GATACCTCAC GTCCCTTCGA CAAACCGAAG TCGGGACGCA TTGCCGTCAA GGTGATCAAT
CACCTGGGGG ATGAGGTGAT GAAGGTGTTC AAAATCTGA
 
Protein sequence
MATRNPKAPL VVDTLKHEGA TRRNIPTAEY QSVMAKDEQS PRPVAYPRAN TQWLSDLAAL 
HDLGKSSDAF QQRLNRDLDP QLLWRGKDQQ DWSDLVVNAP PLYIQEKVHP KVLIDDLRRE
TERQREASTR GSGQPTEQTD LFADFNGLPS DNARTEFYQH DGHWANRMIL GDSLQVMASL
AEREGLRGKV QCIYFDPPYG IKFNSNFQWS TTSRDVKDGN AQHITREPEQ VKAFRDTWRD
GIHSYLTYLR DRLTVARDLL TESGSIFVQI GEENVHRVRT VLDEVFGDAN FVSQINFKTT
GGAGSPTGGT ETLASVNNFI LWYAKNGAAI KYRQPYRVKG DLSGGATAYN KLDFFDAKER
RPATDADRET APDGSRLFRW DNLTSQSSGG PQFFNVELDG KTIPVGKSGW KTTTTGMDRL
KKAKRVGLAG KTLSYVRYIL DFPVYPMNNS WDDTVTAGFA SDKLYVVQTN PKVIERCILM
TTDPGDLVLD PTCGSGTTAY AAEQWGRRWI TIDTSRVALA LARARIMGAR YPYYLLADSR
EGQLKEAEVT RSAPSTAPVH GDVRHGFVYE RVPHITLKSI ANNAEIDVIW EQHQQVLEPL
RASLNTALGK AWQEWEIPRN TDAKWPGEAQ RLHAQWWQQR IARQKEIDAS IAAKAEFEYL
YDKPYPDKNK VRVAGPFTVE SLSPHRVLAV GADDELIDPA SPHVAERQAE YNAERNFVQI
ILENLKTAGV QQAHKQDRIS FTSLTPWPGE LVCAEGRYAE GETEKRAAIF IGPEFGTVAR
PDLVAAAREA GDAGFDVLIA CAFNYDAHSS EFDKLGRIPV LKARMNADLH MADDLKNTGK
GNLFVIFGEP DIDILDAGPH GDEGQVRVKV NGVDVFHPNT GEVRSDGAEG IACWFIDTDY
NEESFFVRHA YFLGANDPYK SLKTTLKAEV DADAWATLNS DTSRPFDKPK SGRIAVKVIN
HLGDEVMKVF KI