Gene Veis_3013 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVeis_3013 
Symbol 
ID4693804 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVerminephrobacter eiseniae EF01-2 
KingdomBacteria 
Replicon accessionNC_008786 
Strand
Start bp3360938 
End bp3364150 
Gene Length3213 bp 
Protein Length1070 aa 
Translation table11 
GC content58% 
IMG OID639850771 
ProductHsdR family type I site-specific deoxyribonuclease 
Protein accessionYP_997764 
Protein GI121609957 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.36517 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTTTT TATCTGAAGC CGCTATTGAG CTGGCACTGC TAGAGCAATT GCGGGGGCTG 
GGCTACCGCA TCGAGCAGGA AAAAGATATC GGCCCCGATG GTCCGCGTCC GGAACGCGAA
AGCCACGATG AGGTCGTGCT CAAAAAGCGC TTCGCGGATG CTGTGGCGCG GCTCAATCCT
GGCCTGCCGC TGGAGGCGCG GCAGGTTGCC ATGCGCCGGC TGATGCAGTC GGAGTTGCCA
ACGCTGCTCG AAGAAAACCG CCGCCTGCAC AAACTGATGA CCGATGGGGT GGAGGTCGAG
TACTACGCCG AGGATGGCAC GCTGACCACG GGCACGGTCG CGCTCATTGA CTTCAAGCGG
CCAGAGCAGA ACGATTGGCT GGCGGTCAGT CAATTCGTCG TGATTGCCGG GCAATATCAG
CGCCGCCCGG ATGTGGTGGT GTTTGTGAAT GGCCTGCCGC TGGTGGTGAT CGAGTTGAAG
GCGCTAGGCA ATGCGACTTT GCTCGCGGCC TTCCAGCAGT TGCAGAACTA TAAGCAGCAG
ATTGCGGCAT TGTTCAATAC CAATGCGCTG CTGGTGACTT CGGATGGTGT CCATGCCCGT
GTTGGTTCGC TGTCTGCCGA TCAGGAGCGC TTTATGCCTT GGCGCACCAC GGATGGCACG
GCGATTACGC CAAAGGGCAA ACCGGAGTTG CCAACCTTGA TCGAGGGCGT GTTTGCGCAT
CGCCGCCTGC TCGACCTGTT GCGCCACTTT ACGGTGTTTG ATGAAACGGG TTCAGGCTTG
GTCAAGATTA TTGCTGGCTA TCATCAGTTC CACGCCGTCA TGCGTGCGGT CGATTCAACC
TTGCGCGCAT CGAGCCAATG GCAGGGGGTG CAAGAAGACC CACGCGATTA CGGCTGGTCC
AGTGTCAAAA CCCAGGCCAA AGGTGACAGA AGGGCCGGCG TGATCTGGCA TACCCAAGGC
TCAGGCAAAA GCCTGTTGAT GGTGTTCTAC GCCAGGCAAC TGGCACAGCA TGTGGCCATG
GCGAATCCAA CCCTGGTCGT GCTGACCGAC CGCAACGACC TTGACGATCA ACTTTTTGCC
ACTTTCTCCA GGTGCCGCGA TTTCATTCGG CAAACACCGG TGCAGGCCAA CAGCCGGGAG
GATTTGCAAA AACTGTTGAT GCGCGCATCC GGCGGTGTGA TTTTTACCAC CTTGCAAAAG
TTTGCCGAGT CAAGTCAGCC GCTGACCGAG CGTCACAATG TGGTGGTGAT TGCTGATGAG
GCGCACCGCA GCCAATATGG TTTTCGCGCC AAACTCGATG CCAGGACGGG CGGGGTTTCC
TACGGCTTCG CCAAATACCT GCGCGATGCC CTGCCTCATG CCTCATTCAT CGGCTTTACC
GGCACGCCGA TCGAAGCCAG TGGTGTGAAC ACCCGGGCGG TGTTTGGCAA TTACATCGAT
ATCTATGACA TCAGCCGTGC CGTCGAGGAT GGCGCGACGG TGCTCATCTA TTATGAATCA
CGGCTTGCGC GCATCGAACT CGATGGGGAT GAGAGGCACA AAATCGATGC CGAAGTGGAT
GCGATGATCG AGGACGACTC CGAAGCCGAA CAAGAGCGCT TCAAGAAAGA ATGGTCAACC
GTGGAAGCCC TGGTGGGCAG CGATAAACGC CTCGCGCTGA TTGCGCAAGA CATGGTGACG
CACTTCGAAG CGCGCATGGC CGCACTGGAT GGCAAGGCAA TGGCGGTGTG CATGAGTCGC
CATGTCTGCG TGAGACTGTA CGAGGAAATC GTGAAACTGC GCCCCGGCTG GCACAGCGCC
GACGACAAGG CGGGCGCGCT CAAAATCGTG ATGACGGGTG CCGCCAGCGA CCCCGCAGAA
TGGCAACAGC ATGTCGGTAA CAAGGCGCGC CGCGATGTGC TGGCCAAGCG CGCTCGCGAC
ATCAGGGACC CGCTCAAACT GGTGATTGTG TGCGATATGT GGCTCACCGG TTTTGATGCG
CCCTGCATGC ACACGATGTA TGTGGACAAG CCCATGCAAG GGCATGGCCT GATGCAGGCG
ATTGCGCGGG TCAATCGCGT GTTCCGCGAC AAACCCGCCG GATTGATCGT GGACTATATC
GGCATCGCCC ATTATCTTAA ATCGGCGCTG CGACAATACT CGAAAAGCGA CCGGAAAAAT
ACCGGTATCG ATGAATCCAA GGCCGTCGCG ATGATGATGG AGAAGTACGA AATCGTGCGC
AACATGTACC ACGGTTTCGA TTACACGTTG GATGGAACAG CGCAAGAACG CCTGGTCATG
ATGGCCGGCG CCATCGAGTG GATTCTCGAC ATGCAGCAAA AATCGGCGGC CAGGGAGAGC
AGCCGGGAAG GCAAGAAGAC TGCGCATCGC CGCTATCAGG ATGCAGTGCT GGCGCTGTCC
AAGGCTTTTT CCCTGGTGTC TGCTTCTGAC GAAGCGCGCA CCATTCGGGA AGAAGTCGGC
TTTTTCCAGG CGATTCGCGA TGCACTGGTG AAAAGCAGTG GCGCAATGGG CGTGATGGGG
CAAGAACGCG AATTGGCCAT TCGGCAAATC GTGAGCCGCG CGGTGATTTC TACCGAAATC
GTCGACATTC TGGCCGCAGT CGGCATCAAG AGCCCGGATA TTTCCATCCT GTCCGACCAA
TTTCTGACCG AAATCGCGCA GATGGAGCGC AAGAATCTCG CGCTGGAGGC CTTGCGCAAG
TTGATCAATG ACGACATCCG TTCACGCAGC AAAGCCAACG TGGTGCAAAG CAAAGCCTTC
TCGGAACGGC TGGAAGAGGC CATGGCGCGC TACCACGCCA ATGCCATCAC GACCGCCGAA
GTGCTGCAAG AGTTGATCAC CCTGGCCAAG GACATTCGCG CCGAGCGCCA GCGCGGGGAA
GAGTCCGGTT TGTCGGATGA GGAGATTGCT TTCTATGACG CGTTAGCCGA AAACGATAGC
GCGCTGCAGA TGATGGGCGA TGACAAACTG CGGCTGATTG CCCACGAATT GCTGGTCAGC
CTGCGCAACA ACATATCGGT GGATTGGGCG CACCGCGAAT CCGCCCGCGC GCAAATGCGC
ATGCTGGTCA AGCGCATTCT GCGCAAATAT GGCTATCCGC CGGATTTGCA GGACGAGGCC
GTGCGAACGG TATTGCGACA GGCTGAGGTG TCGATCAGGA GCATGTCCGG CAATGCGCTC
GATGAAAATC TGAAGGGCTT CCGCCTGAAA TAA
 
Protein sequence
MAFLSEAAIE LALLEQLRGL GYRIEQEKDI GPDGPRPERE SHDEVVLKKR FADAVARLNP 
GLPLEARQVA MRRLMQSELP TLLEENRRLH KLMTDGVEVE YYAEDGTLTT GTVALIDFKR
PEQNDWLAVS QFVVIAGQYQ RRPDVVVFVN GLPLVVIELK ALGNATLLAA FQQLQNYKQQ
IAALFNTNAL LVTSDGVHAR VGSLSADQER FMPWRTTDGT AITPKGKPEL PTLIEGVFAH
RRLLDLLRHF TVFDETGSGL VKIIAGYHQF HAVMRAVDST LRASSQWQGV QEDPRDYGWS
SVKTQAKGDR RAGVIWHTQG SGKSLLMVFY ARQLAQHVAM ANPTLVVLTD RNDLDDQLFA
TFSRCRDFIR QTPVQANSRE DLQKLLMRAS GGVIFTTLQK FAESSQPLTE RHNVVVIADE
AHRSQYGFRA KLDARTGGVS YGFAKYLRDA LPHASFIGFT GTPIEASGVN TRAVFGNYID
IYDISRAVED GATVLIYYES RLARIELDGD ERHKIDAEVD AMIEDDSEAE QERFKKEWST
VEALVGSDKR LALIAQDMVT HFEARMAALD GKAMAVCMSR HVCVRLYEEI VKLRPGWHSA
DDKAGALKIV MTGAASDPAE WQQHVGNKAR RDVLAKRARD IRDPLKLVIV CDMWLTGFDA
PCMHTMYVDK PMQGHGLMQA IARVNRVFRD KPAGLIVDYI GIAHYLKSAL RQYSKSDRKN
TGIDESKAVA MMMEKYEIVR NMYHGFDYTL DGTAQERLVM MAGAIEWILD MQQKSAARES
SREGKKTAHR RYQDAVLALS KAFSLVSASD EARTIREEVG FFQAIRDALV KSSGAMGVMG
QERELAIRQI VSRAVISTEI VDILAAVGIK SPDISILSDQ FLTEIAQMER KNLALEALRK
LINDDIRSRS KANVVQSKAF SERLEEAMAR YHANAITTAE VLQELITLAK DIRAERQRGE
ESGLSDEEIA FYDALAENDS ALQMMGDDKL RLIAHELLVS LRNNISVDWA HRESARAQMR
MLVKRILRKY GYPPDLQDEA VRTVLRQAEV SIRSMSGNAL DENLKGFRLK