Gene Veis_4145 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVeis_4145 
Symbol 
ID4690528 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVerminephrobacter eiseniae EF01-2 
KingdomBacteria 
Replicon accessionNC_008786 
Strand
Start bp4560067 
End bp4562337 
Gene Length2271 bp 
Protein Length756 aa 
Translation table11 
GC content66% 
IMG OID639851892 
Producthypothetical protein 
Protein accessionYP_998868 
Protein GI121611061 
COG category 
COG ID 
TIGRFAM ID[TIGR02059] cyanobacterial long protein repeat 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.686732 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCTCCA CCATCACCAT CGACAAGACC ATCGTCAGAG CCGGCGAGAC CGCCTTGGTC 
ACCTTTACTT TCTACAATAA GGAGGACATG CTCGTCGTTC GGGGGCAGAT TCATAACGTA
ACCGTCACCG GCGGCACGAT GAACGCCGGT TCCCTCTCAG GGATGAATCG GACTGCAACC
ACCAGATTCT TCACAGCCCA ATTCATACCG ACCCCGGGCC TGGAGAAATC CGACGCCAAA
ATCCGCTACG ACGGCACCGG CGATGTCGCC GATGGAGAGG TCACCTTCGC CGTCGACACC
CTGCGGCCCA CCGTTGCCAG TGCCTCGATC GCCAAAAGCG ACCTGCGCCT CGGGGAAAAA
ACCACCATCA CCATCACCTT CAGCGAACTG GTGACCCGCA GCAGCTTCAC CATCGATGAC
CTGCAGATAG ACGCCGGCAA GGGCACTTTG AGCAACCTGC GCGTCGCCCC CACCGACACC
ACGGCCACCA CCACTGCCGC CACCACCTGG CTGGTCGACC TGGAGGCCCC GACCACCCGG
CCCGCGACGG GCCTCGATGG CAACCAGATA CGGATCAACC TCGACGGCAT CACCGATGTC
CCGGGCAACG CAGGCGCGGG CCGGGGGGTG AGCGTCCCGG CCCGCTACAA CATCGACGAC
GGTGTGCCGC CCACGGTCAC CATCGCGCCG GCAACCACCA TCCTGCGGGC CGGCGAGACG
ATGAGCGTCA CCTTCACCTT CAGCGAGAAG GTCACCGGCT TTGGTACCGA GGACATCCAG
TACGACACCA GCAAAGGCAC GCTGGGCGCT CTGACGGCGG TCGGCACCGA CGGCAAGGTC
TGGAACGCCA CCTACACCCC CCAGCCCGGC ACCGAGAGCG CCAACAACAC CATCCGCGTG
AACCTCAGCG GCGTCCGGGA CGCGCAGGAC AACGCCGGCG TGGGCACCGG CACCAGCGGC
AACTTCGGCA TCGACACCGT GCGCCCCACG GTCAACGTGA CGATCAGCGA CGCCCGCCTG
ACCGCTGGCG AAAGCGCCAC CATCACCTTC ACCTTCAGCG AGCGCGTCAC CGGCTTCGCG
AAAAATGCCA TCGATCTGTC CCAGGCCAAC GGCACGCTCG GCGACCTGAC GCCGGTCGGC
ACCGACGGCA CAACCTGGAC CGCCACCTTC ACCCCCACGG CCAGACTGGC GCGCACCACC
AACAACCGGC TCACCCTGAA CCTGTACAAC GTGCGCGATG CCGCAGGCAA CGCCCCGGCG
GCGAACACCT ACGCGTTCAA CCAGTACACC GTAGACACCA TGGTCTTTGT GCTCAGCAAC
GCCACGGTGA ATCGCGAGCA GTTGGTGCTG AGCTACAGCG ACGAAACGAT GCTCGACGGG
AACGCGGACC GTGCCCCGAC CAACGAGTCC TTTACCGTGC TGGTCGATGG CACGCGCATC
GATGTCAGCC GGGTGACGGT GGATGCAGCG GCCAGGACGG TGACGCTGAC CCTGGCCAGC
GCCGTGACCA CCGGCCAGAC GGTGACCGTC GCCTACCAGG ACACCGACAC CAGCGATAAC
AAGGCGGTAC AGGAAGCCGG CACCGGCGAC GACGCGACCA GTTTTGCGGC CAGGGCGGTG
ACCAACCTCA CCCGGCCCCC GGTCGCACCC GCCACACCGG AGGCGCCGGA TGCGCCGGAC
TCCGACCGCG ACGGCCTGTC CAACAACCGG GAGGACCAGG CCCCCGGCCT GCTGCGCCCC
GACGGCTCGG CCGGTATGGC TGGTGATGGC AACGGCGATG GCGTCAAAGA CAGCCAGCAG
GCCGCCGTCG CTTCGACCCG CGACCAGACC CTGGTGGCCG GCAGCCAGAA CGGCAAATTG
ATCCCCGACA GCAACGCGCG CATCACCGAA CTGGTGCGCA GCGATGCCCC GGCCAACCTG
CCCAAGGGCA TGGAGATGCC GATCGGCCTG ACCTCATTCA AGGTATCGCT GGCCGAGGGC
CGCAGCACCG AGAGCTTCAG CCTGTACGTA GATCAGGCGC TCGGCGCCAA CGGCTACTGG
CTCAAGAACG GCGCCGGCAC CTGGGTGAAC CTGGCCAGCG AACCGTATGG TGGCAAGGTG
GCCAGCGAAG GCGGGCGCAT GCGGCTGGAC TTTCAGATCC AGGACGGCGG CCAGTACGAT
GCCGACGGAC TGGTCAACGG CAGCATCAGC GCGCCCGGCG CCGTGGCGAA GATGCCGCTG
TCCATCGTCG GGCAGTCGGC CCAGGTCGAT TCGCATGGCT TTTGGTACTG A
 
Protein sequence
MASTITIDKT IVRAGETALV TFTFYNKEDM LVVRGQIHNV TVTGGTMNAG SLSGMNRTAT 
TRFFTAQFIP TPGLEKSDAK IRYDGTGDVA DGEVTFAVDT LRPTVASASI AKSDLRLGEK
TTITITFSEL VTRSSFTIDD LQIDAGKGTL SNLRVAPTDT TATTTAATTW LVDLEAPTTR
PATGLDGNQI RINLDGITDV PGNAGAGRGV SVPARYNIDD GVPPTVTIAP ATTILRAGET
MSVTFTFSEK VTGFGTEDIQ YDTSKGTLGA LTAVGTDGKV WNATYTPQPG TESANNTIRV
NLSGVRDAQD NAGVGTGTSG NFGIDTVRPT VNVTISDARL TAGESATITF TFSERVTGFA
KNAIDLSQAN GTLGDLTPVG TDGTTWTATF TPTARLARTT NNRLTLNLYN VRDAAGNAPA
ANTYAFNQYT VDTMVFVLSN ATVNREQLVL SYSDETMLDG NADRAPTNES FTVLVDGTRI
DVSRVTVDAA ARTVTLTLAS AVTTGQTVTV AYQDTDTSDN KAVQEAGTGD DATSFAARAV
TNLTRPPVAP ATPEAPDAPD SDRDGLSNNR EDQAPGLLRP DGSAGMAGDG NGDGVKDSQQ
AAVASTRDQT LVAGSQNGKL IPDSNARITE LVRSDAPANL PKGMEMPIGL TSFKVSLAEG
RSTESFSLYV DQALGANGYW LKNGAGTWVN LASEPYGGKV ASEGGRMRLD FQIQDGGQYD
ADGLVNGSIS APGAVAKMPL SIVGQSAQVD SHGFWY