Gene Veis_2058 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVeis_2058 
Symbol 
ID4691853 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVerminephrobacter eiseniae EF01-2 
KingdomBacteria 
Replicon accessionNC_008786 
Strand
Start bp2336437 
End bp2338344 
Gene Length1908 bp 
Protein Length635 aa 
Translation table11 
GC content60% 
IMG OID639849822 
ProductRpoD family RNA polymerase sigma factor 
Protein accessionYP_996826 
Protein GI121609019 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.704789 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0381419 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATCA GCAAGGCCAA GGAACGCGCG CTGATGAAGG AGTTCGGCCT GGACGAGACC 
GTTCTGTCCG AGGAAGACCT GGCCCGGCGC CGTTCGCGCC TGAAGACCCT GATCAAACTG
GGCAAGACGC GCGGCTACCT GACCCATGTC GAGATTTCCG ACCACTTGCC CGACAAACTG
GTCGATGCCG AAACGCTGGA AGCCGTCATC ACCACGCTGA ACGACCTGGG CGTGGCCGTC
TACGAGCAAA CGCCCGATGC CGAAGCGCTG ATCATTACCG ACAATGCCCC CACCGGCGCC
AGCGAAGAAG AGGCCGAAGA GGCCGCCGAA GCGGCCCTGT CTACCGTCGA CAGCGAGTTC
GGCCGCACCA CCGACCCGGT GCGCATGTAC ATGCGCGAGA TGGGCACCGT GGAACTGCTC
ACGCGCGAAG GCGAGATCGA AATCGCCAAA CGCATCGAAG GCGGCCTGAT GGCGATGATG
GAGGCGATCA GCGCGTCGCC AGCCACCATC GCCGAAATAC TGAACATGGG CGAGGAAATC
CGCGCAGGCA AGGTCGTGAT CTCGACCATC GTCGATGGCT TTTCCAACCC CAACGAGGCC
GACGACTACG TGGCCGAAGA AGACTTCGAC GAATTTGACG AAGCCGATGA CGACGACGGC
AAGGGCGGCT CCAAGGCGCT GACCAAAAAG CTCGAAGAAC TCAAAAAGCA GGCCCTGGAA
CGCTTTGACA AACTGCGCGA TCTGTTCGAG AAAATGCACA AGGTCTACGA CAAGGACGGC
TACGGCACGC TGGCCTACGT GCAGGCCCAG CAAGCCCTGT CGGCCGAGCT GATGACCATA
CGCTTTACCG CCAAGACCAT CGAAAAACTG TGCGACATGG TGCGCGGCCA GGTCGATGAT
GTGCGCCGGA AAGAACGCGA GCTGCGCCGC ATCATCGTGG ACAAATGCGG CATGCCGCAG
GAAACCTTCA TCAAGGATTT CCCGCCCAAC CTGCTGAACC TGCAATGGGT GGAAAAGCAG
GCGGCCATGG GCAAGCCCTG GTCTTCGATC ATTGCGCGCA ACATCCCGCC GATCCAGGAT
TTGCAGCAAA AGCTGATGGA CTTGCAGTCG CGCGTGGTGG TGCCGCTGAC CGAGCTCAAG
GTCATCAACA AGCGCATGAA TGAAGGCGAG GCCACCTCGC GCGATGCCAA AAAGGAAATG
ATCGAGGCCA ACCTGCGCCT GGTGATCTCG ATTGCCAAGA AGTACACCAA CCGTGGCCTG
CAATTCCTGG ACCTGATACA GGAGGGCAAC ATCGGCCTGA TGAAGGCCGT GGACAAATTC
GAATACCGCC GCGGCTACAA ATTCTCGACC TATGCCACCT GGTGGATCCG CCAGGCCATC
ACGCGCTCGA TCGCCGACCA GGCGCGCACC ATCCGCATCC CGGTGCACAT GATAGAGACC
ATCAACAAGA TGAACCGCAT CAGCCGCCAG CACTTGCAGG AGTTCGGCTT CGAGCCCGAT
GCCTCGCTGC TGGCCGCCAA AATGGAGATA CCCGAGGACA AGATCCGCAA GATCATGAAG
ATCGCCAAAG AACCGATCTC GATGGAAACC CCGATCGGGG ACGACGACGA CAGCCACCTG
GGCGATTTCA TCGAGGACGC GAGCAACACC GCCCCGATAG AAGCCGCGAT GCAGGCCGGC
CTGCGCGACG TGGTCAAGGA CATCCTCGAC GGCCTGACGC CGCGCGAAGC CAAGGTGCTG
CGGATGCGCT TCGGCATCGA GATGACCAGC GACCACACGC TGGAAGAAGT GGGCAAGCAA
TTTGACGTGA CGCGCGAGCG CATCCGCCAG ATAGAAGCCA AGGCGCTGCG CAAGCTCAAG
CACCCGAGCC GTTCGGACAA GTTGCGCAGC TTCATCGACT CGATATAG
 
Protein sequence
MKISKAKERA LMKEFGLDET VLSEEDLARR RSRLKTLIKL GKTRGYLTHV EISDHLPDKL 
VDAETLEAVI TTLNDLGVAV YEQTPDAEAL IITDNAPTGA SEEEAEEAAE AALSTVDSEF
GRTTDPVRMY MREMGTVELL TREGEIEIAK RIEGGLMAMM EAISASPATI AEILNMGEEI
RAGKVVISTI VDGFSNPNEA DDYVAEEDFD EFDEADDDDG KGGSKALTKK LEELKKQALE
RFDKLRDLFE KMHKVYDKDG YGTLAYVQAQ QALSAELMTI RFTAKTIEKL CDMVRGQVDD
VRRKERELRR IIVDKCGMPQ ETFIKDFPPN LLNLQWVEKQ AAMGKPWSSI IARNIPPIQD
LQQKLMDLQS RVVVPLTELK VINKRMNEGE ATSRDAKKEM IEANLRLVIS IAKKYTNRGL
QFLDLIQEGN IGLMKAVDKF EYRRGYKFST YATWWIRQAI TRSIADQART IRIPVHMIET
INKMNRISRQ HLQEFGFEPD ASLLAAKMEI PEDKIRKIMK IAKEPISMET PIGDDDDSHL
GDFIEDASNT APIEAAMQAG LRDVVKDILD GLTPREAKVL RMRFGIEMTS DHTLEEVGKQ
FDVTRERIRQ IEAKALRKLK HPSRSDKLRS FIDSI