Gene Veis_4061 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVeis_4061 
Symbol 
ID4693587 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVerminephrobacter eiseniae EF01-2 
KingdomBacteria 
Replicon accessionNC_008786 
Strand
Start bp4454478 
End bp4455683 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content67% 
IMG OID639851808 
ProductAraC family transcriptional regulator 
Protein accessionYP_998784 
Protein GI121610977 
COG category[K] Transcription 
COG ID[COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.135508 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.429541 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGCAC GCATGCCCGT CAAGCCCGGC GACAGCCCGG ATGCCCGCGC CAAGGCCGGC 
AGCACCCATA TCGGCTTCAT CCTGCTGCAG GACTACTCGA TGATCGCCTT TGCCAATGCC
GTGGAAACCT TGCGCATGGC CAACTACCTG AGCCGGCGCC CGCTGTACCG CTGGAGCGTG
GTCAGCCCCG ATCAGACAGT GGCGCTGGCG AGCAACGGCC TGGCCGTCGC CGCCATCGGT
GACCTGGGCA GTCTGTCCGA ATGCCAGCAA CTGTTCGTCT GCGGCGGCGT CGACATCCGC
CACAACACCA GTGACACCGT GCGCCGGCTG CTGCGCCAAT GCGCCAGCTA CGGCGTGCCG
ATGGGCGGGC TGTGCACCGG CGCCTACGCA CTGGCCAGCG CCGGCCTGCT CGACGGCTAC
CGCTGCGCCA TCCATTGGGA AAACCTGGCC GCGATCCGCG AGGAATTCCC GAAAGTGCGG
TTTTCTTCCG AAGTCTTCGT CATCGACCGC GACCGCATCA CCTGCTCCGG CGGCACGGCG
CCGCTGCACC TGATGCTGCA CCTGGTGCGC GCGCAGCATG GCGCCCGGCT GATGATGGAC
ATCTCGGAAC AATTTCTGGT CGAGCGCCTG CGCTCCAGCG ACGACCGCCA GCGCATCCCC
CAACCCGCGT GCATAGGGCC AGGCTACCAG CACCTGACCG AGGCCGCCGC GCTGATGGCC
GCGCGCATCG AAGAGCCGCT GCCACTGGCC GAACTGGCCC GCGCCGTCGC GCTATCGCTG
CGCCAGCTCG AACGCCTGTT CCACCGCTAC TTCAGCATGA ACCCGGCGCA GTACTACATG
AACCTGCGCT TGCACCGCGC CCAGGAACTG CTCACGCACA GCAGCCTGCG CATCATGCAG
ATCACGGTGG CCTGCGGCTT TCAGTCCTCG TCGCATTTTT GCAAAGCCTA CCGCAGCCTG
TTCGGTCATT CGCCCAGCGA AGAAAGGCGG CGCCACATCG GCGGCGCGAA GCCACATGCC
GCCACATGGC GCGCCAGCCC GTCCGCCGCA CGCCGCGTCC GGCTGGCGCC GGCGAGCCTG
CCCGACTTCG GACCCCTGAC CCGTGCCGCC GCCTGTGATT CCATTTCTAC ATCGTTCGTG
AACGCGAAGG CGGAGCGCAG GCAGTACCAA TGCACGGCAA GCGACGCCGA CGAAGTGCAC
GGGTGA
 
Protein sequence
MPARMPVKPG DSPDARAKAG STHIGFILLQ DYSMIAFANA VETLRMANYL SRRPLYRWSV 
VSPDQTVALA SNGLAVAAIG DLGSLSECQQ LFVCGGVDIR HNTSDTVRRL LRQCASYGVP
MGGLCTGAYA LASAGLLDGY RCAIHWENLA AIREEFPKVR FSSEVFVIDR DRITCSGGTA
PLHLMLHLVR AQHGARLMMD ISEQFLVERL RSSDDRQRIP QPACIGPGYQ HLTEAAALMA
ARIEEPLPLA ELARAVALSL RQLERLFHRY FSMNPAQYYM NLRLHRAQEL LTHSSLRIMQ
ITVACGFQSS SHFCKAYRSL FGHSPSEERR RHIGGAKPHA ATWRASPSAA RRVRLAPASL
PDFGPLTRAA ACDSISTSFV NAKAERRQYQ CTASDADEVH G