Gene Veis_4092 details

Gene Information

Locus tag: Veis_4092
Symbol:
ID: 4691762
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Verminephrobacter eiseniae EF01-2
Kingdom: Bacteria
Replicon accession: NC_008786
Strand:
Start bp: 4488376
End bp: 4489935
Gene Length: 1560 bp
Protein Length: 519 aa
Translation table: 11
GC content: 69%
IMG OID: 639851839
Product: protein of unknown function DUF894, DitE
Protein accession: YP_998815
Protein GI: 121611008
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
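
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below is an illustration only, not part of the database record; it assumes the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are made up for the example.

```python
# Values copied from the record above.
start_bp = 4488376
end_bp = 4489935

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # reported as 1560 bp
print(protein_length)  # reported as 519 aa
```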


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.146023
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGCGC TGGCGCCGCT GACGATTCCG GTGTTCCGCA TGTTGTGGCT GACCTGGGTG 
ACGGCCAATA CCTGCATGTG GATGAACGAT GTGGCGGCGG CCTGGCTGAT GACCACGCTG
ACCAGTTCGC CGATCCTGGT CGCGCTGGTG CAGTCGGCGT CCACGCTGCC GGTGTTTTTG
CTCGGTCTGC CCAGCGGCGC GCTGGCCGAC ATTCTGGACC GGCGGCGCTA TTTCATCGTC
ACCCAGTTCT GGGTTGCGGC CGTGGCGCTG GTGCTGTGCC TGGCCATCCT GGCCGGCGGC
ATGACGGCGC CACTGCTGCT GGCGCTGACT TTTGCCAACG GCATCGGTCT GGCGATGCGC
TGGCCGGTGT TCGCGGCCAT CGTGCCCGAA TTGGTCGCGC GCGCGCAGTT GCCTGCGGCG
CTGGCGCTCA ACGGCGTGGC CATGAATGCC TCGCGCATCA TGGGCCCGCT GCTGGCCGGG
GCCATCATTG CCAGCGCGGG CAGCGCCTGG GTGTTCGTGC TCAATGCCGT GCTGTCGGTG
CTGTCCGGCC TGGTCATCAT GCGCTGGAAG CGCGTCCATG TGCCCAACCC GCTGGGGCGC
GAGCGCCTGC CCAGCGCGAT GCGCGTGGGC CTGCAATTCG TGGGCCAGTC GCCGCGCATG
AGGGCGGTGA TGTGGCGCAT CTCGATCTTC TTTTTGCACG CCACCGCGCT GCTGGCGCTG
TTGCCGCTGC TGGCCCGGGG GCTGGAGGGC GGCGGCGCCG GCACCTTTAC GCTGCTGCTG
GCGTCAATGG GCGCGGGGGC GATCTCGGCG GCGCTGTTTT TGCCGCGCCT GCGCCAGGCC
ATGGCGGGCG ACACGCTGGT GATCCGCGGC ACCCTGCTGC AGGCGGCGGC CACCGGGGTG
ATGGCCATCG CGCCGAATGT CCAGGTGGCC GTGCCGGCGA TGTTCATCGG CGGCATGGCC
TGGATCACCA CGGCCAACTC GCTGAGCGTG TCGGCCCAAC TGGCGCTGCC CAACTGGGTG
CGCGCCCGGG GCATGTCGAT CTACCAGATG GCCATCATGG GCTCGACCGC GCTGGGCGCC
GCGCTGTGGG GCCAGGTGGC CACGCTCGGC AATGTGCACC TGAGCCTGGG GCTGTCGGCG
CTCTCCGGGG TGTTGGCGAT GCTGCTGGTG CAGCGCCTGG TGGCCGACCG CAGCATCGAA
GAAGACCTGA GCCCCTCGCG CGCCTTCAAG GCGCCGGTGC TCGACATCCC GCCGGAATCG
GGCCATGTGG TGGTGACCAT CGAATACTTC ATCGACCCGG CGCGCGCGGC GGAATTTCGC
GCGCTGATGC AAGACAGCCG GCGCAGCCGC CTGCGCCAGG GCGCATTGGC CTGGCAGCTA
CAGCACGATA TCACCGACCC CGCGCGCTAT GTCGAGCAGA TCGAGGATGA ATCCTGGACC
GAGCACCTGC GCCGCTTCGA CCGCGTCACC GCCCACGACG TGGCGCTGCG CGAGCGCAAA
CTGGCGTTCC ACACCCGGGA CACACCGCCC GTGGTCTCGC GCCTGCTGGT GCAGCGCTGA
 
Protein sequence
MTALAPLTIP VFRMLWLTWV TANTCMWMND VAAAWLMTTL TSSPILVALV QSASTLPVFL 
LGLPSGALAD ILDRRRYFIV TQFWVAAVAL VLCLAILAGG MTAPLLLALT FANGIGLAMR
WPVFAAIVPE LVARAQLPAA LALNGVAMNA SRIMGPLLAG AIIASAGSAW VFVLNAVLSV
LSGLVIMRWK RVHVPNPLGR ERLPSAMRVG LQFVGQSPRM RAVMWRISIF FLHATALLAL
LPLLARGLEG GGAGTFTLLL ASMGAGAISA ALFLPRLRQA MAGDTLVIRG TLLQAAATGV
MAIAPNVQVA VPAMFIGGMA WITTANSLSV SAQLALPNWV RARGMSIYQM AIMGSTALGA
ALWGQVATLG NVHLSLGLSA LSGVLAMLLV QRLVADRSIE EDLSPSRAFK APVLDIPPES
GHVVVTIEYF IDPARAAEFR ALMQDSRRSR LRQGALAWQL QHDITDPARY VEQIEDESWT
EHLRRFDRVT AHDVALRERK LAFHTRDTPP VVSRLLVQR
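
The gene and protein sequences above can be cross-checked against each other with a short script. This is a minimal sketch, not the database's own tooling: it assumes the amino-acid assignments that NCBI translation tables 1 and 11 share, the helper names (`clean`, `translate`, `gc_content`) are hypothetical, and only the first 60 bases of the gene sequence are included for brevity.

```python
# Codon table in NCBI order (bases T, C, A, G). The amino-acid assignments
# are the same for translation table 11 as for the standard table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_content(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence, as shown in the record.
first_line = "ATGACTGCGC TGGCGCCGCT GACGATTCCG GTGTTCCGCA TGTTGTGGCT GACCTGGGTG"

print(translate(first_line))           # compare with the start of the protein sequence
print(f"{gc_content(first_line):.1%}")  # GC fraction of these 60 bases only
```

Passing the full gene sequence to `translate` and `gc_content` gives values that can be compared with the protein sequence and the GC content field of the record.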