Gene Veis_2044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVeis_2044 
Symbol 
ID4691498 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVerminephrobacter eiseniae EF01-2 
KingdomBacteria 
Replicon accessionNC_008786 
Strand
Start bp2320127 
End bp2321203 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content66% 
IMG OID639849808 
Productperiplasmic binding protein/LacI transcriptional regulator 
Protein accessionYP_996812 
Protein GI121609005 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.707301 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCCGGC AAGGCCCCCG GCAATACCCC CCCCGAAAAA CCATCCCCCA ACCCAAGGAG 
CACGAGTTGA AAACCATCAT CCGACTGGCC GCGCTGGCGC TGGCGCTGTC CCTGTCCGCC
ACCGGGCCTG CGCTGGCCCA GCAGGCCAAG AAAACCTGGA AGGTCGGCGC TGCGGTGTAC
GGCCTGAAGG CCGAATTCGC GCAGCTATGG GTGAATGCGC TGAAAAAGCA CCCGCTGGTC
AAGGACGGCA CCGTCAAGCT CACGGTGTTC GACGGCAAGT ACGACGCGCT GACGCAGAAC
AACCAGTTCG AGACCATGAT CACGCAAAAG TACGACGGCA TCCTGTTCGT GCCGATCGAC
CTGCAGGCCG GCGCCGATGC GGTGTCCAAG GCGGCCGAGG CGAACATCCC GGTGGTCGGC
TCCAACGGCC GCGTCAACAG CGACAAACTG CTGTCGTATG TCGGCTCGAA CGACGTGATC
GCCGGCGCCA TGCAGGCGCA GGCGGTCGTC GATGCGATGG GCGGCAAGGG CAACGTGGTG
ATCCTCGAAG GCCCGATCGG GCAGTCGGGG CAGGTCGAGC GGCGCCAGGG CAACCTGAGC
GTGCTGGCCA AATACCCGAA CGTGAAGGTG CTGGAAATGA AAACCGCGAA CTGGTCGCGC
GCCGAGGCGC TGTCGCTGAC CGAGAACTGG CTCACCGCGC ATGCCGGCAA GATCAACGGC
ATCATCGGCC AAAACGACGA GATGGCGCTC GGCGCAATCG AGGCGGTCAA GGCCAAGGGG
CTGGACCCCA AGACCATTCC GACCGCCGGC ATCGACGGCG TCAGTGATGC AGTGCGCGCG
GTCAAGGCCG GCATCATGGC CAGCGTGCTG CAAGACGCCA GCGCGCAGTC CCAGGGGGCG
CTCGACGTGC TGCTGCGCAA GCTGATCGGC GCCAGCTACA AGCCGCGCTC GGCCATGTGG
GCGCAGTACG GCGCGGCCGG CCTGCAATGG GACGACGGCG CGGCCCGGGC CTACAACATC
CCGTGGACCC CGATCACGCT GCAAAACGCC GACGCGCTGC TGGCGCAACG CAAATGA
 
Protein sequence
MSRQGPRQYP PRKTIPQPKE HELKTIIRLA ALALALSLSA TGPALAQQAK KTWKVGAAVY 
GLKAEFAQLW VNALKKHPLV KDGTVKLTVF DGKYDALTQN NQFETMITQK YDGILFVPID
LQAGADAVSK AAEANIPVVG SNGRVNSDKL LSYVGSNDVI AGAMQAQAVV DAMGGKGNVV
ILEGPIGQSG QVERRQGNLS VLAKYPNVKV LEMKTANWSR AEALSLTENW LTAHAGKING
IIGQNDEMAL GAIEAVKAKG LDPKTIPTAG IDGVSDAVRA VKAGIMASVL QDASAQSQGA
LDVLLRKLIG ASYKPRSAMW AQYGAAGLQW DDGAARAYNI PWTPITLQNA DALLAQRK