Gene Veis_3171 details

Gene Information

Locus tag: Veis_3171
Symbol:
ID: 4691645
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Verminephrobacter eiseniae EF01-2
Kingdom: Bacteria
Replicon accession: NC_008786
Strand:
Start bp: 3533485
End bp: 3534816
Gene Length: 1332 bp
Protein Length: 443 aa
Translation table: 11
GC content: 63%
IMG OID: 639850933
Product: extracellular solute-binding protein
Protein accession: YP_997919
Protein GI: 121610112
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
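
The start, end, gene length, and protein length fields can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the gene length includes one stop codon; the record does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 3533485
end_bp = 3534816
reported_gene_length_bp = 1332
reported_protein_length_aa = 443

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and adds no residue.
expected_protein_aa = gene_length_bp // 3 - 1

print(gene_length_bp == reported_gene_length_bp)
print(expected_protein_aa == reported_protein_length_aa)
```

Under these assumptions both comparisons agree with the reported values.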


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0465993
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCTATC CCTTCCGTGC TGTCGCAGTT TGCCGACGGG CTTGGGTCAC CGCCCTGGGC 
TTGTCGCTGT GCGCTGGCGC CAGCGCGCAG ACCGAGTTGG TCATTGCCAC CGTGAACAAC
GGCCACATGA TCGAGATGCA AAAGCTCGGC AAGCACTTCG AGCAGGCCCA CCCTGACATC
CGGCTCAAGT GGGTCACGCT GGAGGAGGGT GTGCTGCGCC AGCGCGTGAC GACCGATATC
GCCACCAAGG GCGGCCAGTT CGATGTGATG ACCATTGGCA TGTATGAGAC GCCGATCTGG
GGCAAGAAGG GCTGGCTGCA GGCGTTGAAG ACCGACGCCG CCTACGATGC CGACGATCTG
TTGCCCGCGA TACGCCAGGG CCTGTCGGTC GACGGCAAGC TGTTCGCGGC CCCGTTCTAC
GGCGAAAGCT CGATGCTGAT GTACCGCAAG GACTTGGCCG ACAAGGTGGG GGTGCAGGTG
CCCGAGCGTC CGACCTGGCC GCAGATCAAG GATTTGGCGG CCAAGATCCA CGACCCCAAA
AACGGCGTGT ACGGCATCTG CCTGCGCGGC AAGCCGGGCT GGGGCGACAA CATGGCTTTT
CTGAGCACGC TGGTGAACAC CTTCGGCGGC CAATGGTTCG ACATGCAGTG GAAGCCGCAG
CTTCAGTCCA AGCCCTGGCA GGAGGCCATC CACTTTTATG TCGATCTGCT CAAGCACCAT
GGCCCGCCCG GCTCGTCGGC GAACAGTTTC AACGAGCTCC TGGCGCTGAC CAATTCCGGT
AAATGCGGCA TTTGGATCGA CGCCACCATT GCCGCCTCGT TCGTCAGCGA TGCCAGGCAG
TCGAAGGTGG CCGGGCAAAT GGCTTTTGCC CAGGCGCCGA CGATGCACAC GCCCAAGGGC
GCGAACTGGC TGTGGTCGTG GAATCTGGCG ATTCCGGCAG GTTCCCGGAA GGTGGACGCG
GCGCAGAAGT TCATCACCTG GTCGACCAGC AAGGACTATG TGCAACTGGT GGCCAAAACC
AATGGCTGGG CCAATGTGCC CACCGGCACG CGCCGGAGCA CCTATGCCAA TGCCGAGTTC
CAGAAGGCGG CCCGTTTTGC GGCCGCAGAA AAGATGGCCA TCGATTCGGC CAACCCCACG
GACGCGACGC TGCCCCAAAG TCCCTATATC GGCGTGCAGT TTGCCGCCAT TCCTGAATTC
CAGGCCATCG GCATCGCTGT GGGCCAGCAG ATGAGCGCGG CGCTGGCTGG CAAGAGCACG
GTCGAGGCGG CCCTGAAGGC CAGCCAGACC CTGGCCGAGC GTGAGATGAA GAAGGCGGGC
TACTACCGGT GA
 
Protein sequence
MPYPFRAVAV CRRAWVTALG LSLCAGASAQ TELVIATVNN GHMIEMQKLG KHFEQAHPDI 
RLKWVTLEEG VLRQRVTTDI ATKGGQFDVM TIGMYETPIW GKKGWLQALK TDAAYDADDL
LPAIRQGLSV DGKLFAAPFY GESSMLMYRK DLADKVGVQV PERPTWPQIK DLAAKIHDPK
NGVYGICLRG KPGWGDNMAF LSTLVNTFGG QWFDMQWKPQ LQSKPWQEAI HFYVDLLKHH
GPPGSSANSF NELLALTNSG KCGIWIDATI AASFVSDARQ SKVAGQMAFA QAPTMHTPKG
ANWLWSWNLA IPAGSRKVDA AQKFITWSTS KDYVQLVAKT NGWANVPTGT RRSTYANAEF
QKAARFAAAE KMAIDSANPT DATLPQSPYI GVQFAAIPEF QAIGIAVGQQ MSAALAGKST
VEAALKASQT LAEREMKKAG YYR
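
Both sequences are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how that could be done, together with a simple GC-fraction calculation. The helper names `join_blocks` and `gc_fraction` are hypothetical, and the record does not say how its own GC content figure was computed, so the result may differ slightly from the reported value.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence above, used as a short example.
first_line = "ATGCCCTATC CCTTCCGTGC TGTCGCAGTT TGCCGACGGG CTTGGGTCAC CGCCCTGGGC"
dna = join_blocks(first_line)

print(len(dna))  # six blocks of ten bases
print(round(gc_fraction(dna), 2))
```

Passing the full gene sequence to `join_blocks` in the same way should yield a string whose length can be compared with the Gene Length field.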