Gene Veis_1879 details


Gene Information

Locus tag: Veis_1879
Symbol:
ID: 4693619
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Verminephrobacter eiseniae EF01-2
Kingdom: Bacteria
Replicon accession: NC_008786
Strand:
Start bp: 2102163
End bp: 2103524
Gene Length: 1362 bp
Protein Length: 453 aa
Translation table: 11
GC content: 64%
IMG OID: 639849646
Product: extracellular solute-binding protein
Protein accession: YP_996650
Protein GI: 121608843
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
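
The length fields follow from the coordinates listed above. The sketch below checks that arithmetic, assuming that Start bp and End bp are 1-based inclusive positions (an assumption that agrees with the listed gene length) and that the CDS ends in a single stop codon. The function names are illustrative and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a CDS that ends in one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(2102163, 2103524)
print(length)                     # 1362
print(protein_length_aa(length))  # 453
```

Both values match the Gene Length and Protein Length fields of this record.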


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.909678
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.114435
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGAAA CCACAGCCAC CACAGCCACC ACGCAGCGAC AACCGGCCGG CCTTCGTCGG 
CGCACGCTGA TCCGGCGCGG CAGTGCTGCG CTCGGCGCCG CGCTGACGCT GGGCGCGCCC
ACGGTGTGGG GGCAGGGCAA GAAGACGCTG CGCTTTCTCA ACAGCGAAAC CTCGATCGAC
AGCATCCGCG CGCTGAAAGT CGCCTGCGCC GACTACGAAA GGCAGTTCGG CACCCGGATC
GTGATCGATT CCGTTCCCAT CGACGACGCC TTCATCAAGG TCACGACCTC GCTGCGCGGC
GGACAGCCCT ATGACATCGC CACGCTGGCC TTTGTCGGCC ATGTGCTGCT GTTGCAGGCC
GACGACCGCC TGATGCCGCT GACCGAACTG ACCAGCAAGC ACCAGTGGGG GCACAAAATC
CTGTTCCCGC TCAAGAACCA GGTCTACTGG TACCCCTATG ACTACAACCT GGCGTGGATC
TATTACCGCA AGGACCTGTA CGAGCAAAAG GGCCTGAGCG TTCCCAAGTC TTACGAGCAG
ATGCTCAAGA ACGCGCAGAC GCTGAGCGCC GATGGCCGCC ATGGCGCGCT GTTTCCGATC
GGCAGCAACG GCGCGACGAA CTGGCTGTCG TCGGGCTTCA TGTGGGCCGA GGGCGTGAAG
CTGTTCGACG ACCAGTGGAA CGTGCTGCTG GACAGCGCCG AGATGGCGCC CCGGGTGGGG
CGCTATCTGG ACTTTTTCGC CGCGCTCTAC AAGAGCATGC CGGCGGGCGC GAGCCAGGCG
GGCTTCGGCG AAATGCTCAG CAATTTTTCC TCCGACAAGG TGGCGCATGT GGCCTACGCC
GGGCGCATCA TCGAGGCGCT CGAGCGCAAC GCGCCGGCGC TGTCGACCCG CTACGGCATC
ACCCCCTACA TGGACAGCAA GGGCCGGGCG AAGGCGGTCA ACCACGGCTA CGACGGCTGG
GTGGTGCTCA AGACGCCGAA CTCCGACGAG GCCATGAAAT TCATGGCCTG GTTCACCGAG
CACCAGTACA TCAACTTCCT GCACACCGCG CCGCTGCACT TTCAGCCGCC CCGCCTGGAC
ATCTACGACG ACCTGCGCTG GCGCGCCCAC CCGCTGATCG CCAAGCACCA GGAGGCGGTC
GAGACGATGC GCAGCTTCAT CACCGACAAG TCGGTCATCC TGACTTCCGT GGATACCGAA
GGCCCGGCGC CCGATCTGCG CCCCGGCAAG GTGCTCGAAG GCTTCGTGAT CCCCGAAATG
CTGCAAAACA AGGTGCTGAA AAACATGCCG TCGGCCGAAT GCGTGAAGCT CGCCGCCGAC
AAGATGCGCA AGCTCACCGG GGCTGGCGCA GGTGCTGCGT AG
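
The gene sequence is listed in space-separated blocks of 10 bases, so the spacing has to be removed before any computation. The sketch below shows one way to do that and to compute a GC fraction of the kind reported in the GC content field. The helper names are hypothetical, and the sample input is only the first listing line, so its result is not the whole-gene value.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First listing line of the gene sequence, used here only as sample input.
first_line = "ATGACGGAAA CCACAGCCAC CACAGCCACC ACGCAGCGAC AACCGGCCGG CCTTCGTCGG"
seq = clean_sequence(first_line)
print(len(seq), round(100 * gc_content(seq)))
```

Passing the full listing through `clean_sequence` and `gc_content` gives the figure to compare against the GC content field.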
 
Protein sequence
MTETTATTAT TQRQPAGLRR RTLIRRGSAA LGAALTLGAP TVWGQGKKTL RFLNSETSID 
SIRALKVACA DYERQFGTRI VIDSVPIDDA FIKVTTSLRG GQPYDIATLA FVGHVLLLQA
DDRLMPLTEL TSKHQWGHKI LFPLKNQVYW YPYDYNLAWI YYRKDLYEQK GLSVPKSYEQ
MLKNAQTLSA DGRHGALFPI GSNGATNWLS SGFMWAEGVK LFDDQWNVLL DSAEMAPRVG
RYLDFFAALY KSMPAGASQA GFGEMLSNFS SDKVAHVAYA GRIIEALERN APALSTRYGI
TPYMDSKGRA KAVNHGYDGW VVLKTPNSDE AMKFMAWFTE HQYINFLHTA PLHFQPPRLD
IYDDLRWRAH PLIAKHQEAV ETMRSFITDK SVILTSVDTE GPAPDLRPGK VLEGFVIPEM
LQNKVLKNMP SAECVKLAAD KMRKLTGAGA GAA
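
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below builds a codon table from the compact NCBI-style amino acid string, whose codon assignments for table 11 are the same as the standard code (table 11 differs in its permitted start codons, which does not matter here because this gene starts with ATG). The `translate` helper is illustrative and stops at the first stop codon.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position), paired with the
# amino acid string used for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS listing, ignoring whitespace, up to the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGACGGAAA CCACAGCCAC CACAGCCACC"))  # MTETTATTAT
```

The output matches the first ten residues of the protein sequence. The final bases of the gene, GGTGCTGCGT AG, translate to GAA followed by the TAG stop codon, which agrees with the end of the protein sequence.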