Gene Veis_2002 details


Gene Information

Locus tag: Veis_2002
Symbol:
ID: 4691705
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Verminephrobacter eiseniae EF01-2
Kingdom: Bacteria
Replicon accession: NC_008786
Strand:
Start bp: 2272262
End bp: 2273845
Gene Length: 1584 bp
Protein Length: 527 aa
Translation table: 11
GC content: 63%
IMG OID: 639849768
Product: extracellular solute-binding protein
Protein accession: YP_996772
Protein GI: 121608965
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
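
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2272262
end_bp = 2273845
gene_length_bp = 1584
protein_length_aa = 527


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks hold for this record under the stated assumptions.
coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
```

Under these assumptions, 2273845 - 2272262 + 1 gives 1584 bp, and 1584 / 3 - 1 gives 527 aa, which agrees with the listed gene and protein lengths.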


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.956925
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACCCA TCTGCTTTTC CCGCACCCTG CGGACATTCG CGTTGTCGGC CGCGCTGGCG 
GCTCCGGTGC TCTGGGCGCA GGCCCAGACG CTCACTGTCG TGATGCAAGG CGGCTTGCGG
GTGATGGATC CGATCACCAG CACCGCGTTT TTGACGCGGG ACCACGGCTA CATGATCTAC
GACACGCTGC TCGGCACAGA CGCCCACTTC AAGATACAGC CGCAGATGGC CGACTGGAAG
GCGTCCGCAG ATGGCCTGCG CTACCGCTTC ACCCTGCGCA GCGGACTCAA ATGGCACGAT
GGGGCGCCCG TGACCAGCGC CGATTGCATC GCGTCGATCA AGCGCTGGGC CGAGGTCGAT
TCGAGCGGCC AGGTGCTGCT GCCGATGATC GACAGCATCG AGGCTGTCGA CGACAAGGTA
TTCGAGGTGG TGCTGAAAGA GCGCACCACG CTGTTGCTCG AGGGCCTGGC CAAGCTCAGT
TCGCGCCCGG CCTTCATGAT GCCCAAACGC ATCGCCGCCA CTGCCGCCGC CACGCCGTTG
ACCGAATACA TCGGTTCGGG CCCGTTCCGT CTGGTCCGGG CGGAATTCAA GCCCGGCCTG
AAGGTGGTGT ACGAGAAGAA CAAGGACTAT GTGGCGCGCA GCGAGCCGGC AAGCTGGACT
GCGGGCGCGA AGCTCGTGGG CGTCGAGCGG GTCGAATGGA TCGCCATGCC CGATGCGATG
ACCTCGATCA ATGCGCTGAA AAATGGCGAG GTGGACTTCA TCCAGCAGGT TCCCTATGAC
CTGGTGCCGA TGCTGGAGCA TCAGAAAAAC GTGACGGTGC AGGTGCTCGA CAAGCTCGGA
TCGTGGACTT ACTTCCGCTT CAATCATCTT CATCCGCCGT TCGACAACAA GCTCGTGCGC
CAGGCCGCCA TGGCTGCGGT GGGCCAGGAG GACGTGCTCA AGGCGCTCGT GGGCAACCCG
AAGTTCTACC GGACATGCGC CGCAGTGTTT GGTTGCGGCC ATCCGAACGG CAGCAGCTAC
GGCGCCGAAT GGGTGATCCC CTCGGACATC GACAAGGCCA AGGCCCTTTT GAAAGAGGGC
CGCTACGACG GCATGGCGAT CGTAGTGCTG CAACCGACGG ATGTTGCCAT CGTGGCGGCC
CAGCCGATCG TGATTGGCGC GGCCTTGCGC AAGGCGGGCT TCAAGGTCCA GATGAAGACC
ATGGACTGGC AAACCGTGGT GACGCAGCAG GGCAATCAGA AATCGCCGCA GGAGGGCGGC
TGGAACATCT TCGCCACCGC CGGCCTGTTG GCCACGAGCG GCGATCCAAT GACCAACACG
ACCGTAGGAT CGAACGGCAG GAAAGCCTGG GCCGGCTGGC CCGACGTTCC GGCGATCGAG
GTTTTGCGGC AGCGCTACGT TCGCTCCACC GACCTGGCCG AACGCCAATC CATTGCTGTG
GAACTCCAGA AACTGGTGAT CGACAACGGC GTGGTCGCGC CACTGGGCCA GTTTCTGATT
CCAGCGGCAT ACAGCACGGC GATCAGCGGC GTGCTGGAGT CTCCGGTGAC TGTGTTTTGG
AACATCAAGA AATCTGCCAA ATGA
 
Protein sequence
MKPICFSRTL RTFALSAALA APVLWAQAQT LTVVMQGGLR VMDPITSTAF LTRDHGYMIY 
DTLLGTDAHF KIQPQMADWK ASADGLRYRF TLRSGLKWHD GAPVTSADCI ASIKRWAEVD
SSGQVLLPMI DSIEAVDDKV FEVVLKERTT LLLEGLAKLS SRPAFMMPKR IAATAAATPL
TEYIGSGPFR LVRAEFKPGL KVVYEKNKDY VARSEPASWT AGAKLVGVER VEWIAMPDAM
TSINALKNGE VDFIQQVPYD LVPMLEHQKN VTVQVLDKLG SWTYFRFNHL HPPFDNKLVR
QAAMAAVGQE DVLKALVGNP KFYRTCAAVF GCGHPNGSSY GAEWVIPSDI DKAKALLKEG
RYDGMAIVVL QPTDVAIVAA QPIVIGAALR KAGFKVQMKT MDWQTVVTQQ GNQKSPQEGG
WNIFATAGLL ATSGDPMTNT TVGSNGRKAW AGWPDVPAIE VLRQRYVRST DLAERQSIAV
ELQKLVIDNG VVAPLGQFLI PAAYSTAISG VLESPVTVFW NIKKSAK
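
The sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how the gene sequence might be cleaned, its GC fraction computed, and its codons translated with translation table 11. The helper names are hypothetical, and the sketch assumes an unambiguous A/C/G/T sequence with an ATG start codon; it does not handle the alternative start codons that table 11 permits.

```python
from itertools import product

# Amino acids of NCBI translation table 11, with codons ordered by
# bases T, C, A, G at each position ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate complete codons from position 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First two blocks of the gene sequence in this record.
first_blocks = clean_sequence("ATGAAACCCA TCTGCTTTTC")
peptide_start = translate(first_blocks)  # "MKPICF"
```

The translated fragment matches the first six residues of the listed protein sequence. Applying `gc_fraction` to the full cleaned gene sequence gives a value that can be compared with the GC content field of the record.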