Gene Veis_3641 details

Gene Information

Locus tag: Veis_3641
Symbol:
ID: 4694577
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Verminephrobacter eiseniae EF01-2
Kingdom: Bacteria
Replicon accession: NC_008786
Strand:
Start bp: 4026097
End bp: 4027056
Gene Length: 960 bp
Protein Length: 319 aa
Translation table: 11
GC content: 64%
IMG OID: 639851396
Product: extracellular solute-binding protein
Protein accession: YP_998375
Protein GI: 121610568
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID:
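
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, assuming the start and end coordinates are 1-based and inclusive, and that the CDS ends with one stop codon that is not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(4026097, 4027056)
print(length)                     # 960
print(protein_length_aa(length))  # 319
```

Both results agree with the Gene Length and Protein Length fields listed in the record.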


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0351893
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAGCAGAT CGGTTGTGAA ATGGCTGGTG CTGGCCGCAG CATGCGTGCC GGGTCTGGCC 
GCACAGGCGC AGGCGCAAAA CACCAAACTG GTGCTGGGCA TGTCCGGCTG GACGGGCTTT
GCGCCGCTGA CGCTGGCCGA CAAGGCGGGC CTTTTCAGCA AGCATGGCCT GGATGTGGAG
ATCAAGATGA TTGCGCAAAA GGACCGCCAT CTGGCCCTGG CCGCCAAGTC GATTCAGTGC
GCTGCGACCA CGGTCGAGAC CCATGTGGCC TGGAATGCCA ACGGCGTGCC CATCGTGCAG
ATTTTTCAGA CGGACAAGTC CTACGGCGCC GACGGCCTGG CGGTGCGCGG CGATATCAAG
GGCTTTGCCG ATCTGCGCGG CAAGACCATT GGCGTGGATG CGCCGGGCAC CGCGCCTTTC
TTTGGCCTGG CCTGGATGCT CAGCAAGAAC GGCATGACGC TCAAGGATGT CAAGCTCACC
ACGCTGTCGC CCCAGGCTGC GGCCCAGGCT TTCGTGACCG GGCAAGGCGA TGCGGCGATG
ACCTACGAGC CGTATCTTTC CACCGTGCGC GACAACCCGG CTGCGGGCAA GATTTTGGCC
ACCACGCTCG ACTATCCGAT GGTGATCGAC ACGGTCGGCT GCGACCCCGC CTGGCTCAAG
GCCAACCCCC GGGCCGCACA GGCGCTGGCC GATTCCTATT TTGCGGCGCT GGACATGATC
CGGGCCGATC CCGCCAAGTC CAACGACATC ATGGGCGCGG CGGTCAAGCA GACGGGCGCA
CAGTTTGCCC GGTCGGCGTC GTTTTTGCGC TGGCAGGATC GGGCCGCGAA CCAGCGGTTT
TTCGCCGGCG AGCTGACCGC GTTCATGAAG GACGCCACGG CCATCTTGCT GGCGACCGGC
ATCATCCACA AAGCGCCGGA TGATCTGGCC GCGCTGTTCG ACGCACGCTT CGTGCAATGA
 
Protein sequence
MSRSVVKWLV LAAACVPGLA AQAQAQNTKL VLGMSGWTGF APLTLADKAG LFSKHGLDVE 
IKMIAQKDRH LALAAKSIQC AATTVETHVA WNANGVPIVQ IFQTDKSYGA DGLAVRGDIK
GFADLRGKTI GVDAPGTAPF FGLAWMLSKN GMTLKDVKLT TLSPQAAAQA FVTGQGDAAM
TYEPYLSTVR DNPAAGKILA TTLDYPMVID TVGCDPAWLK ANPRAAQALA DSYFAALDMI
RADPAKSNDI MGAAVKQTGA QFARSASFLR WQDRAANQRF FAGELTAFMK DATAILLATG
IIHKAPDDLA ALFDARFVQ
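
The gene and protein sequences can be checked against each other by translating the gene sequence with the listed translation table (11). The following is a minimal sketch in Python, not the database's own procedure. It assumes the codon-to-residue assignments of NCBI table 11, which match the standard table, and it does not handle alternative start codons (this gene starts with ATG, so that does not matter here). The helper names `clean`, `translate`, and `gc_percent` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Residues for the 64 codons in TCAG order (NCBI translation table 11;
# these codon assignments are the same as in the standard table).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks that group bases in tens."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


first_line = "ATGAGCAGAT CGGTTGTGAA ATGGCTGGTG CTGGCCGCAG CATGCGTGCC GGGTCTGGCC"
print(translate(first_line))  # MSRSVVKWLVLAAACVPGLA
```

Passing the full gene sequence to `translate` should reproduce the protein sequence above, and `gc_percent` on the same input can be compared with the GC content field.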