Gene Veis_4059 details

Gene Information

Locus tag: Veis_4059
Symbol:
ID: 4694315
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Verminephrobacter eiseniae EF01-2
Kingdom: Bacteria
Replicon accession: NC_008786
Strand:
Start bp: 4452545
End bp: 4453582
Gene Length: 1038 bp
Protein Length: 345 aa
Translation table: 11
GC content: 65%
IMG OID: 639851806
Product: glycine betaine/L-proline ABC transporter, ATPase subunit
Protein accession: YP_998782
Protein GI: 121610975
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4175] ABC-type proline/glycine betaine transport system, ATPase component
TIGRFAM ID: [TIGR01186] glycine betaine/L-proline transport ATP binding subunit
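
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. Neither convention is stated in the record itself, and the variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4452545
end_bp = 4453582
gene_length_bp = 1038
protein_length_aa = 345

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
codon_count = span_bp // 3
residue_count = codon_count - 1

print(span_bp == gene_length_bp)           # span agrees with "Gene Length"
print(span_bp % 3 == 0)                    # length is a whole number of codons
print(residue_count == protein_length_aa)  # residues agree with "Protein Length"
```

Under these assumptions all three comparisons hold for this record.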


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0780892
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.409572
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAAGCC CGAAAATCAG CGTCAAAAAT CTCTACAAGG TCTTTGGCAG CAACCCCTCG 
CAAGCCATCC GACTGCTCGA CGAAGGGCGC AGCAAAGACG AGATTTTTGC CCAGACCGGG
CAGGTGGTGG GCATCAACAA GGTCAGCTTC GACGTGCTGG CCGGCGAAAT TTATGTGCTG
ATGGGCTTGT CAGGCTCGGG CAAATCGACC CTGATCCGGC TGATCAACCG ACTGGTCGAA
CCGTCCTGCG GTTCGATCAA CATCGACGGG CTGGACATCG CCGCGCTGTC GCAGGCCGAA
CTGGTCAAGT GGCGCCGCAA ACGGGTGGCG ATGGTGTTCC AGTCGTTTGC GCTGATGCCG
CACCGCAATG TGCTGTCCAA CACCGCCCTC GGGCTGGAGA TGGCCGGCAC GCCGCGCCAG
CAGCGCGAAG CCCGCGCCAT GGAGGTGCTG GCCCAGGTCG GCCTGCAAAC CTACGCCGCC
AAATACCCGG CGCAACTGTC CGGCGGCATG CAGCAGCGCG TCGGGCTGGC CCGGGCTTTG
GCGGTGGACC CCGACATCCT GCTGATGGAC GAGGCCTTCT CGGCGCTCGA TCCGCTCAAG
CGGGTCGAAA TGCAAAGCCT GCTGCTCGAC TTGCAGCGCG AGCAGCAGCG CACCGTGCTG
TTCGTCTCGC ACGACCTGGA GGAGGCGCTG CGCATAGGCA ACCGCATCGC CATCATGGAA
GGCGGCAACC TGGTGCAGGA AGGCACGGCC CACCAGATCA TCACCGAGCC GGCCAACGCC
TACGTGCGCA AATTCTTCGA AGGCGTGGAC ACCTCGCGCT ATCTGACGGC GGCAGACCTG
CTCGACCCCC GGCTCAACGG CCACTCCTGG GACGGCGGTG CGCGCCTGTC CTGGTCGACG
CCGTTGCCCG AAGCGATGAA GATCGTGCTC GACAGGGACC AGCCGATCGG CGTCTTCGAT
GCCAGCGACC GCTTGCTCGG CTGCATCTCC GCGCGCAGCC TGCTCGACAG AATGTCCCGG
GAGGCACGCC ATGTCTGA
 
Protein sequence
MSSPKISVKN LYKVFGSNPS QAIRLLDEGR SKDEIFAQTG QVVGINKVSF DVLAGEIYVL 
MGLSGSGKST LIRLINRLVE PSCGSINIDG LDIAALSQAE LVKWRRKRVA MVFQSFALMP
HRNVLSNTAL GLEMAGTPRQ QREARAMEVL AQVGLQTYAA KYPAQLSGGM QQRVGLARAL
AVDPDILLMD EAFSALDPLK RVEMQSLLLD LQREQQRTVL FVSHDLEEAL RIGNRIAIME
GGNLVQEGTA HQIITEPANA YVRKFFEGVD TSRYLTAADL LDPRLNGHSW DGGARLSWST
PLPEAMKIVL DRDQPIGVFD ASDRLLGCIS ARSLLDRMSR EARHV
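
As a spot check that the two sequences correspond, the first and last lines of the gene sequence can be translated and compared with the ends of the protein sequence. This is a minimal sketch, not the database's own method: `translate` is a hypothetical helper, and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares for all 64 codons (table 11 differs in which codons may act as starts).

```python
from itertools import product

# Codon table in TCAG order; the third base varies fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace; '*' marks a stop codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# First 30 bases and final 18 bases, copied from the gene sequence above.
# The final line starts in frame because the 17 preceding lines hold 60 bases each.
first_bases = "ATGTCAAGCC CGAAAATCAG CGTCAAAAAT"
last_bases = "GAGGCACGCC ATGTCTGA"

print(translate(first_bases))  # MSSPKISVKN
print(translate(last_bases))   # EARHV*
```

The first result matches the opening ten residues of the protein sequence, and the second matches its final five residues followed by the stop codon TGA.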