Gene Veis_3969 details


Gene Information

Locus tag: Veis_3969
Symbol:
ID: 4694310
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Verminephrobacter eiseniae EF01-2
Kingdom: Bacteria
Replicon accession: NC_008786
Strand:
Start bp: 4354484
End bp: 4355434
Gene Length: 951 bp
Protein Length: 316 aa
Translation table: 11
GC content: 61%
IMG OID: 639851718
Product: substrate-binding region of ABC-type glycine betaine transport system
Protein accession: YP_998694
Protein GI: 121610887
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components
TIGRFAM ID: [TIGR03414] choline ABC transporter, periplasmic binding protein
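
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the listed values are consistent with that reading) and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues encoded by a CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(4354484, 4355434)
print(length)                     # 951
print(protein_length_aa(length))  # 316
```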


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.277601
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGTTGA AAAAATTTGC TGCATTCGGG TGTGCGGCCT TGCTTGTTGC GTGGTCGAGC 
CCATCATTTG CCGAGCCGGA GCCGGCGAGT TGCCGCAATC TGCGTTTTGC CGACAACGGC
TGGACCGACA TCACGTCGGT GACCGCGCTG GCTTCGGTGG TCTTCGAGGC GCTCGGCTAC
AAGCCAAGCA CGACGATGGC GTCGGTGCCC ATCTCGTTTG CCGGTCTGAA GAACAAGCAG
CTCGATGTAT CGCTGGGCTA CTGGTGGCCG CTGCAGCAGT TGCAGGTTCA GCCGCTCCTC
GACAGCAAGT CGATCAACAT GATCGAACCG CCCAACCTGT CCGGCGCCAA GGCGACGCTT
GCGGTGCCAG GCTACGCTTG GGCAGCCGGC CTGAAGTCGT TCGACGATAT CGCCCGGTAC
CGCAAGGAGC TCGACGGCAA GATCTACGGG ATCGAGTCGG GCAGCAGTGC CAATGCGAAG
ATACAAAAGA TGATCGACCA GAACCTGCAC GGGCTTGGCG GCTTCAAGTT GGTCGAGTCC
AGCGAGGCCG GGATGCTGGT CACGCTCGAG CGTGCGATCC GCAACCAGAA GTGGCTCGTG
TTCTGGGGTT GGGAGCCGCA TCCGATGAAT ATCCAGTTCA GCATCAATTA CCTGTCGGGC
GGCGATGCGA CGTTCGGCCC CAACTACGGC GAGGCGCGCG TCTATACGCT GACCGCGACC
GATTTTCTTG AGCGCTGCCC CAACGCCGGC AAACTGGTCA CGCAGTTGCG CTTCTCGACG
CAGTTGGAGA ACCAGCTGAT GCAGGCGGTG ATGAACAAAA CCAGGCCGGC TGAGGCTGCG
CGTGCGTATC TGAAGCAAAA TCCCCAGGTG CTCGATCCGT GGCTTGCGGA CGTGAAGACC
TTCGATGGCA AGGATGGACT GGCGGCCGCG AAAGCGCAGC TCGGTCTGTG A
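
The GC content reported for this gene is the fraction of G and C bases in the nucleotide sequence. A minimal sketch of that calculation follows. It assumes the sequence is pasted as plain text with the 10-base spacing used above, and `gc_content` is an illustrative helper name rather than a function from any particular library.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G/C bases in a DNA string; whitespace is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# Example using only the first line of the gene sequence shown above;
# paste all lines into one string to compute the value for the whole gene.
first_line = "ATGGCGTTGA AAAAATTTGC TGCATTCGGG TGTGCGGCCT TGCTTGTTGC GTGGTCGAGC"
print(f"{gc_content(first_line):.1%}")
```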
 
Protein sequence
MALKKFAAFG CAALLVAWSS PSFAEPEPAS CRNLRFADNG WTDITSVTAL ASVVFEALGY 
KPSTTMASVP ISFAGLKNKQ LDVSLGYWWP LQQLQVQPLL DSKSINMIEP PNLSGAKATL
AVPGYAWAAG LKSFDDIARY RKELDGKIYG IESGSSANAK IQKMIDQNLH GLGGFKLVES
SEAGMLVTLE RAIRNQKWLV FWGWEPHPMN IQFSINYLSG GDATFGPNYG EARVYTLTAT
DFLERCPNAG KLVTQLRFST QLENQLMQAV MNKTRPAEAA RAYLKQNPQV LDPWLADVKT
FDGKDGLAAA KAQLGL
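
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard genetic code (table 11 differs in which codons may act as start codons). As a simplification, it does not convert alternative start codons such as GTG or TTG to methionine; that does not matter here because this gene begins with ATG. `translate` is an illustrative helper, not a library function.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order (first base varies slowest, third base fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon.

    Whitespace is ignored and trailing bases that do not fill a codon are
    dropped. Bases other than A, C, G, T raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        residues.append(aa)
    return "".join(residues)


# First 20 bases and last 12 bases of the gene sequence above.
print(translate("ATGGCGTTGA AAAAATTTGC"))  # MALKKF
print(translate("CTCGGTCTGTG A"))          # LGL
```

Passing the full gene sequence as one string should yield the 316-residue protein listed above, since the final TGA codon terminates translation.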