Gene Veis_4048 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVeis_4048 
Symbol 
ID4695323 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVerminephrobacter eiseniae EF01-2 
KingdomBacteria 
Replicon accessionNC_008786 
Strand
Start bp4437361 
End bp4440378 
Gene Length3018 bp 
Protein Length1005 aa 
Translation table11 
GC content70% 
IMG OID639851795 
Productsarcosine oxidase alpha subunit family protein 
Protein accessionYP_998771 
Protein GI121610964 
COG category[E] Amino acid transport and metabolism
[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0404] Glycine cleavage system T protein (aminomethyltransferase)
[COG0492] Thioredoxin reductase 
TIGRFAM ID[TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.802614 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGGCA CCAGGGTCAG GCAGCCGGCC GCCCGCATCG ACGGCTCGCG CCAGTTGCGC 
TTTAGCTTCA ACGGCCGCGA CTACACCGGC CACCCCGGCG ACACGCTGGC CTCGGCGCTG
CTCGCGCAAG GGGTGCGGTG CGTGGCGCGC AGCTTCAAGT ACGGGCGGCC GCGCGGCATC
ATCGGCGCCG GCGCAGAAGA GCCGAATGCG CTGGTGCAGC TCGGCGTGGG CGCGCTCACG
ACGCCGAACG TCAAAGCCAC GCAGGCCGAG CTGTATGAGG GCTTGGTCGC CCATTCCACG
TCGGGCTGGC CGGCGCTGGC TTTCGACCTG AAATCGCTGC TCGGCCGGGG CGCGCGCTCC
ATGATGCCGG CCGGGTTCTA CGGCAAGACC TTCAAATGGC CGCGCCGGCT GTGGCCGCTG
TACGAGGCGG TGCTGCGCCG GTGCGCCGGC TGGGGCGCGG CACCCGGGTT GCCCGACCCC
GAGCGCTACG ACCACTTGCA CCACCATGTG GATGTGCTGG TGGTCGGCGC CGGCGCCTGC
GGCCTGCTGG CCGCGTTGCA GGCCGGGCAG GCGGGCCTGA AGACCTTGCT GCTCGACGAG
CAAAACGAGT TGGGCGGCTG GCTGTTGTCC GACCCGCGGG CGCGCATCGA CGGCCGCGAC
GGCCCGGCCT ACATCCGCTC GGTGCAGTCC GCCTTGGCGG GCTTGCCGCA GGTGCGCGTG
CTGACCCGCA CCACCGCCTT CGGCATGTAC GAGCACAACC TGGTGCAAGC GGTCGAACTG
GTGCAAGACC ATATCGCCCC GGCCGAGCGC CAGGCCCATC TGCCGCGCCA GCGCCTGCAC
AAGATCCGCG CGCGCCAGGT GGTTCTGGCC ACCGGCGCCA TCGAGCGCCC GCTGGTCTTT
GGCAACAACG ACCTGCCGGG CGTGATGACA GTCTCTGCCG GGCAGACTTT TTTGCAGCGC
TACGGCGTGC GGGTCGGGCA GCGCGTGGTG ATTTGCGGCA CCAGCGATCT GATCCACGAT
TGCGCCGAAG ACCTGGCCCA GGCCGGCGCC CGCGTCGTCG TGGCCGATGT GCGCCATGGC
GTGACCGCCC GCAGCAGCGC CTACCAGGTG TTGGGCGGCC ACGGCATTGC CCGGGCCATG
GGCCGCGGCC ACGTCAAAAG CGCGCACCTG GTGCCGCTGC ACGCCACGCG CGAGGAGGCC
ACAAGCGCGG GCCGGCATGT GGCCTGCGAC GTGGTGCTCA GTTCTGGCGG GCTGTCGCCA
ACGGTGCATC TGTTTTGCCA TGACGGCAGC CGCCCGCTCT GGGACGACGC AGCGGCGGCC
TTCGTGGCCC CCGGCACCGG GCGCCCGGGC GTGGCCTGCG TCGGGGCGGT CACCGGCGCG
TTCGAATTGC CGGCGGCGCT GGCGCAGACC ACACAGGCCA TGCACCGGGT GCTGGCCGCC
TGCGGCCGGC AGCGGTCATT GCAGACACCG GTTTGCCCGC CCCCGCCCCA ACGACGGGCG
GCGCGGCCGA TGTTCCTGAT GCCCTCCTGC TGCGCGTCCG ATGGCAAGCG GGCCAAGCTC
CATGCCAAGG CTTTCGTCGA CTACCAAAAC GACGTGACCG CCGCCGACAT CGGGTTGGCC
GTGCGCGAGA ACTACCACAG CATCGAGCAT GTCAAGCGCT ACACCGCGCT GGGTTTCGGC
ACCGACCAGG GCAAGCTGTC GAACGTCAAC GGCGTGGTGC TGACCGCGCG CGCGCTGCAG
CGCCCGGTCG GCGAGGTCGG CACCACCACC TACCGCCCGG CCTACACCCC GGTGAGCCTG
GGCGCGCTGG CCGGGACGAT GGTGCAAGAC TGCTTCGACC CGAGCCGCTA CACCGCGCTG
CATGAGGCCC ATGTGGCGCG CGGCGCGGCG CTGGAGCCGG TGGGCCAGTG GCTGCGGCCC
TGGTATTTCG CCCGCGCCGG CGAGGACCTG CGCGCTGCCG TGAACCGCGA ATGCCTGGCG
GCGCGCCATG GGGTGGCGCT GATGGACGCC TCGACGCTGG GCAAGATACA GATCGACGGC
CCGGACGCGC GCGAATTCCT CAACCGCATC TACGCCAACG CCTGGAGCCA GTTGGCGGTG
GGCAAATGCC GCTACGGCCT GATGCTCGAT GAAAACGGCA TGGTGATGGA CGACGGCGTG
ACCGCCTGCA TCACGCCCCG GCAGTTCTAC ATGACCACCA CCACCGGCGG CGCCGCCCGG
GTGCTGAACT GGCTCGAACG CTGGCACCAG ACCGAGTGGC CCGAGCTGAA GGTCTGGATG
ACCTCGGTGA CCGACCACTG GACGACCATC GCCCTGGTCG GCCCCAAGGC CCGGACGGTG
CTGGCCAGGC TGTGCCCGGA CATCGACCTG CGCGCCGACA GCTTCCAATT CATGGACTGG
CGCGCCGGCA CGGTGCATGG CCTGCCGGCC CGGGTGTTTC GCATCAGCTT TTCCGGCGAA
CTGGCGTATG AGCTGAATGT CGAATCCGGC TACGGCCACG CGCTGTGGGA AGCCGTGATG
GCCGCCGGCG CCGAGTTCGA CATCACGCCC TACGGCACCG AGACCATGCA TGTGCTGCGC
GCGGAAAAGG GTTTCATCAT CGTCGGCCAG GACACCGACG GCTCGATCAG CCCGCTGGAC
CTGGGGATGG GCTGGGCCGT GGGCATGAAA AAAACGTACA GCTTCCTGGG CAAACGCTCG
CTGGCGCGCA GCGACACCGC GCGCGACGAC CGCAAGCAGT GGGTCGGCCT GCTGACGCAA
GACCCGTCGG TGGTGCTGCC CGAAGGCGCG CAAATCATGG ACAGCGCCCG CACCGGCGCG
CACAACCGGA TGCTCGGCCA TGTGACTTCC AGCTACCACA GCGCCTTCCT GGGCCGCTCC
ATCGCGCTGG CCGTGGTCGC TGCGGGCCGG CAGCGGATCG GCCAGACCCT GTATGCGCAC
GCCCATGGAC GCGCCACCGC CGCGCAGGTG GTCGGCAGCG TGTTCGTCGA CCCGAAGGGG
GAGCGACAAA ATGTCTGA
 
Protein sequence
MSGTRVRQPA ARIDGSRQLR FSFNGRDYTG HPGDTLASAL LAQGVRCVAR SFKYGRPRGI 
IGAGAEEPNA LVQLGVGALT TPNVKATQAE LYEGLVAHST SGWPALAFDL KSLLGRGARS
MMPAGFYGKT FKWPRRLWPL YEAVLRRCAG WGAAPGLPDP ERYDHLHHHV DVLVVGAGAC
GLLAALQAGQ AGLKTLLLDE QNELGGWLLS DPRARIDGRD GPAYIRSVQS ALAGLPQVRV
LTRTTAFGMY EHNLVQAVEL VQDHIAPAER QAHLPRQRLH KIRARQVVLA TGAIERPLVF
GNNDLPGVMT VSAGQTFLQR YGVRVGQRVV ICGTSDLIHD CAEDLAQAGA RVVVADVRHG
VTARSSAYQV LGGHGIARAM GRGHVKSAHL VPLHATREEA TSAGRHVACD VVLSSGGLSP
TVHLFCHDGS RPLWDDAAAA FVAPGTGRPG VACVGAVTGA FELPAALAQT TQAMHRVLAA
CGRQRSLQTP VCPPPPQRRA ARPMFLMPSC CASDGKRAKL HAKAFVDYQN DVTAADIGLA
VRENYHSIEH VKRYTALGFG TDQGKLSNVN GVVLTARALQ RPVGEVGTTT YRPAYTPVSL
GALAGTMVQD CFDPSRYTAL HEAHVARGAA LEPVGQWLRP WYFARAGEDL RAAVNRECLA
ARHGVALMDA STLGKIQIDG PDAREFLNRI YANAWSQLAV GKCRYGLMLD ENGMVMDDGV
TACITPRQFY MTTTTGGAAR VLNWLERWHQ TEWPELKVWM TSVTDHWTTI ALVGPKARTV
LARLCPDIDL RADSFQFMDW RAGTVHGLPA RVFRISFSGE LAYELNVESG YGHALWEAVM
AAGAEFDITP YGTETMHVLR AEKGFIIVGQ DTDGSISPLD LGMGWAVGMK KTYSFLGKRS
LARSDTARDD RKQWVGLLTQ DPSVVLPEGA QIMDSARTGA HNRMLGHVTS SYHSAFLGRS
IALAVVAAGR QRIGQTLYAH AHGRATAAQV VGSVFVDPKG ERQNV