Gene Emin_1354 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_1354 
Symbol 
ID6263735 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp1458404 
End bp1459519 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content43% 
IMG OID642611835 
Productglycine cleavage system T protein 
Protein accessionYP_001876241 
Protein GI187251759 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0404] Glycine cleavage system T protein (aminomethyltransferase) 
TIGRFAM ID[TIGR00528] glycine cleavage system T protein 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.91248 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTATAC CACATAAATA TGGGGGCTTA ATGGCACACT CAACAGATAT ACAATTACAA 
AGGACTCCTT TGCATAAAGC CTGCGTAAAA GCGGGCGGCA AGATGGTTGA TTTTCACGGT
TGGGAACTTC CTATACAATT TGCAGGTATA ATAGCTGAAC ATAAATCCGT GCGCGAACAC
GCGGGGATTT TTGACGTCTC CCACATGGGC CAGCTTTTAA TGACCGGGCG CGACGTGCAT
AAATTTTTAG AATACGTAAC GTCTAATAAA ATAAAAAACA GTCCTTCTCA GGGCACTTAC
ACACATGTCC TAAATGAAAA GGGCGGCGTT GTTGACGATG TTGTGGCCTT TTGTAAAGAA
GAAGGTAAAT TCCTTGTTGT GGTAAATTCA GCCACAACTC ATAAAGATTT TAAATACTTT
TCAAAAATGA CTGCGGGTTT TGACGTTGTT GTTGAGGACC TCAGCTCCGA GTTTGGCATG
GTTGCCGTGC AAGGGCCGGA GGCTATGTCT CACGCGGAAA AACTTGTGCC CGGCATATCT
GAACTTCCCA GGTTTAATAT TAAAGAAGTT GTTTTATTTG GGCAGCGCTG TCTTATTACC
CGTACAGGCT ATACAGGTGA GGACGGACTG GAAATTATGG CTCCTCATAA AGCAATTGTT
GATATATGGA ACTTTTTTAT TGATTTAGGC GCGGCTCCCT GCGGTTTAGG CGCGAGGGAC
GTTTTAAGGC TTGAGGCGGG TTATCTTTTG TACGGCGTTG ACGTTGATGA CGAACATACC
TCCTACGAGG CTTCCTGCGG CTGGGTTGTT AAGCTTGACA AACCGGACTT TGTGGCAAAA
GCTATTTTGG CCAAACAAAA GGAAGAAGGC GTAAAAATTA AATTAACTTC TTTCCAGCTT
ACCGGGCCGG GAGTACCTAG AGAACATTGC AAAGTTTTTT TTAAAGGGGA GGAAATAGGT
TCTTTAACAA GCGGAACGTA TTCCCCCATT TTTAAAGGTA TCGGAAAAGG CTATGTAAAT
AGAATTTTAG AAATTGACGA TGAAGTTGAA ATAGAATCAG GCGCGCGCAA AATGACCGCG
AAAGTAGTAA AAAGTTTTTA CAAGAACAGA GTTTAA
 
Protein sequence
MFIPHKYGGL MAHSTDIQLQ RTPLHKACVK AGGKMVDFHG WELPIQFAGI IAEHKSVREH 
AGIFDVSHMG QLLMTGRDVH KFLEYVTSNK IKNSPSQGTY THVLNEKGGV VDDVVAFCKE
EGKFLVVVNS ATTHKDFKYF SKMTAGFDVV VEDLSSEFGM VAVQGPEAMS HAEKLVPGIS
ELPRFNIKEV VLFGQRCLIT RTGYTGEDGL EIMAPHKAIV DIWNFFIDLG AAPCGLGARD
VLRLEAGYLL YGVDVDDEHT SYEASCGWVV KLDKPDFVAK AILAKQKEEG VKIKLTSFQL
TGPGVPREHC KVFFKGEEIG SLTSGTYSPI FKGIGKGYVN RILEIDDEVE IESGARKMTA
KVVKSFYKNR V