Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_2628 |
Symbol | |
ID | 4072037 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 3100174 |
End bp | 3101202 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637984645 |
Product | glycine cleavage T protein, aminomethyl transferase |
Protein accession | YP_591703 |
Protein GI | 94969655 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0404] Glycine cleavage system T protein (aminomethyltransferase) |
TIGRFAM ID | [TIGR03317] folate-binding protein YgfZ |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.509615 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTAATT CCTTGCTTCA AGAGAAAACC GCGCCGGCAC GGATGGGCGA GTACCACGGT GCGCACGCAG CCGCCGTCTA CACCGACGTC GCCCGCGAAT TCGATGCCCT TCGTACCGGA GCCGCTGTTT ACGAAGCGAC CTGGCGCGCA AAGATCGTCG CGACTGGCGA AGACCGCGTC CGCTGGCTCA ATGGCATGAT CACGAACAAC GTTCGCGATC TCGCTGTCAG CCGCGGCGTA TACAGCTTTG TCCTTAACGC ACAGGGCCGC ATTCAGGGCG ATCTCATCGC TTTTCAGCGT GGCGATTACA TCCTGCTCGA AACCGATGAA TCGCAGGCCG AGTCGCTGAC CGCTCTGTTC GATCGCTTCA TCATCATGGA CGACGTTGAA ATCGCGAATG TCAGCGAGAA GCTGGCCTCC ATCGGTGTAA AGGGACCGAA GGCCGCAGAG GTCCTTCGCG AAGCTGGTTT CCCTGCCGAT CTCAAAGCAC TCGATGTCGT GGATGCGACC TGGAACGGCG TCGGCATTTC CGTCGCCTGC GGCGCGAGCG AGCAGTTCCC CGAATTCGAA ATCTGGTTCG CGCCGGAACA TACCGTGGCC GTGTGGGACG CGCTGGTTTC CGCCGGAGCT CAGCCCGTAG GCTACGAAGC CCTTGAGTTA CATCGCATTG CGACCGGCAT ACCCGCCTTC GGACAGGACA TTCGCGAGCG CGATCTCCCG CAGGAGACCG CGCAGAGTCA CGCGCTACAC TTCTCCAAGG GCTGCTACGT CGGCCAGGAG ATCGTCGAGC GCATCCACTC GCGCGGCAAT GTTCATCGCG GATTTACTGG TTTCTCACTT TCGCAACTCG TAAACTCCGG GACAAAGCTC GTCAGAGACG GCAAAGAAGT TGGCGAGATC ACCAGCGTTG CCGAACTGCC ATCCAAGAAA ATCATTGCCC TTGGCTATGT GCGCCGCGAG GCAGCTACCA GCGAGCTTGT TGCCGGTGAC GCAACTGCCA AAGTACATCC CTTACCGTTT GAGTTTTAA
|
Protein sequence | MRNSLLQEKT APARMGEYHG AHAAAVYTDV AREFDALRTG AAVYEATWRA KIVATGEDRV RWLNGMITNN VRDLAVSRGV YSFVLNAQGR IQGDLIAFQR GDYILLETDE SQAESLTALF DRFIIMDDVE IANVSEKLAS IGVKGPKAAE VLREAGFPAD LKALDVVDAT WNGVGISVAC GASEQFPEFE IWFAPEHTVA VWDALVSAGA QPVGYEALEL HRIATGIPAF GQDIRERDLP QETAQSHALH FSKGCYVGQE IVERIHSRGN VHRGFTGFSL SQLVNSGTKL VRDGKEVGEI TSVAELPSKK IIALGYVRRE AATSELVAGD ATAKVHPLPF EF
|
| |