Gene Information |
Locus tag | Acid345_3011 |
Symbol | |
ID | 4071566 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 3570182 |
End bp | 3571234 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637985030 |
Product | phosphoribosylformylglycinamidine cyclo-ligase |
Protein accession | YP_592086 |
Protein GI | 94970038 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0150] Phosphoribosylaminoimidazole (AIR) synthetase |
TIGRFAM ID | [TIGR00878] phosphoribosylaminoimidazole synthetase |
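
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the gene length includes the stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return length_bp // 3 - 1


length = gene_length_bp(3570182, 3571234)
print(length, protein_length_aa(length))  # 1053 350
```

Under those assumptions the computed values agree with the Gene Length (1053 bp) and Protein Length (350 aa) fields of this record.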
| |
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.000165502 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | TTGACTCAAG CCGACATGGA TACCAAGCCC GTTACGTACG CCGACGCCGG CGTTGATATT GAAAAAGCCA ACCGCACCAA GCAGCGCATT AAGTATTTGG CGCACAAGAC GTTCACCAAG AGCGTCCTGA GCGAGATTGG CGGCTTTGGC GGCCTTTTCC AGATCGATAA AAAGAAGTAC CTGGACCCGG TGCTCGTTTC GAGCGTGGAC GGCGTAGGCA CGAAGCTGAA AATCGCATTC GAAATGAACC TTCACCACAC GATTGGTGCG GACCTGGTCA ACCATTGCGT GAACGACATC GCGGTGCAGG GCGCGGCGCC GATGTTCTTC ATGGACTACC TGGCAACCGG CAAACTGGAT CCGGACATTG CGGAGAGGAT CGTTACCGGG CTCGCGGATG CTTGCAAGCA CAATGGCTGC GCGCTGATCG GCGGCGAGAC GGCCGAGATG CCGGGCTTCT ATCCCGACGG CGAATACGAT CTCGCTGGAT TCATCGTGGG AGTGGTCGAA CGCGATAAGG TCATCACTGG CAAAGAGGTT GTGCCGGGAG ATGTGCTGGT CGGGCTGCCG TCGAATGGGC TGCATACGAA CGGATATTCG CTCGCCCGGA AACTGCTCTT CTCCATCGCC GGATACTCGC CCGAAACGTA TGTAAATGCG ATTAAAGGCA AGGTCGGCAA CGAGCTGATG AAGACGCACA AGAGCTACTG GCCTGCGGTC CGTCGGCTGG TGGAGGCGGA GTGCGTAAGC GCGATGGCAC ACATTACAGG CGGCGGCATT ACCGAGAACC TGCCGCGCGT GCTGCCCAAG GGCACGGGCG CGGTGGTGGA ACTGGGATCG TGGCCGGTGC TGCCGATCTT TACCCACATG CAGCAGCTCG GGAATATCAG CCAGGACGAG ATGCTCCGCA CCTTCAACAT GGGTATCGGG ATGGTGCTGG TGATTCCGGC GAAGAAGTTC AAGAAGGTAC AGACAGTGCT GGAGCGCGCT GGGGAGAAGG GCTATACGAT CGGGCGCATT GTGAAGGGCG ACCGAAAAGT CAGCTACTCG TAA
|
Protein sequence | MTQADMDTKP VTYADAGVDI EKANRTKQRI KYLAHKTFTK SVLSEIGGFG GLFQIDKKKY LDPVLVSSVD GVGTKLKIAF EMNLHHTIGA DLVNHCVNDI AVQGAAPMFF MDYLATGKLD PDIAERIVTG LADACKHNGC ALIGGETAEM PGFYPDGEYD LAGFIVGVVE RDKVITGKEV VPGDVLVGLP SNGLHTNGYS LARKLLFSIA GYSPETYVNA IKGKVGNELM KTHKSYWPAV RRLVEAECVS AMAHITGGGI TENLPRVLPK GTGAVVELGS WPVLPIFTHM QQLGNISQDE MLRTFNMGIG MVLVIPAKKF KKVQTVLERA GEKGYTIGRI VKGDRKVSYS
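
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that, compute a GC fraction, and check that a gene sequence looks like a complete CDS. The helper names are hypothetical, and the start-codon set is limited to three commonly used bacterial starts (ATG, GTG, TTG), which is an assumption rather than the full initiator list of translation table 11.

```python
def clean_sequence(raw: str) -> str:
    """Join whitespace-separated blocks into one uppercase string."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Assumption: a reduced set of common bacterial start codons.
COMMON_STARTS = {"ATG", "GTG", "TTG"}
# Stop codons of the standard and bacterial (table 11) genetic codes.
STOPS = {"TAA", "TAG", "TGA"}


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the first and last codons
    are a common start codon and a stop codon, respectively."""
    return len(seq) % 3 == 0 and seq[:3] in COMMON_STARTS and seq[-3:] in STOPS


# Demo on the first two blocks of the gene sequence in this record.
head = clean_sequence("TTGACTCAAG CCGACATGGA")
print(len(head), gc_fraction(head))  # 20 0.5
```

Passing the full Gene sequence value through `clean_sequence` and `gc_fraction` gives a figure that can be compared with the GC content field. The record's sequence starts with TTG and ends with TAA, so `looks_like_complete_cds` applies to it directly.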