Gene Information |
Locus tag | Cthe_0262 |
Symbol | proA |
ID | 4808545 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 322558 |
End bp | 323853 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640105674 |
Product | gamma-glutamyl phosphate reductase |
Protein accession | YP_001036694 |
Protein GI | 125972784 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0014] Gamma-glutamyl phosphate reductase |
TIGRFAM ID | [TIGR00407] gamma-glutamyl phosphate reductase |
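The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 322558
end_bp = 323853

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the Gene Length (1296 bp) and Protein Length (431 aa) fields.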
| |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000913374 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTATCA GTGAAATGGC GTTTCAGGTC AAGGAGGCTT CTATCAGGCT TGCGGCAGCA GGTACCGAAT TAAAGAACAA AGCCCTTGAG AATATAGCCA GACTGTTAAT GGAGAGAAAA GACGAGATTA TAAAGGCAAA CAATGAAGAT TTACAGAGAA GCAGGGAGGA AAAGCTTGCA GAACCCCTTC TTAAGAGATT GAAATTTGAT GAAGCCAAGA TTGTGGATGT GATAGACGGT ATTAACAGCT TGATAAAGCT GGAGGATCCG GTGGGAAAAA CCCTTCTTTC CACTGAGCTT GACGAGGGAC TTGAGCTTTA TAGAGTTACC TGCCCGATAG GAGTTGTGGG AATAATTTTT GAGTCAAGAC CGGATGCCCT TGTTCAGATT TCCACGCTGT GTCTTAAAAG CGGAAACGGT GTTCTTTTAA AGGGGGGTTC CGAGGCCCGG GAGACCAATA AGATACTTGC CCAAATTATT ACCGAGGCCA CTGAAGAAGT TGGAATTCCG CCAAACTGGA TAAAACTTTT GGAGACTAGG GCGGATGTAA ATGAAATGCT TAAAATGGAC AAATATATTG ACCTAATCAT TCCAAGAGGT TCAAATGAGT TTGTAAGGTA TATTATGGAT AATTCCAGAA TTCCTGTAAT GGGCCATGCG GACGGAATAT GCCATTGCTA TATAGATGAG GATGCCGACA TTGACATGGC AATTCGGATT GTTGTGGATT CCAAAACCCA GTATGTTGCT GTCTGCAACG CAACGGAGAC ATTGCTGGTT CATAAAAATA TTGCTCCGAA AGTTCTGCCT GAGCTGAAAA GAGCTTTGGA CAGCAAAAAT GTGGAACTGG TGGGCTGCAG TGAAACACAG AAGATTATAC CTGTTGCCCC GGCAACGGAA GAGGACTGGA GGACAGAGTA TCTTGACTAC AAGCTGTCTG TTAAAGTAGT CGGTGATCTG GAGGAGGCCA TAGAACATAT CAACACCTAT GGTTCGGGGC ATACTGACAG TATAATTACA AACAGCAAAG AAAAGGCGGC AGCATTTATG TCCCTTGTAG ATTCGGGCAA TGTCTTCTGG AATTGCTCCA CACGCTTTAG TGATGGGTTT AGGTATGGTT TTGGCGCTGA AGTCGGAATC AGCACAAGCA AAATTCATGC AAGAGGACCG GTGGGACTTG ACGGATTGTT GATTTACAAG TACAAGCTTA TCGGCAATGG ACATATTGTG GAGGATTATG CCAAAAGGAC AAAAAGTTTC AAACATAATA AAATGAACAA GCAATTCCCT CTATAA
|
Protein sequence | MSISEMAFQV KEASIRLAAA GTELKNKALE NIARLLMERK DEIIKANNED LQRSREEKLA EPLLKRLKFD EAKIVDVIDG INSLIKLEDP VGKTLLSTEL DEGLELYRVT CPIGVVGIIF ESRPDALVQI STLCLKSGNG VLLKGGSEAR ETNKILAQII TEATEEVGIP PNWIKLLETR ADVNEMLKMD KYIDLIIPRG SNEFVRYIMD NSRIPVMGHA DGICHCYIDE DADIDMAIRI VVDSKTQYVA VCNATETLLV HKNIAPKVLP ELKRALDSKN VELVGCSETQ KIIPVAPATE EDWRTEYLDY KLSVKVVGDL EEAIEHINTY GSGHTDSIIT NSKEKAAAFM SLVDSGNVFW NCSTRFSDGF RYGFGAEVGI STSKIHARGP VGLDGLLIYK YKLIGNGHIV EDYAKRTKSF KHNKMNKQFP L
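The relationship between the two sequences above can be sketched in a few lines of Python. This is an illustrative example, not the database's own pipeline: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the codon table is built from the standard amino acid assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons, not in codon meanings).

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence listed above.
prefix = "ATGAGTATCA GTGAAATGGC GTTTCAGGTC"
print(translate(prefix))  # MSISEMAFQV, the first block of the listed protein
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way allows the Protein sequence and GC content fields to be checked against the record.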