Gene Information |
Locus tag | Moth_1122 |
Symbol | |
ID | 3833254 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1149813 |
End bp | 1151159 |
Gene Length | 1347 bp |
Protein Length | 448 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 637829050 |
Product | citrate synthase |
Protein accession | YP_429979 |
Protein GI | 83589970 |
COG category | [C] Energy production and conversion |
COG ID | [COG0372] Citrate synthase |
TIGRFAM ID | |
|
|
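The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption consistent with the listed 1347 bp, not something the record states) and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not contribute an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length(1149813, 1151159)
print(length)                           # 1347
print(expected_protein_length(length))  # 448
```

Both values agree with the Gene Length and Protein Length fields of this record.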
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATCTCC AGGCCTGTCT TGATTATTAC TGCCAGCTGG CGGAAAAAAA TAATACGATT GATCCGGATT TATATGAAAA ATATAACGTC AAGCGCGGGC TAAGAAATAA AGACGGCACC GGTGTCCTAG TCGGTTTAAC AGAGATCGGG GAAGTCCATG GCTATATCCT TGATGAAGGG GAAAGAACCC CCGATCACGG TCGCTTGCTC TACCGGGGAA TTGATATCTG GGATATTGTA AGGGGCTTTC AGCAAGAAGG GCGCTTCGGT TTTGAGGAAG TCTGTTACCT GTTACTCTTT GGGGAATTGC CTGACAAGAA AAGGCTGGAG GAATTCACTC ATCTCCTTGC TGAACACAGG TCCCTTCCCG ATGGTTTTGT AGAAGACATG ATTCTTAAGG CCCCCAGTAA CGATATTATG AATAAATTAG CCCGTAGTGT CCTGGCGGCT TACTCCTACG ACCCCAATCC AGATGATATA AGCATCCGCA ATGTTCTCAG GCAATGCATC GAGCTTATCG CCCGGTTTCC CATTATGGTA GCCTATGGGT ATCAGGCGAA GGCTCACTAC TATGAGAATA AAAGTCTCTA TATCCATCTG CCCCAGACAG AGCTAAGTAC GGCCGAGAAC TTCCTGTACA TGATCCGTCC AGACAACCAG TACACATCCC TCGAAGCAGA GTTGCTGGAC CTGGCCCTGG TGCTCCACGC CGAGCATGGC GGCGGTAATA ACTCGGCCTT CACCGTCCAC GTTGTATCTT CAACAGGCAC TGATACTTAT TCCGTAATAG CCGCCGCAAC CGGGTCTTTA AAAGGGCCCA AGCACGGCGG GGCCAATATT AAGGTCATGA AAATGATCGA AGATATAAAG AACAACGTTA AGGACTGGCA TGACGAGGAA GAATTAAGGA ATTACCTTAT TAAAATCTTA CGCAAGGAGG CCTTTGACCG GAGCGGCTTA ATCTATGGTA TCGGCCATGC CGTTTATACT CTTTCAGATC CCAGGGCTGT TTTATTAAAA CAAAAGGCAG CCGAACTTGC CCGGGAAAAG GACATGGAAG ATGAATTTGG CCTTTACCTG GCTATAGAAA AAATGGCACC GGAACTCTTT GTTGCCGAAA AGAAGGTCGA TAAGGTGGTC GCTCCTAATG TTGACTTTTA TTCCGGTTTT GTTTATAAAA TGTTAAATAT TCCCATAGAG TTATATACTC CCATTTTTGC CATATCCAGA ATAGCCGGCT GGTGCGCCCA CCGGATTGAA GAGCTGGTAA GTGGCGGTAA AATCTTCAGG CCGGCTTATA AAAATGTCCT TGGCAAGAAG AGTTATATAC CCCTGGAACA GAGGTAG
|
Protein sequence | MDLQACLDYY CQLAEKNNTI DPDLYEKYNV KRGLRNKDGT GVLVGLTEIG EVHGYILDEG ERTPDHGRLL YRGIDIWDIV RGFQQEGRFG FEEVCYLLLF GELPDKKRLE EFTHLLAEHR SLPDGFVEDM ILKAPSNDIM NKLARSVLAA YSYDPNPDDI SIRNVLRQCI ELIARFPIMV AYGYQAKAHY YENKSLYIHL PQTELSTAEN FLYMIRPDNQ YTSLEAELLD LALVLHAEHG GGNNSAFTVH VVSSTGTDTY SVIAAATGSL KGPKHGGANI KVMKMIEDIK NNVKDWHDEE ELRNYLIKIL RKEAFDRSGL IYGIGHAVYT LSDPRAVLLK QKAAELAREK DMEDEFGLYL AIEKMAPELF VAEKKVDKVV APNVDFYSGF VYKMLNIPIE LYTPIFAISR IAGWCAHRIE ELVSGGKIFR PAYKNVLGKK SYIPLEQR
|
| |
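The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short script. This is a sketch under stated assumptions, not the method the database itself uses: the gene sequence is taken to be written 5' to 3' on the coding strand, and the codon map is the standard one, which translation table 11 shares for amino acid assignments (table 11 differs in which start codons are permitted, and alternative starts are not handled here). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
# Standard codon-to-amino-acid assignments in NCBI "TCAG" order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence in this record.
prefix = "ATGGATCTCC AGGCCTGTCT TGATTATTAC"
print(translate(prefix))  # MDLQACLDYY
```

The translated prefix matches the first block of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the GC content field.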