Gene Moth_1122 details


Gene Information

Locus tag: Moth_1122
Symbol:
ID: 3833254
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1149813
End bp: 1151159
Gene Length: 1347 bp
Protein Length: 448 aa
Translation table: 11
GC content: 47%
IMG OID: 637829050
Product: citrate synthase
Protein accession: YP_429979
Protein GI: 83589970
COG category: [C] Energy production and conversion
COG ID: [COG0372] Citrate synthase
TIGRFAM ID:
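
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1149813
end_bp = 1151159
gene_length_bp = 1347
protein_length_aa = 448

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein length.
codons = span // 3

print("span matches gene length:", span == gene_length_bp)
print("span is a whole number of codons:", span % 3 == 0)
print("codons minus stop matches protein length:", codons - 1 == protein_length_aa)
```

Under these assumptions the three fields agree with one another: 1347 bp is 449 codons, one of which is the stop codon.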


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATCTCC AGGCCTGTCT TGATTATTAC TGCCAGCTGG CGGAAAAAAA TAATACGATT 
GATCCGGATT TATATGAAAA ATATAACGTC AAGCGCGGGC TAAGAAATAA AGACGGCACC
GGTGTCCTAG TCGGTTTAAC AGAGATCGGG GAAGTCCATG GCTATATCCT TGATGAAGGG
GAAAGAACCC CCGATCACGG TCGCTTGCTC TACCGGGGAA TTGATATCTG GGATATTGTA
AGGGGCTTTC AGCAAGAAGG GCGCTTCGGT TTTGAGGAAG TCTGTTACCT GTTACTCTTT
GGGGAATTGC CTGACAAGAA AAGGCTGGAG GAATTCACTC ATCTCCTTGC TGAACACAGG
TCCCTTCCCG ATGGTTTTGT AGAAGACATG ATTCTTAAGG CCCCCAGTAA CGATATTATG
AATAAATTAG CCCGTAGTGT CCTGGCGGCT TACTCCTACG ACCCCAATCC AGATGATATA
AGCATCCGCA ATGTTCTCAG GCAATGCATC GAGCTTATCG CCCGGTTTCC CATTATGGTA
GCCTATGGGT ATCAGGCGAA GGCTCACTAC TATGAGAATA AAAGTCTCTA TATCCATCTG
CCCCAGACAG AGCTAAGTAC GGCCGAGAAC TTCCTGTACA TGATCCGTCC AGACAACCAG
TACACATCCC TCGAAGCAGA GTTGCTGGAC CTGGCCCTGG TGCTCCACGC CGAGCATGGC
GGCGGTAATA ACTCGGCCTT CACCGTCCAC GTTGTATCTT CAACAGGCAC TGATACTTAT
TCCGTAATAG CCGCCGCAAC CGGGTCTTTA AAAGGGCCCA AGCACGGCGG GGCCAATATT
AAGGTCATGA AAATGATCGA AGATATAAAG AACAACGTTA AGGACTGGCA TGACGAGGAA
GAATTAAGGA ATTACCTTAT TAAAATCTTA CGCAAGGAGG CCTTTGACCG GAGCGGCTTA
ATCTATGGTA TCGGCCATGC CGTTTATACT CTTTCAGATC CCAGGGCTGT TTTATTAAAA
CAAAAGGCAG CCGAACTTGC CCGGGAAAAG GACATGGAAG ATGAATTTGG CCTTTACCTG
GCTATAGAAA AAATGGCACC GGAACTCTTT GTTGCCGAAA AGAAGGTCGA TAAGGTGGTC
GCTCCTAATG TTGACTTTTA TTCCGGTTTT GTTTATAAAA TGTTAAATAT TCCCATAGAG
TTATATACTC CCATTTTTGC CATATCCAGA ATAGCCGGCT GGTGCGCCCA CCGGATTGAA
GAGCTGGTAA GTGGCGGTAA AATCTTCAGG CCGGCTTATA AAAATGTCCT TGGCAAGAAG
AGTTATATAC CCCTGGAACA GAGGTAG
 
Protein sequence
MDLQACLDYY CQLAEKNNTI DPDLYEKYNV KRGLRNKDGT GVLVGLTEIG EVHGYILDEG 
ERTPDHGRLL YRGIDIWDIV RGFQQEGRFG FEEVCYLLLF GELPDKKRLE EFTHLLAEHR
SLPDGFVEDM ILKAPSNDIM NKLARSVLAA YSYDPNPDDI SIRNVLRQCI ELIARFPIMV
AYGYQAKAHY YENKSLYIHL PQTELSTAEN FLYMIRPDNQ YTSLEAELLD LALVLHAEHG
GGNNSAFTVH VVSSTGTDTY SVIAAATGSL KGPKHGGANI KVMKMIEDIK NNVKDWHDEE
ELRNYLIKIL RKEAFDRSGL IYGIGHAVYT LSDPRAVLLK QKAAELAREK DMEDEFGLYL
AIEKMAPELF VAEKKVDKVV APNVDFYSGF VYKMLNIPIE LYTPIFAISR IAGWCAHRIE
ELVSGGKIFR PAYKNVLGKK SYIPLEQR
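
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the amino acid assignments of translation table 11, which are the same as those of the standard genetic code (the two tables differ only in permitted start codons). The `translate` helper and the variable names are hypothetical, and only the first and last rows of the gene sequence are used here to keep the example short.

```python
from itertools import product

# Codon table in TCAG order. The amino acid assignments of translation
# table 11 match the standard code; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence (bases 1-60).
first_row = "ATGGATCTCC AGGCCTGTCT TGATTATTAC TGCCAGCTGG CGGAAAAAAA TAATACGATT"
# Last row of the gene sequence (bases 1321-1347, in frame because 1320 is divisible by 3).
last_row = "AGTTATATAC CCCTGGAACA GAGGTAG"

print(translate(first_row))
print(translate(last_row))
```

The two printed fragments can be compared against the start and end of the protein sequence listed above.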