Gene Moth_2309 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2309 
Symbol 
ID3831423 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2428386 
End bp2429669 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content62% 
IMG OID637830233 
Productadenylosuccinate synthetase 
Protein accessionYP_431139 
Protein GI83591130 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0104] Adenylosuccinate synthase 
TIGRFAM ID[TIGR00184] adenylosuccinate synthase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0139427 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGCGG TAGTACTGGT TGGCGCCCAG TGGGGCGATG AAGGTAAGGG AAAAATCACA 
GACTACCTGG CTGAAAGGGC CGATGTGGTG ATTCGCTACC AGGGAGGTAG CAACGCCGGC
CATACGGTAA TGGTCGGCCA TGAGGAATTT AAACTGCACC TGGTGCCTTC GGGTATCCTC
TACCCCGGCA AGCTCTGTAT TATCGGTAAC GGTGTAGTCC TCGACCCGGC GGTCCTGGTG
GAGGAGTTAG ACGGCCTGGC GGCCCGGGGC GTGGATACCT CCGGTCTGAA GATCAGCAAC
CGGGCTCACC TGATCCTTCC CTACCACAAA GGCCTGGACG CCGCCGAGGA GGAACACCGC
GGTGCGGCCA TGATTGGCAC CACCAAAAGG GGTATCGGCC CGGCCTATGT GGATAAAGCC
GCCCGGACGG GTATCCGGGT GGGCGACCTC CTGGACTGGG AGGAGTTTAG CGCCAAAGTG
GCCCATAACC TGGCTGCCAC CAATGAACTC CTGGCTAAGA TTTATGACCG GCCGGGATAT
GATCTCCAGG CCATCCTGGA GGAATACGCC GGTTACGCCC GGCGCCTGCG GCCGCTGATT
GCCGACAGCG TTCGCCTGGT GAACCGGGCC CTCCAGGAGG GGCGTAAGGT TCTCTTTGAA
GGCGCCCAGG GGACCCTCCT GGACCTGGAT CAGGGAACCT ATCCCTTTGT GACTTCATCC
TATCCCGTTG CCGGCGGGGC CTGCATCGGC GCCGGCGTCG GCCCGACACG CATCGACAAG
GTCATTGGCG TGGTCAAGGC CTATACCACC AGGGTGGGTT CCGGCCCCTT CCCTACGGAG
ATTACCGGGC CTGCCGGTGA CGCCCTGAGG CAACAGGGCA TGGAATTCGG TACCACCACC
GGGCGACCGC GCCGCTGCGG CTGGCTGGAT ACGGTTATCC TGCGCCATGC TGCCGAGGTA
AACGGCCTGA CGGGTATCGC CCTGACCAAG CTGGACGTCC TGACGGGCCT TGATCCTTTA
AGAATTTGTA CCAGTTACCG CTACCGGGGG ACGGTGGGGG AAGATTTTCC GGCCAGCCTG
AAGGCATTAG AGGAGTGCGA ACCGGTTTAT GAGGAACTCC CGGGCTGGCA CGAAGACATT
ACCGGCGCTA GGTCCCTGGA TGACCTCCCG GCTAATTGCC GCCGTTATAT CCGGCGGCTG
GAAGAGCTCA CCGGCGTTCC CGTCCACCTC ATCGCCGTGG GCCCGCGCCG GGACCAGACC
ATTGTTTTGG AGAGTCCTTT TTAA
 
Protein sequence
MAAVVLVGAQ WGDEGKGKIT DYLAERADVV IRYQGGSNAG HTVMVGHEEF KLHLVPSGIL 
YPGKLCIIGN GVVLDPAVLV EELDGLAARG VDTSGLKISN RAHLILPYHK GLDAAEEEHR
GAAMIGTTKR GIGPAYVDKA ARTGIRVGDL LDWEEFSAKV AHNLAATNEL LAKIYDRPGY
DLQAILEEYA GYARRLRPLI ADSVRLVNRA LQEGRKVLFE GAQGTLLDLD QGTYPFVTSS
YPVAGGACIG AGVGPTRIDK VIGVVKAYTT RVGSGPFPTE ITGPAGDALR QQGMEFGTTT
GRPRRCGWLD TVILRHAAEV NGLTGIALTK LDVLTGLDPL RICTSYRYRG TVGEDFPASL
KALEECEPVY EELPGWHEDI TGARSLDDLP ANCRRYIRRL EELTGVPVHL IAVGPRRDQT
IVLESPF