Gene Moth_2285 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2285 
Symbol 
ID3831317 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2396761 
End bp2397987 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content58% 
IMG OID637830205 
Productargininosuccinate synthase 
Protein accessionYP_431115 
Protein GI83591106 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0137] Argininosuccinate synthase 
TIGRFAM ID[TIGR00032] argininosuccinate synthase 


Plasmid Coverage information

Num covering plasmid clones45 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.117848 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGGAG TCTTTACTGT GGCCGAAAAG GTAGTTCTCG CCTATTCCGG CGGGCTGGAT 
ACCTCCATTA TTATCCCCTG GCTCAAGGAA ACCTATGGTT ATGAGGTTAT CGCCGTGGCC
GTCGATGTCG GCCAGGGAGA AGAGCTGGAA CCCCTGGAGG AAAAGGCTAT AAAGAGCGGG
GCCAGCAAGA TCTATATTCT GGATAAAAAG AAGGAGTTTG TGGAGGAGTA CATCTGGCCC
ACCCTCAAGG CCGGCGCTGT TTACGAGGGC AAGTACCTCC TGGGCACCTC CTTCGCCCGG
CCCCTCATTG CCAAATGCCT GGTGGAGGTC GCCGCACAGG AAGGGGCCAC GGCCGTGGCC
CACGGAGCCA CCGGCAAGGG CAACGACCAG GTGCGTTTCG AACTGGGGGT TAAGGCCTTA
AATCCCCAGC TAAAGGTCAT CGCTCCCTGG CGGATCTGGA ACATCCGCTC CCGGGAAGAG
GCCATGGACT ACGCCGCCGC CCGGGGCATC CCGGTGCCGG TGACTAAAGA CCGGCCCTAC
AGCATGGACC GTAACCTCTG GCACTTGAGC CACGAGGGAG GCGATCTCGA GGATCCCTGG
AATGCACCCG GGGACGACCT TTACCTGATA ATCACCCCGC CGGAACAGGC CCCGGATAAA
CCGACTTATG TAACCATCGA CTTTGAAAAG GGTATTCCGG TAGCCGTGGA CGGGGAAAAA
CTGGACGCCG TCGCCCTGGT GGAGAAGCTC AATGACCTGG CGGCGGCCAA CGGTGTGGGC
ATAGTTGACA TTGTAGAAAA TCGCCTGGTT GGTATGAAGT CCCGGGGCGT TTATGAAACC
CCCGGGGGGA CGATCCTCTA TACAGCCCAC CGGGAACTGG AGTACCTCAC CCTGGACCGC
ATGACCATGC ATTTCAAAGA AATGGTGGCC GCCAAGTACG CCGAGCTGGT TTACGACGGC
AACTGGTTCT CACCCTTGAA AAAAGCCCTG GACGCCTTTG TGGACAGCAC CCAGGAGACG
GTGACGGGCA CGGTGCGTCT AAAACTCTAT AAAGGCAGCT GCACCCCGGC CGGGGTCAAA
TCACCTTATT CCATCTACAA CGAGGACCTG GTCACCTTCG GTGCCGGCGG TGACTACGAC
CATAAGGACG CCACCGGTTT CATCAACCTC TTCGGCCTGC CCTTGAAGGT ACGGGCGCTG
ATGGAACAAA AAACTGGACT GAGATAG
 
Protein sequence
MKGVFTVAEK VVLAYSGGLD TSIIIPWLKE TYGYEVIAVA VDVGQGEELE PLEEKAIKSG 
ASKIYILDKK KEFVEEYIWP TLKAGAVYEG KYLLGTSFAR PLIAKCLVEV AAQEGATAVA
HGATGKGNDQ VRFELGVKAL NPQLKVIAPW RIWNIRSREE AMDYAAARGI PVPVTKDRPY
SMDRNLWHLS HEGGDLEDPW NAPGDDLYLI ITPPEQAPDK PTYVTIDFEK GIPVAVDGEK
LDAVALVEKL NDLAAANGVG IVDIVENRLV GMKSRGVYET PGGTILYTAH RELEYLTLDR
MTMHFKEMVA AKYAELVYDG NWFSPLKKAL DAFVDSTQET VTGTVRLKLY KGSCTPAGVK
SPYSIYNEDL VTFGAGGDYD HKDATGFINL FGLPLKVRAL MEQKTGLR