Gene Moth_2289 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2289 
SymbolargJ 
ID3831321 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2401323 
End bp2402540 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content64% 
IMG OID637830209 
Productbifunctional ornithine acetyltransferase/N-acetylglutamate synthase protein 
Protein accessionYP_431119 
Protein GI83591110 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1364] N-acetylglutamate synthase (N-acetylornithine aminotransferase) 
TIGRFAM ID[TIGR00120] glutamate N-acetyltransferase/amino-acid acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones44 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.630289 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCAAG ACTTCCAGCC GGTTGCCGGC GGCATCACCG CCCCGCGGGG TTTTGTCGCC 
GCCGGCATTC ATGCCGGTTT GAAAAAGGAA AAATTGGACC TGGCTCTGAT TGTGAGCGAG
GTGCCGGCGA CGGCGGCGGC CGTCTATACC CGCAACCGGG TCAAGGCGGC GCCCCTGCGG
GTGACGGCGG AACACCTCAA GGCCGGCCTG GCCCGGGCCA TTGTCGCCAA CAGCGGCTAT
GCCAACGCCT GTACCGGGGA GCGGGGTTAC CGGGACGCCC GGGAGATGGC GGTAGTCACG
GCCGGAGCCG TCGGTTGCGA ACCGTGGCAG GTGGTGGTGG CCTCCACCGG CGTCATCGGC
GTGCCCCTGC CTATGGATAA AGTCACCGCC GGCATCCAGG CCGCTGCCGC CCGGCTGGCG
GTGGAGGGAG GCAGAGATGC GGCGGCAGCC ATCATGACCA CCGATACCCG GATCAAAGAG
ATCGCCATCC AGTTGCCCCT GGGGGGAGAG ACGGTAACTA TTGCCGGCAT CGCCAAGGGG
TCGGGCATGA TCCACCCCAA TATGGGCACC ATGCTCTGCT TCCTGACCAC CGACGCTGCC
ATCGAACAGG AGGATCTGGA ACAGGCCCTG AGGGTAGTGG TGGATCGGAC CTTTAATATG
GTGACCGTGG ACGGCGACAC CAGCACCAAC GACATGGCAG TCATCCTGGC CAACGGCTGC
GCCGGCAACG CCCCCTTGAC CATTGAAGAG CATGCCGCCT TCCGGTCCGG GTTGGAGTAT
GTCTGTCGCT ACCTGGCCCG CCTCATCGCC CGTGACGGGG AAGGGGCCAG TAAACTGATA
ACCGTTGAGG TTTATGGCGC GGCCAGCGAG GTCGAGGCCC GCCAGGTGGC CCGGTCCGTA
GCCGGTTCCA ACCTGGTCAA GAGTGCCATC TTCGGCGCCG ACGCCAACTG GGGCCGTATC
ATCTGTGCCG CCGGTTACTC CGGTGCTGAA ATCGACCCGG ATAAGATAGA CATCTACCTG
GAAAGCCACG CCGGCCGCGA GCAAATGGCC GCCGGTGGCG AGCCCCTGCC CTTCAGCGAA
GCAAAGGCGG CGGCCATTCT GGCGGAAGAG GAGATTACCA TCATCCTGGA TCTGAACCGG
GGCCGCGCCG CGGCTACAGC CTGGGGCTGC GACCTTACTT ATGATTATGT AAAGATTAAT
GCCTCTTACC GGACTTGA
 
Protein sequence
MTQDFQPVAG GITAPRGFVA AGIHAGLKKE KLDLALIVSE VPATAAAVYT RNRVKAAPLR 
VTAEHLKAGL ARAIVANSGY ANACTGERGY RDAREMAVVT AGAVGCEPWQ VVVASTGVIG
VPLPMDKVTA GIQAAAARLA VEGGRDAAAA IMTTDTRIKE IAIQLPLGGE TVTIAGIAKG
SGMIHPNMGT MLCFLTTDAA IEQEDLEQAL RVVVDRTFNM VTVDGDTSTN DMAVILANGC
AGNAPLTIEE HAAFRSGLEY VCRYLARLIA RDGEGASKLI TVEVYGAASE VEARQVARSV
AGSNLVKSAI FGADANWGRI ICAAGYSGAE IDPDKIDIYL ESHAGREQMA AGGEPLPFSE
AKAAAILAEE EITIILDLNR GRAAATAWGC DLTYDYVKIN ASYRT