Gene Moth_0753 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0753 
Symbol 
ID3831466 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp789606 
End bp790586 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content34% 
IMG OID637828684 
Productcarbamoyl phosphate synthase-like protein 
Protein accessionYP_429614 
Protein GI83589605 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones53 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.564232 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATATCT TATTTACATC GGCAGGTCGT AGAGTTGAAT TGATAAAATT ATTTCAGGAA 
GTACAAGAAG TAGATCGAGT TATAGCAGTA GATATAGGCA ATACTGCTCC AACTCTATAT
TGCGCAGACA AAGGTTTTGT TATCCCCAAA GAAGGTTCCG ATAATTATCT TGATGTACTG
CTGGAAATAT GCCGTAAATA TAATATACAC ATGATTATAC CGCTCTTTGA TTTGGAGCTT
CCTTTTCTTT CAAAACTGAA GGAAACTTTT CATAAAGAAG GAGTTGTTGT TTTAATCTCC
TCACCTCACG TAATCGAAAT TTGTTTAGAT AAACTTCAAA CTGCAAAATT TTTTAATGAA
CACCAGATAC CTACTCCTTT AACATATACA CCAGAAGATT TTTTATCAAG GGACATACAA
GGTTTCGATG GTCAATTTAT TGTTAAACCC CGTTTTGGAT CTTCCGGTAA AGGTGTAAAC
AAATGTAATA CCAAAGAAGA AATTAATTAT TTTATAAGAC AAAATGAGAC ATATATTATT
CAAGAATTTA TTAGGGGTTA TGAAGTTACC CTAGATGTTT TATGTGATTT TAAAGGTAAT
TGCATTTCGA TCGTTCCGCG AAAACGTCTT AAAGTACGTG GAGGAGAAGT AGAAAGAGCA
GTAACTATTG AAGCACCAAA TCTTTTAAAA ATAACCCAGG AAATTGTCTC TAAATTAAAC
GCAGTAGGGC CTATAAACAT TCAATGCTTT ATTACTGAAC AAGGACCTGT TTTTACAGAG
ATTAACCCAA GATTTGGTGG CGGATACCCC CTTTCCTATC ATGCTGGTGC GGATTTTCCT
AAAATGATTG TTAAAATGGC CTTAGGAGAA AAGATTGCTC CACGAATTGG AATTTATAGG
AGAAATTTAT ATATGCTAAG ATATGATACG GCAATTTATA AAGGAGAGGA TGAATTGATT
GATCAAAGTT GCTATCTTTG A
 
Protein sequence
MNILFTSAGR RVELIKLFQE VQEVDRVIAV DIGNTAPTLY CADKGFVIPK EGSDNYLDVL 
LEICRKYNIH MIIPLFDLEL PFLSKLKETF HKEGVVVLIS SPHVIEICLD KLQTAKFFNE
HQIPTPLTYT PEDFLSRDIQ GFDGQFIVKP RFGSSGKGVN KCNTKEEINY FIRQNETYII
QEFIRGYEVT LDVLCDFKGN CISIVPRKRL KVRGGEVERA VTIEAPNLLK ITQEIVSKLN
AVGPINIQCF ITEQGPVFTE INPRFGGGYP LSYHAGADFP KMIVKMALGE KIAPRIGIYR
RNLYMLRYDT AIYKGEDELI DQSCYL