Gene Moth_0881 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0881 
Symbol 
ID3831519 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp910272 
End bp911357 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content63% 
IMG OID637828811 
Productcarbamoyl-phosphate synthase small subunit 
Protein accessionYP_429741 
Protein GI83589732 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0505] Carbamoylphosphate synthase small subunit 
TIGRFAM ID[TIGR01368] carbamoyl-phosphate synthase, small subunit 


Plasmid Coverage information

Num covering plasmid clones43 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCAGTGC GCGGGTTTTT GGTGCTGGAG GACGGAACGG TATATAGCGG CGAGGCCTTT 
GGTTACCCCG GCCGCTCTCA CGGGGAGGTC GTTTTCAATA CCAGCATGAC CGGTTATCAA
GAGATCCTGA CCGACCCCTC CTATTGCGGC CAGATTGTAG CCCTGACCTA CCCCCTGATC
GGCAACTACG GCATTAACGA TGAGGATCTC GAGTCGGATG GCCCCCGGGT AGCCGGCTTC
GTCGTCCATG AAGCCTGCCC GCGGCCCAGC AACTGGCGGT CAACGGGTAG CCTTGATCAT
TACCTCCGGG AAAACCGCAT CCCGGCCCTG CAAGGGGTGG ATACCCGCGC CCTCACAAGG
CACCTGCGCC GACGGGGCAC CATGCGGGGC ATCCTGGCCA CGGGCGAGGT GGATTTGGAG
GAAATCAAGG CCCTGGCCGC TACCCGGCCG GCCCTGAGCG GCGCCAAACT GGTACCGGCG
GTTACCAATG CCAAGCCGTA TACCGTCGAG GGAGGGCCGC GCCGGGTAGT TCTCTATAAT
TTCGGCGTCA AGGAGAATAT CATCCGCTGG CTGCGCCGGG AGGGATGCAC CGTTACCGTC
ATGCCGGCCC GAAGTACAGC AGCCGCTATT CTGGCCCTCA ACCCCGAAGG GGTGGTCGTT
TCCAATGGCC CGGGCGACCC CAAGGACGTT CCCTACGGTG TGGCCACCGT CCGGGAACTA
CTGGGCCGGG TACCACTGAT GGGCATTTGC CTGGGCCACC AGCTCCTGGC TCTGGCCCTG
GGAGGCGATA CCTACAAACT CCCCTTCGGC CACCGCGGCG GCAACCACCC GGTTAAGGAT
TTAAGCACCG GTCGGGTCTA TATTACCTCC CAGAACCATG GTTACGCCGT CCGGGCTGAC
TCCCTGCCGA CAGGGGCGGT CGTCTCCCAT ATCAACCTCA ACGACGGCAC GGTGGAAGGC
CTGCGCCATC GGGAGTTGCC CGCCTTCTCC GTGCAGTATC ACCCCGAATC CTCGCCGGGA
CCGACGGATT CCGAGTACCT CTTCCACGAA TTTATCAGGC TGGTAGACGA ACACCGGGGG
CAATAA
 
Protein sequence
MPVRGFLVLE DGTVYSGEAF GYPGRSHGEV VFNTSMTGYQ EILTDPSYCG QIVALTYPLI 
GNYGINDEDL ESDGPRVAGF VVHEACPRPS NWRSTGSLDH YLRENRIPAL QGVDTRALTR
HLRRRGTMRG ILATGEVDLE EIKALAATRP ALSGAKLVPA VTNAKPYTVE GGPRRVVLYN
FGVKENIIRW LRREGCTVTV MPARSTAAAI LALNPEGVVV SNGPGDPKDV PYGVATVREL
LGRVPLMGIC LGHQLLALAL GGDTYKLPFG HRGGNHPVKD LSTGRVYITS QNHGYAVRAD
SLPTGAVVSH INLNDGTVEG LRHRELPAFS VQYHPESSPG PTDSEYLFHE FIRLVDEHRG
Q