Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_2293 |
Symbol | |
ID | 3831325 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2406550 |
End bp | 2407479 |
Gene Length | 930 bp |
Protein Length | 309 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637830213 |
Product | hypothetical protein |
Protein accession | YP_431123 |
Protein GI | 83591114 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2309] Leucyl aminopeptidase (aminopeptidase T) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 56 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.885941 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTGATC TAGCTTCAGC CTGCCGCCTG GCCCTGGCCG AGTGCCTGGC CGTCAGGGCG GGGGATAAGG TTCTCATAGT TACCGACACC GATTTGCAGC CCCTGGGGGA AGCCTTCTTC CGGGCCGCCA GGGAGCTGGG GGCCGAGGCG GCCGCGATAA CCATGCTTCC CCGGGACAAC CACGGCCAAG AACCACCGGA GATGGTGGCG GCAGCCATGC TCAAGGGCCA GGTGGTGGTC CTGGTTACCT CCAGGTCCCT CTCCCATACC AGGGCCCGCC GGGCGGCCAA TGACGCCGGC GCCCGCATCG CCTCCCTGCC CGGGGCCACG GCCGATATGC TGGAACGTAC CCTGGCAGTT GATTATGACG CCATGGCCGC GGAGTGCGAG GATTATGCCG CAATCCTTAC GGGGGGACAG GAGGTACACC TGACGACCCC GGCGGGCACA GATTTAACCT TCAGCATCGC CGGCCGCCGG GGGCACCCCG ATACGGGCCT CTATCGCCGG CCAGGAGATT TCGGCAATCT TCCTGCGGGC GAGGCCTATA TAGCTCCCGT GGAGGGAACC GCCCGGGGGA TACTGGTTAT CGACGGCGCC ATGTCCGGAA TCGGCTTTTT AAAGGAGCCC CTGCGAATCA GGGTGGAGGA AGGCCGGGCC GTGGAGGTCA GCGGCGGGGA GGCCCGTGCC CTGGAGGAAA TACTCAACCG TTATGGCCCG GAGAGCCGTA ATATTGCCGA ACTGGGTATC GGTCTCAATC CCCTGGCCAA ATTAACGGGC AACGTCCTGG AGGATGAAAA GGTGCGGGGT ACGGTCCATA TCGCCCTGGG GGACAACAGC ACCTTCGGGG GCCGGGTGGA AGCCCCCAGC CATCTGGATG GCATTCTGCT GCGGCCCCGG CTCAGCGTTG ACGGCCGGCA GGTTTTGTAG
|
Protein sequence | MPDLASACRL ALAECLAVRA GDKVLIVTDT DLQPLGEAFF RAARELGAEA AAITMLPRDN HGQEPPEMVA AAMLKGQVVV LVTSRSLSHT RARRAANDAG ARIASLPGAT ADMLERTLAV DYDAMAAECE DYAAILTGGQ EVHLTTPAGT DLTFSIAGRR GHPDTGLYRR PGDFGNLPAG EAYIAPVEGT ARGILVIDGA MSGIGFLKEP LRIRVEEGRA VEVSGGEARA LEEILNRYGP ESRNIAELGI GLNPLAKLTG NVLEDEKVRG TVHIALGDNS TFGGRVEAPS HLDGILLRPR LSVDGRQVL
|
| |