Gene Information |
Locus tag | Moth_1345 |
Symbol | |
ID | 3831903 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1390704 |
End bp | 1392269 |
Gene Length | 1566 bp |
Protein Length | 521 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637829281 |
Product | amino acid permease-associated region |
Protein accession | YP_430201 |
Protein GI | 83590192 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0531] Amino acid transporters |
TIGRFAM ID | |
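The coordinates and lengths in this table can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the reported protein length excludes the stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 1390704
END_BP = 1392269


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete, unspliced CDS, excluding the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1566 521
```

Both values agree with the Gene Length (1566 bp) and Protein Length (521 aa) rows.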
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000000295081 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCACGG CAACAGAAAA ATTGGAGCTC CGCCGGGAGG TTACCGTGTG GGGCTCCTAC ATGTGGGGTT ACGCGGATGT CGGTGCCGAC ATTTATGCCG CCCTGGGACT GGTTATGCTC TGGGCTAAAG GTGCTACCAG CCTGGCCTTC GCCTTGGCCG GTCTGGTTTA CATCATGATC GGCCTGGCTT ATACCGAGCT TGCCGCCGCT TACCCCGTTG CCGGCGGTGG CCAGTTTTTT ACCTTGCGCG GGCTGGGTGA TTTCTGGGGT TTTGTAGCCG GGGCAGCTTT GCTCTTGGAC TACACCATAG ATATCGCCCT TTTTGCGACT GCTTCCGCTG GCTACATCAA CTTCTTTCTG CCTTACCTTT TCGGGGTCAA TATTGACTCT CTGGCCGTCA GCATCGGTCC CCTGCACCAC GTCAACCTGG TGTGGATGGC AGAGGCCCTG GCCCTGGTGT TCTTCCTTAT TGCCCTCAAT ATTCGCGGTA TGAGGGAATC TTCTTTGCTC AATGAAGTCC TGGGTGCTAT TGATATCTTG ACGGAATCAA CCATTATCGT CTTTGGGTTT CTCTTTGCCT GGCGGCCGGA ACTCCTGGCC CATCAGTGGG TGACTCAGTT CCCGACTTTT AAGGAATTTG CCTATGGCTC CTCCCTGGCT ATTATCTCCT TTGTCGGCCT GGAGTCTATC TCCCAGGCGG CCCAGGAAAC CAAACGGCCG GCGACTGTCG TTCCCCGAAC TTCTGTCGGC CTGATCTTTA CTGTATTCAT TTTCGCCACC GCCTTTTCTA CCATGAGCCT GGGAGTCCTA CCCTGGCAGG ATATCGCTAA AGCCGTCGGC GATCCGGTGG CTACCCTGGC CCATGCCATC CCCTTTATTG GCATTATTGC CGGCCCTTTT GCCGCCCTGC TGGGGGCTAC CATCCTGCTC ATTTCGGCCA ACTCCGGGGT CATGAGCGCT TCCCGGTTGA CCTTTGCCAT GAGCCAGTTT AATTTTATCA GCGACTGGTT CAACGCCGTG CACCCTCGTT ACCGGACTCC TTACCGCACT ATCCTTGTCT TCTCGGGCAT TGGTATCCTT CAGTTAGTTC TTTCTTTCCT GACGCCCAAT GCTATGGATA CCCTGGGTAA CATGTATGCC TTCGGGGCTA CCACAGGCTA TATCCTGGTT TTTATTGCCC TGATTAAACT ACGCTTCACC GATCCCTATG CACCCCGGCC CTATAAAGTG CCTTTAAACA TAAAGATAAA TTATCGTGGC CGGGTGGTGG AGTTCCCCAT CCTGGGGGTA ATCGGTACCC TGGGTATCAG CACTATACTC TTCGAAGTCA TTCTTACCCA TGCCATTGGC CGTATCGCCG GGCCGGCGTG GATTATCCTC TGTTTCCTCT ACTACGCTTA TTATCGCCGG AGCAAGGGCT ACCCCATTTT CGGCAACATC CCCCGGGACT GGGAAGCCCA GCAAATAAAG GTCCTGGAAG CGGCCGAAGA GTACGACCTT CTCGAGGAAT ACAAGCAGGC TCTTGCAGAA CGGGAACGCC TGGAGGCTAA GGCCCATGTC AAGTGA
|
Protein sequence | MATATEKLEL RREVTVWGSY MWGYADVGAD IYAALGLVML WAKGATSLAF ALAGLVYIMI GLAYTELAAA YPVAGGGQFF TLRGLGDFWG FVAGAALLLD YTIDIALFAT ASAGYINFFL PYLFGVNIDS LAVSIGPLHH VNLVWMAEAL ALVFFLIALN IRGMRESSLL NEVLGAIDIL TESTIIVFGF LFAWRPELLA HQWVTQFPTF KEFAYGSSLA IISFVGLESI SQAAQETKRP ATVVPRTSVG LIFTVFIFAT AFSTMSLGVL PWQDIAKAVG DPVATLAHAI PFIGIIAGPF AALLGATILL ISANSGVMSA SRLTFAMSQF NFISDWFNAV HPRYRTPYRT ILVFSGIGIL QLVLSFLTPN AMDTLGNMYA FGATTGYILV FIALIKLRFT DPYAPRPYKV PLNIKINYRG RVVEFPILGV IGTLGISTIL FEVILTHAIG RIAGPAWIIL CFLYYAYYRR SKGYPIFGNI PRDWEAQQIK VLEAAEEYDL LEEYKQALAE RERLEAKAHV K
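The protein sequence follows from the gene sequence by codon-wise translation. The sketch below is a minimal, dependency-free illustration and not the pipeline that produced this record. It builds the codon table from the standard genetic code, which translation table 11 shares for elongation codons (table 11 differs in which codons may act as start codons; this gene starts with ATG, so that difference does not come into play here). The names `translate`, `gc_fraction`, `head` and `tail` are hypothetical, and only short fragments copied from the record are used; pasting the full Gene sequence field into `translate` works the same way, since whitespace is stripped.

```python
# Standard genetic code in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence, as printed in the record.
head = "ATGGCCACGG CAACAGAAAA ATTGGAGCTC"
# Last 15 bases; the leading G of the block "GGCCCATGTC" is dropped to stay in frame.
tail = "GCCCATGTC AAGTGA"

print(translate(head))    # MATATEKLEL
print(translate(tail))    # AHVK
print(gc_fraction(head))  # 0.5 for this fragment only, not the whole gene
```

The two translated fragments match the first ten and last four residues of the protein sequence above.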