Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_1005 |
Symbol | |
ID | 7407907 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 1102453 |
End bp | 1103793 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 643715370 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_002572879 |
Protein GI | 222528997 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000000760178 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCAAAT TAAACGCAGA CTTGACATCA GACAATTCAC TGCGCAAAAG TCTCAATTTT GTAATACTTG GCATCACATT TGGCATAGTT TTTTTCAATG TAACAACAGG GTCACCAGTT GCAGGATTTG CAAAGGCTAT AGGATTTGGC GATCTGATGT ATGGTGTGAT GCTTGCCCTG CCAGTGCTTG GTGGTGTAGC GCAAGTTTTT GCATCTTATT TTCTTGAAAA GTCAAAAAAA AGAAAGTTTA TATTTTTAAT AAGCGGATTT ATTCACAGAC TACCATGGGC ATTAATTGCC ATTTTGCCAC TGATTTTAAG AAAAGGCTCG TATATTTTGT TATTCTTTCT GGTACTATTG ATGACAATAT CTTCAATATC TAATTCGTTT ACAAATGTTT CTTTCTGGTC ATGGATTAAT GACTTGGTCC CAATGCACAT AAGAGGTAGG TTCTTTTCCA GAAGAGCAAC AATCTCTACC ATAGTTGGAA TGCTCAGCGG ACTTGCCATT GGTAAATTTC TGGACATTTA TAATAACCTT TTAGGATTTT CTATAGTTTT TGTGTTTGCA GCTATAATGG GAATGCTTGA TATTGCTTGT TTTTTCTTTG TAAAAGATAT TCCTATGAAG GTCCAAAATC AACAAACTGA TCTGAAAAAT ATGTTTGTTT CAACACTTAA AAATAATCAT TTTAAAAAAT TTATGGTCTT TTTTATCATT TGGAATTTTG GACTTAGCAT TGCAGGTCCG TACTTTAATA TGTATATGAT AAAAAACCTC AAAATGAGTT ATTTTGATAT AATTCTCCTG ACCCAGATTG TGAGCAACAT TGTAACCATA CTCACATTAC CATACATAGG AAGAGTGGTA GATAAAATAG GTAACAGACC CATGCTCCTT TTTGCAGCAA GTATTTTGTC GTTCTTGCCT ATTGTATGGT GCTTTACTAA TGAAAACAAT TACAAGTATT TAGTAGCTAT AATAAGTATT TTTGCAGGAC TGTTGTGGCC TATAATTGAT ATGAGTAACA ATAATTTAAT CCTGAAACTA TCTGATCAAA CCCAAACATC TATGTATGTT GGTGTTATAA ACATGTTCAA TGCAATATTT GGATCGGCTA TTCCAATTAT ACTTGGAGGC TATCTTATAG AAGATATCGC ACCTTATGTT GTTACTTTTT TCAAAAATTA TATGCATTTT GATATAACTA CATATCATGT TGCATTCTTT GTATCAGGTT TTTTAAGATT TTTATCTGTG ATTTATCTTA AAAAGAACGT AAAGGAACCC GGCGCAAAGA GCCTCAAGAA TGTTATCAAG AGTAAAATAA AAAGATCATA A
|
Protein sequence | MFKLNADLTS DNSLRKSLNF VILGITFGIV FFNVTTGSPV AGFAKAIGFG DLMYGVMLAL PVLGGVAQVF ASYFLEKSKK RKFIFLISGF IHRLPWALIA ILPLILRKGS YILLFFLVLL MTISSISNSF TNVSFWSWIN DLVPMHIRGR FFSRRATIST IVGMLSGLAI GKFLDIYNNL LGFSIVFVFA AIMGMLDIAC FFFVKDIPMK VQNQQTDLKN MFVSTLKNNH FKKFMVFFII WNFGLSIAGP YFNMYMIKNL KMSYFDIILL TQIVSNIVTI LTLPYIGRVV DKIGNRPMLL FAASILSFLP IVWCFTNENN YKYLVAIISI FAGLLWPIID MSNNNLILKL SDQTQTSMYV GVINMFNAIF GSAIPIILGG YLIEDIAPYV VTFFKNYMHF DITTYHVAFF VSGFLRFLSV IYLKKNVKEP GAKSLKNVIK SKIKRS
|
| |