Gene Athe_1005 details


Gene Information

Locus tag: Athe_1005
Symbol:
ID: 7407907
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 1102453
End bp: 1103793
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 33%
IMG OID: 643715370
Product: major facilitator superfamily MFS_1
Protein accession: YP_002572879
Protein GI: 222528997
COG category:
COG ID:
TIGRFAM ID:
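
The coordinate and length fields above can be cross-checked against each other. The following is a minimal Python sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1102453
end_bp = 1103793

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Each codon is 3 bp; the last codon is assumed to be the stop codon,
# which does not contribute an amino acid.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # 1341 446
```

Under these assumptions the computed values agree with the reported Gene Length (1341 bp) and Protein Length (446 aa).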


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000760178
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTCAAAT TAAACGCAGA CTTGACATCA GACAATTCAC TGCGCAAAAG TCTCAATTTT 
GTAATACTTG GCATCACATT TGGCATAGTT TTTTTCAATG TAACAACAGG GTCACCAGTT
GCAGGATTTG CAAAGGCTAT AGGATTTGGC GATCTGATGT ATGGTGTGAT GCTTGCCCTG
CCAGTGCTTG GTGGTGTAGC GCAAGTTTTT GCATCTTATT TTCTTGAAAA GTCAAAAAAA
AGAAAGTTTA TATTTTTAAT AAGCGGATTT ATTCACAGAC TACCATGGGC ATTAATTGCC
ATTTTGCCAC TGATTTTAAG AAAAGGCTCG TATATTTTGT TATTCTTTCT GGTACTATTG
ATGACAATAT CTTCAATATC TAATTCGTTT ACAAATGTTT CTTTCTGGTC ATGGATTAAT
GACTTGGTCC CAATGCACAT AAGAGGTAGG TTCTTTTCCA GAAGAGCAAC AATCTCTACC
ATAGTTGGAA TGCTCAGCGG ACTTGCCATT GGTAAATTTC TGGACATTTA TAATAACCTT
TTAGGATTTT CTATAGTTTT TGTGTTTGCA GCTATAATGG GAATGCTTGA TATTGCTTGT
TTTTTCTTTG TAAAAGATAT TCCTATGAAG GTCCAAAATC AACAAACTGA TCTGAAAAAT
ATGTTTGTTT CAACACTTAA AAATAATCAT TTTAAAAAAT TTATGGTCTT TTTTATCATT
TGGAATTTTG GACTTAGCAT TGCAGGTCCG TACTTTAATA TGTATATGAT AAAAAACCTC
AAAATGAGTT ATTTTGATAT AATTCTCCTG ACCCAGATTG TGAGCAACAT TGTAACCATA
CTCACATTAC CATACATAGG AAGAGTGGTA GATAAAATAG GTAACAGACC CATGCTCCTT
TTTGCAGCAA GTATTTTGTC GTTCTTGCCT ATTGTATGGT GCTTTACTAA TGAAAACAAT
TACAAGTATT TAGTAGCTAT AATAAGTATT TTTGCAGGAC TGTTGTGGCC TATAATTGAT
ATGAGTAACA ATAATTTAAT CCTGAAACTA TCTGATCAAA CCCAAACATC TATGTATGTT
GGTGTTATAA ACATGTTCAA TGCAATATTT GGATCGGCTA TTCCAATTAT ACTTGGAGGC
TATCTTATAG AAGATATCGC ACCTTATGTT GTTACTTTTT TCAAAAATTA TATGCATTTT
GATATAACTA CATATCATGT TGCATTCTTT GTATCAGGTT TTTTAAGATT TTTATCTGTG
ATTTATCTTA AAAAGAACGT AAAGGAACCC GGCGCAAAGA GCCTCAAGAA TGTTATCAAG
AGTAAAATAA AAAGATCATA A
 
Protein sequence
MFKLNADLTS DNSLRKSLNF VILGITFGIV FFNVTTGSPV AGFAKAIGFG DLMYGVMLAL 
PVLGGVAQVF ASYFLEKSKK RKFIFLISGF IHRLPWALIA ILPLILRKGS YILLFFLVLL
MTISSISNSF TNVSFWSWIN DLVPMHIRGR FFSRRATIST IVGMLSGLAI GKFLDIYNNL
LGFSIVFVFA AIMGMLDIAC FFFVKDIPMK VQNQQTDLKN MFVSTLKNNH FKKFMVFFII
WNFGLSIAGP YFNMYMIKNL KMSYFDIILL TQIVSNIVTI LTLPYIGRVV DKIGNRPMLL
FAASILSFLP IVWCFTNENN YKYLVAIISI FAGLLWPIID MSNNNLILKL SDQTQTSMYV
GVINMFNAIF GSAIPIILGG YLIEDIAPYV VTFFKNYMHF DITTYHVAFF VSGFLRFLSV
IYLKKNVKEP GAKSLKNVIK SKIKRS
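
The protein sequence relates to the gene sequence through codon-by-codon translation, and the GC content is the share of G and C bases. The following is a minimal Python sketch of both, using only the first 30 bases of the gene sequence above. It assumes the standard amino acid assignments, which translation table 11 shares with the standard code. The helper names `clean`, `translate`, and `gc_percent` are hypothetical and not part of the record.

```python
# Standard codon table in NCBI base order (T, C, A, G).
# Assumption: translation table 11 uses the same amino acid assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of bases that are G or C."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence above.
fragment = "ATGTTCAAAT TAAACGCAGA CTTGACATCA"
print(translate(fragment))  # MFKLNADLTS
```

The printed fragment matches the first ten residues of the protein sequence above. Passing the full gene sequence to `gc_percent` would give a figure to compare against the reported GC content.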