Gene Athe_1781 details


Gene Information

Locus tag: Athe_1781
Symbol:
ID: 7408568
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 1854408
End bp: 1855919
Gene Length: 1512 bp
Protein Length: 503 aa
Translation table: 11
GC content: 36%
IMG OID: 643716158
Product: ABC transporter related
Protein accession: YP_002573647
Protein GI: 222529765
COG category: [R] General function prediction only
COG ID: [COG3845] ABC-type uncharacterized transport systems, ATPase components
TIGRFAM ID:
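
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon; the variable names are illustrative and are not part of the record.

```python
# Values copied from the gene record; names are illustrative only.
start_bp = 1854408
end_bp = 1855919
gene_length_bp = 1512
protein_length_aa = 503

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons; the last codon is assumed to be the stop.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp, remainder, expected_protein_aa)  # 1512 0 503
```

Under those assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.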


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000368661
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAGTACA TTTTGCAGGT AAAGGATATT TCTAAAAGAT TTGGTAATAT TCAAGCAAAT 
GATAATGTGT TTTTGGATGT CAAAAAAGGT GAGGTACATG CCATACTTGG GGAAAATGGT
GCTGGAAAGT CTACTTTAAT GAATATCATC TATGGTCTTT ATACTCCTGA TTCTGGAGAG
ATATATTTTG AAGGTCAAAA ACTTGAAGTC AAAGGACCTC ATGAAGCAAT TGAAAAAGGA
ATAGGAATGG TTCATCAGCA TTTTATGTTG ATACCTGTAT TTACCGTGGC TGAAAATATT
GTTTTGGGAT TTGAGCCAAA AGGTTTTAGG TTTAATGTTC AAGAAGCTGA GAAGAAAATT
CTTGAGATTT CGAAGAAATA CAATTTAGAA ATTGACCCAA AGGCAAAAGT TGGAGATTTA
AGTGTAGGTA TGCAACAAAG AGTAGAGATA TTAAAGGCTT TTTACAGAGA TGCAAGGCTT
TTGATACTTG ATGAACCAAC AGCAATGCTA ACACCCCAAG AGACAAGGGA ACTTTTTAAG
ATTATAAATA ACCTGAAAGC TCAAGGGATA TCCATATTAT TTATAAGCCA CAAACTTGAT
GAGGTTATGG AAATTTCAGA TAGAGTAACT GTTATGAGAA GAGGAAAGAC AATAAAGACC
TTGAACACCA AAGAAACAAC CGAACAGGAA CTTGCAAATT TGATGGTCGG AAGAGAAGTT
AAACTTGTTG TTGAAAAGAC TGAACCGCGG TTAGGAGAGA CTGTGTTAAA GGTTGAAAAC
CTTTCAGTCA AACTGAAAAA CGGTGTTGAA AAGGTCAAAG ATGTAAGTTT TGAAGTAAGA
AGAGGAGAGA TTTTTGGTAT AGCAGGTGTT GATGGAAATG GACAAAATGA GCTTGTAGAA
GCTATTGTTG GACTTATTTC ATCAACAGGG AAAATAATCT TCAAAGGAGA GGAAATTCAA
AACCTTCCCA CCCGCAGACG TTACGAAAAA GGGATTGCTT ATATTCCAGC AGACAGGCAG
CAGGACGGGC TTGTTTTGAA CTTTACAGTG GCAGAAAACA TTGTGCTCAA AAGGTACTAT
AAAAAGCCAT ATTCTAATGG AGGTTTTTTA AATTATAAGG TAATAATCTC AGAAGCTGAT
AGACTCATAC ATGAATTTGA TGTGCGTCCA CCTGATTACA AGTTATTTGC AAAGAATCTT
TCAGGTGGCA ATCAGCAAAA GGTAATCTTG GCAAGAGAGT TTTCAAGCAG TCCAGACCTT
TTAATTGCTG TTCAACCAAC AAGAGGAATG GATGTGGGAG CTATAGAGTA CATCCATAGA
AAACTGATTG AACTTCGGGA CAGTGGTAAA GCAATACTAC TTGTTTCTTT AGAACTTGAT
GAGATTTTGA ATCTTTCTGA CAGGATTGCT GTGATGTATT CGGGCAGGAT TATGGATATT
TTGGAAAGTA AAAATGCAAC AAAAGAAGAG ATAGGACTTA TGATGATAGG CAAGAAAAAG
AAGGAGGCCT AA
 
Protein sequence
MEYILQVKDI SKRFGNIQAN DNVFLDVKKG EVHAILGENG AGKSTLMNII YGLYTPDSGE 
IYFEGQKLEV KGPHEAIEKG IGMVHQHFML IPVFTVAENI VLGFEPKGFR FNVQEAEKKI
LEISKKYNLE IDPKAKVGDL SVGMQQRVEI LKAFYRDARL LILDEPTAML TPQETRELFK
IINNLKAQGI SILFISHKLD EVMEISDRVT VMRRGKTIKT LNTKETTEQE LANLMVGREV
KLVVEKTEPR LGETVLKVEN LSVKLKNGVE KVKDVSFEVR RGEIFGIAGV DGNGQNELVE
AIVGLISSTG KIIFKGEEIQ NLPTRRRYEK GIAYIPADRQ QDGLVLNFTV AENIVLKRYY
KKPYSNGGFL NYKVIISEAD RLIHEFDVRP PDYKLFAKNL SGGNQQKVIL AREFSSSPDL
LIAVQPTRGM DVGAIEYIHR KLIELRDSGK AILLVSLELD EILNLSDRIA VMYSGRIMDI
LESKNATKEE IGLMMIGKKK KEA
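
The gene and protein sequences above can be checked against each other by translating codons. The following is a minimal sketch, not the database's own pipeline: the `translate` helper and the constant names are hypothetical, and it uses the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs only in the start codons it permits). Only the first three and the final sequence blocks are translated here, to keep the example short.

```python
# NCBI-style compact codon table: bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0; '*' marks a stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        b1, b2, b3 = (BASES.index(base) for base in dna[i:i + 3])
        protein.append(AMINO_ACIDS[16 * b1 + 4 * b2 + b3])
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence listed above.
first_block = "ATGGAGTACA TTTTGCAGGT AAAGGATATT"
last_block = "AAGGAGGCCT AA"

print(translate(first_block))  # MEYILQVKDI
print(translate(last_block))   # KEA*
```

The first output matches the opening ten residues of the protein sequence, and the second matches its final three residues followed by the TAA stop codon. The last block starts at base 1501, so it is in the same reading frame as the start codon.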