Gene Athe_1661 details


Gene Information

Locus tag: Athe_1661
Symbol:
ID: 7409491
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 1754649
End bp: 1756298
Gene Length: 1650 bp
Protein Length: 549 aa
Translation table: 11
GC content: 28%
IMG OID: 643716030
Product: hypothetical protein
Protein accession: YP_002573528
Protein GI: 222529646
COG category:
COG ID:
TIGRFAM ID:
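
The coordinate and length fields above are mutually consistent, which a short check can illustrate. This is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon; the function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1754649, 1756298)
print(length, protein_length(length))  # 1650 549
```

Under those assumptions, 1756298 - 1754649 + 1 gives 1650 bp, and 1650 / 3 - 1 gives 549 aa, matching the listed values.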


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00190356
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGATGAGA TTGTTAAAGA AATATCTGCC AAAGAATTTT ATGAGATTTT TAAAAGGTTT 
AAGTATTTCA ATAATATTTT TATAGGCTTA GAAGTTAAGA ATAAAAACTT TGGATTGGGG
AAAATACAAT CATTTAAAAA AAATGCGGGC GATTTCTTAG TAACTGTAAA GTTTTTGAAC
GGTGAGTTTT TCAATTTCAG CATGAAAAGT TTTGAGATAT ATTTTCAATC GGTTAGGTTT
CCCACTAAGA AATTAAATGT AATAAATAAG TGGATGCAGA AAAAGGTTTC AATCGTTTTT
TCAAAGAAGT TATTTATTTC GAAAAGATTT AAGAGAAGAT ATTTTTATCA GAAGAAAAAA
GGATACAAAC CAGCATATTT TATGTTTAAA ACTTTTATTG AGCAGCTTAG ACAGAAATAC
AAAAGTTTCG GAATTTACCA TTATACAGAT TTTACTAACT TAGAATCTAT TTTTAAAGAG
GGTTTCTTGT ATAGCAGAGT GGATTGTATA AAAAAAGGAT TACAATTTAC AGATGGAGCA
GATCATAATG TATTAAGTAT TGCACCTTAT GATGTAAAAA ACAAGGTCAG ATTTTATTTT
AGACCCAATA CACCGACTCT TTTTGAAAAT GAGGGAATAA AACTTCCAGC TTACATTGGG
AAAGCTCATA TGCCTATTCC TGTGTGTTTG TTATTTGATG ATGAATTGAT TTTACTTGAC
ACCACAGAAT TTTCAAATGG TAATGCAACA AGTGTTAAGT ATACCCAAAT TGGTTGTACA
TATGCTTTTT TTAAATCAAT GGAATGGGAT TTAATATTTC ATGAAGGATA TATTGAACCT
TTTGAGAGGA ATAAAATAGT CAACAGAAGA AATTCTGAAT TGTTAAGTAC TACACCTGTA
TCTTTGAAAT ATTTGAAGAA AGTGATTTTT CGAACTAAAG CAGATTTTAA AAGAGCAAGT
AATTTGTTTG GTATGAATAA AAAGTTTTGT GTAGACATAA ATTATTTTTC AGACAAAAGC
AGGAATTATT ACTCAGAAGA AAAGATGGTG AATTTTATTG TGGATTACGA AGTTGATGTT
TATTTTAATA AACAAAGGAG AATTAGTTCA ATAAAATTGG AACTATATTC CTGGAGGCGT
CATGAAGATT ATAAAATTGA GGTAAGGTTT CTCGATAAGA GGGGAAATAT TCTCCCTTTT
GGCAATTTTA GAATTGAAAA ATCATTAAAT ATTTCAACTA CCGATGGAAT GTTTTTATAT
GTGGATCTTT ACAACGTAAA AGGAGATTTT TCTTGTGCAG AAAAATTAGA GGTATTGATG
AACAACGTCG TTTGTGTGGA AGAATTTTTA GATAGATTTA AAGTAAAAGA TATAAAAATT
AATTTTAATT CATGCGTGAA TGGTATTCAG ATTCTGTTCG AGGTAATGAT TGTAGATATA
AGTCTTTTGT TTAGTAAACA CAGTGCTTGG CTGTACACGT CAAATGATGA AAGAAAATAT
CCTGATTTAG AGGAACGAAA AATTATTTCT TTTGAAAAAA TATTATTGAA GCAAGAATAT
TATAATATTC AAAAAGAAGA TGTTGAAGTA ATAATCTATA TGGTAGATAA TAAAATTGTG
AAAAGTATAA TTGTTAGCAA AGGGGAATAA
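
A GC content figure such as the one in the Gene Information block can be recomputed from a sequence listed in this 10-base block layout. The following is a minimal sketch; the function name `gc_percent` is hypothetical, and how the source database rounds its percentage is not stated, so the result is left unrounded.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    dna = "".join(sequence.split()).upper()
    if not dna:
        raise ValueError("empty sequence")
    gc_count = dna.count("G") + dna.count("C")
    return 100.0 * gc_count / len(dna)


# First two 10-base blocks of the gene sequence, used only as a small example.
print(gc_percent("GTGGATGAGA TTGTTAAAGA"))
```

Passing the full gene sequence as one string (whitespace is stripped) gives the value for the whole gene.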
 
Protein sequence
MDEIVKEISA KEFYEIFKRF KYFNNIFIGL EVKNKNFGLG KIQSFKKNAG DFLVTVKFLN 
GEFFNFSMKS FEIYFQSVRF PTKKLNVINK WMQKKVSIVF SKKLFISKRF KRRYFYQKKK
GYKPAYFMFK TFIEQLRQKY KSFGIYHYTD FTNLESIFKE GFLYSRVDCI KKGLQFTDGA
DHNVLSIAPY DVKNKVRFYF RPNTPTLFEN EGIKLPAYIG KAHMPIPVCL LFDDELILLD
TTEFSNGNAT SVKYTQIGCT YAFFKSMEWD LIFHEGYIEP FERNKIVNRR NSELLSTTPV
SLKYLKKVIF RTKADFKRAS NLFGMNKKFC VDINYFSDKS RNYYSEEKMV NFIVDYEVDV
YFNKQRRISS IKLELYSWRR HEDYKIEVRF LDKRGNILPF GNFRIEKSLN ISTTDGMFLY
VDLYNVKGDF SCAEKLEVLM NNVVCVEEFL DRFKVKDIKI NFNSCVNGIQ ILFEVMIVDI
SLLFSKHSAW LYTSNDERKY PDLEERKIIS FEKILLKQEY YNIQKEDVEV IIYMVDNKIV
KSIIVSKGE
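
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below assumes the input is a complete coding sequence in frame, uses the standard codon assignments (which translation table 11 shares for sense codons), and forces the first codon to methionine, since GTG is an accepted start codon in table 11 and is read as M at initiation. The name `translate_cds` is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Standard amino acid assignments in TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(sequence: str) -> str:
    """Translate a complete CDS, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    residues = []
    for index in range(0, usable, 3):
        amino_acid = CODON_TABLE[dna[index:index + 3]]
        if amino_acid == "*":
            break
        # Assumption: the first codon is the initiator and is read as M.
        residues.append("M" if index == 0 else amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence.
print(translate_cds("GTGGATGAGA TTGTTAAAGA AATATCTGCC"))  # MDEIVKEISA
```

The output for the first 30 bases matches the first 10 residues of the listed protein sequence.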