Gene Athe_1176 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1176 
Symbol 
ID7408758 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1269010 
End bp1270878 
Gene Length1869 bp 
Protein Length622 aa 
Translation table11 
GC content39% 
IMG OID643715541 
Producthypothetical protein 
Protein accessionYP_002573049 
Protein GI222529167 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00138392 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCAAGC TTCTAAACTT AAAGGTAAAA GATGAAGTGT ATGAAATTTT AAGCAGCTGC 
AAGGGAATAA TTATGCCAGA AAAGAGACTT GATTTTATAG ATTTATCCCT TGGTGGAAAA
GATAATATGG TGTTTGAGGT AAAGTATGAG GTAGAAGGTA AGGGTGAAGT AGTTGAGGCG
ATAGTTACAA GATGTAAAAA TGGTATTGTT GTAAATTATA CTGATGTGTA CATGAGAAGA
AGAGACCCAG ATAGTTTAAT TATTGGTGAT GATGGGGAAA CTGACAAACA ACGATATAAA
GACATTTATG GAGACAATTT TGAAAGAGTG AGGAAAGAAA CATTTGAATG GCTGAAAAAA
CAAGAATTAG TAGTATATGG ATTTTATGCT GGAGGAAAGG AACACGGGTA TCCTGCTCTT
GTAATAGCTC CGCTAAATGC CGCATTTTTT GGATTTGCAC TTGCTGATAT CCAGGGATTT
ATTCCAAAAA GCGAATTTGA AAAGATTGAT GTTTTTGAAC CAAAAGCAGT GATATATGTA
GCTCCACCTT TTAGACATAC ACATTTTAAC GGAAAACAAG TGGTTGTGCA CAATAGGTTA
AATGGCGTCC ATGAAATATT TTCATATAAC TTGTACCCAG GACCAAGTGC TAAAAAGGGA
GTATATGGTG TGCTTTTAAA CATCGGTGAA ATGGAAGGCT GGGTTGCTGC ACATGCTTCG
ACAGTCAGGA TTGTTACACC ATATGACAAT GTGATAACAA TAATGCACGA GGGAGCAAGT
GGTGGCGGAA AGAGCGAAAT GTGCCAGCAG ATGCATAGAG AAAAAGACAA TAGAGTCTTG
CTTGGAGAAA ATATTATAAC AAAAGAAAGA ATTTACCTTG AAATAAAAGA ATCTTGCGAG
ATACATCCAG TTACAGACGA TATAGCACTT GTTCATCCCA GCCTTCAAAA GGGTTCAAAA
ATGGTTGTAA AAGACGCCGA ACAAGGTTGG TTTGTAAGAC TTGATAATAT TCCACATTAC
GGTACAGACC CACAGCTTGA GAGGCTTTGC ATTCACCCAC CCGAGCCGTT AATATTCTTA
AATTTAGAGG GTGTGCCTGG TTCAACCTGT CTTATATGGG AACACACAAT GGATGAGCCA
GGAAAACCTT GCCCTAATCC AAGGGTTATT TTGCCTCGCA GGTTTATTCC GAACATTGTG
GATGAACCTG TTGAGGTTGA CATACGAAGC TTTGGTGTGA GAACACCACC TTGCACTAAA
CAAAAGCCGA CTTACGGTAT TATAGGGATG TTTCACCTTT TACCACCTGC ATTGGCATGG
CTGTGGAGGC TTGTTAGCCC CCGCGGTCAT GCTAATCCAA GTATAACGCA GGCTGAGGCT
TTGAGCTCTG AGGGTGTTGG TTCTTACTGG CCATTTGCAA CAGGACTTAT GGTAAAACAA
GCAAACCTTT TGCTTGAACA GATTTTACAG TTTACAAAAA CTCAGTATAT TCTCATTCCA
AACCAGCACA TAGGTGCATA CAAGGTTGGC TTTATGCCAC AGTGGATAAC AAGAGAATAC
TTAGCAAAAA GAGGTAATGT TAAATTAAGA CCTGATCAGC TAAAACCTGC AAAGCTGCCG
CTTTTGGGAT GGGCACTTGA ATACATGAAA GTTGAAGGAA CTTATATACC AAAGTTTTTA
CTCCAGGTTG ATCTCCAGCA AGAGGTTGGA GAAGAAGCCT ATATGGAAGG AGCAAAGATT
TTGACAGAAT TTTTCAAGAA AGAGATTATA AAATTCAAAA CATTGGATTT ACACCCACTT
GGAAGAAGAA TTATAGAATG CTGCTTGGAT GATGGAAGCA TAGATGACTA TGTATCTTTG
ATAAAATAG
 
Protein sequence
MIKLLNLKVK DEVYEILSSC KGIIMPEKRL DFIDLSLGGK DNMVFEVKYE VEGKGEVVEA 
IVTRCKNGIV VNYTDVYMRR RDPDSLIIGD DGETDKQRYK DIYGDNFERV RKETFEWLKK
QELVVYGFYA GGKEHGYPAL VIAPLNAAFF GFALADIQGF IPKSEFEKID VFEPKAVIYV
APPFRHTHFN GKQVVVHNRL NGVHEIFSYN LYPGPSAKKG VYGVLLNIGE MEGWVAAHAS
TVRIVTPYDN VITIMHEGAS GGGKSEMCQQ MHREKDNRVL LGENIITKER IYLEIKESCE
IHPVTDDIAL VHPSLQKGSK MVVKDAEQGW FVRLDNIPHY GTDPQLERLC IHPPEPLIFL
NLEGVPGSTC LIWEHTMDEP GKPCPNPRVI LPRRFIPNIV DEPVEVDIRS FGVRTPPCTK
QKPTYGIIGM FHLLPPALAW LWRLVSPRGH ANPSITQAEA LSSEGVGSYW PFATGLMVKQ
ANLLLEQILQ FTKTQYILIP NQHIGAYKVG FMPQWITREY LAKRGNVKLR PDQLKPAKLP
LLGWALEYMK VEGTYIPKFL LQVDLQQEVG EEAYMEGAKI LTEFFKKEII KFKTLDLHPL
GRRIIECCLD DGSIDDYVSL IK