Gene Athe_1839 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1839 
Symbol 
ID7408953 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1914071 
End bp1915798 
Gene Length1728 bp 
Protein Length575 aa 
Translation table11 
GC content34% 
IMG OID643716216 
ProductS-layer domain protein 
Protein accessionYP_002573705 
Protein GI222529823 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATTAT TTAAAAAGCG TCTTTTGCTG ATATGTGTAA TGTTGGTTTT TGCTATAGTA 
CAAATTTTTT CTGCAATTGC TTTTGCGCAA GGCACATCAA ATCCTATTTT TTCTGACCTT
CCTCAAAATC ATTGGGCATA CAATGCAGTG AAATTCATGG TAGAAAGAGG AATTATAACA
GGTTATCCAG ATAACACATT CAGACCAGAC AATCCAGTTA CAAGAGCTGA ATTTGCAAGG
ATTATGGTAA TTAGCTTGAA CCTTCCAATC AAAGTGACAG ATAATCCATC CTTTAAAGAT
GTTCCAAAAG ACCACTGGGC ATATCCACAT GTAGAGACTG CAAAATTTTA TTTGACAGGT
TTTAGAACTC AGAATGGTGA CTACTTTAAG CCATCTGACT ATGCGGTAAG AGAGGATATG
GCAGTTGCCC TTGTAAAAGC AAAGGGATTG CAGAATGAAA ATGTTGACCT GAGTATTTTA
AGTAACTACA TTGATAAAGA CCAGATATCA AAGAATCTTG TTAAACATGT CGCAATTGCC
ATTGCAAAGG GTATTATGGT GGGAAGCCCA GTTTCAAATT CTAATCAATA TAAGTTTGAC
CCGCAAGGAA TTCTAACACG TGCACAGGCA GCAGTGCTGT TGTATAATGT TATTAATGCT
CAATCAACTG AAGAAAAAGT TACCTATGAC GATTCTTCAT CAGGTTCTAA TCAGCAATAT
ACTTATCCTG TACCCAATGT TACTGCCTAT ACAAAGGGGG ACAGAGTTGT CCTGATATGG
AATAGAATAA ATGACAAAAA ACTGAAAGGA TATGCAGTTG TTATCTCAAA AAACAATAGC
CAGCCGGGAT ATCCGCAAGA TGGTTATCTT ACAATCTTAT CTGATAGAAA TGCCAATTAT
ATAGAAATTG GAGTAAACTC AAAATATAAC AATGGCGATT TTGGAGCTTA TATAAAGAGC
GGAGAAGAGT ATTATTTCAG TGTTACAGCA ATCTATGAAG GAAATGTCTA CGTAAAAGGC
AATGCTGTGA AAATGAGAAT GCCAGTTATA CCAAATTATT TTGAAAAACC ATCTGTTAAG
TATGAATATA AAGACAATAA ATTTGTTTTA AGCTGGCAAA AGATAGACGA TTTCCGACTT
ATAGGATATT GGATTGTGAT ATCCAAAAAG ACTAAAGAAC CTAAATATCC GGACAATGGT
TATCTTGTTT TTATCAATGA CAAAAATACA ACTCAGATTA TTATTGACAA CACAATTCCT
TACAAAAATG GAGATTTCGG TGAGTATTTA AAAGATGGTG AAGAATATTA TTTTAGTGTA
ACAGCTCAGT ATCAGGACAG GGTTGTCCCT GGGAATAGCA TCAAGGCCAT TTATTATTCT
AATTTGGAGA TTGCAAAATT AAGACCAAAG TTGCAGGCAA AAACAGTAAG ATGGAGGGGA
CAGTGGTATA TAAACTTAAG ATGGGACAAA ATTGATAGCG ATAAGCTTCA AGGATATAAA
GTTGTAGTAT CTGATAAAAA CTCAACACCC GACTTAAATA AAGATGGGTT ACTGGCAGTA
ATATCAGATA AAAACGTTAC TTCTGTAAAT ATCAAAGCAA AAGATAAGTA TTTACTAAAT
GGAGAATATA AGGAACTCAA AAGAGGACAT TACTATTACT TTACAGTTTA TGCTATTTAC
TCTGATAGAG TAGTAGATGG CAATGTAATT AGAATAAAGA TACCATAA
 
Protein sequence
MKLFKKRLLL ICVMLVFAIV QIFSAIAFAQ GTSNPIFSDL PQNHWAYNAV KFMVERGIIT 
GYPDNTFRPD NPVTRAEFAR IMVISLNLPI KVTDNPSFKD VPKDHWAYPH VETAKFYLTG
FRTQNGDYFK PSDYAVREDM AVALVKAKGL QNENVDLSIL SNYIDKDQIS KNLVKHVAIA
IAKGIMVGSP VSNSNQYKFD PQGILTRAQA AVLLYNVINA QSTEEKVTYD DSSSGSNQQY
TYPVPNVTAY TKGDRVVLIW NRINDKKLKG YAVVISKNNS QPGYPQDGYL TILSDRNANY
IEIGVNSKYN NGDFGAYIKS GEEYYFSVTA IYEGNVYVKG NAVKMRMPVI PNYFEKPSVK
YEYKDNKFVL SWQKIDDFRL IGYWIVISKK TKEPKYPDNG YLVFINDKNT TQIIIDNTIP
YKNGDFGEYL KDGEEYYFSV TAQYQDRVVP GNSIKAIYYS NLEIAKLRPK LQAKTVRWRG
QWYINLRWDK IDSDKLQGYK VVVSDKNSTP DLNKDGLLAV ISDKNVTSVN IKAKDKYLLN
GEYKELKRGH YYYFTVYAIY SDRVVDGNVI RIKIP