Gene Athe_0481 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0481 
Symbol 
ID7407560 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp552113 
End bp553495 
Gene Length1383 bp 
Protein Length460 aa 
Translation table11 
GC content34% 
IMG OID643714868 
Producthypothetical protein 
Protein accessionYP_002572385 
Protein GI222528503 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000859311 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAATA AAATTATTAC GAATTATAAC CTGATTAAGA ATACTCACAA CAATATTAAG 
GTCAATAGTT CCAATCATAG TGTTTCTGCA CAAACGCTTT CAACAGGTTG GATTGTGCCT
GATAAACCTT CTGTGCAGTC TACTGCTGCA TACAAAGTTA CACTTTCAAA CACTTCTTTG
CAAAAAGCTT CTACCAGCAG TAAAACTCAT GGTTCGAATA CACGAAATAG TGGAGGGATT
TTGGGTATTT TTTCAAATAT CAAAAAAGAT ATTAGTGACA CTACAAAAAG TATTCAGAAC
AAAGTCAACA ATTTTGTAAA ATCTACAGTT TCAAATTTGA ATAACACAGT TAAAACAGTA
GAGCACAAAA TTTCTTCAGT AGTAAAATCA ACTGGGGAAA AATTAGAGAC TGTTGCAAAG
GATATTAAAG AAGGTTTGAA AAAAGCTGTA GATGTTACAA CAACAAATAG TATTTCTGTT
GTAGGGAATA AAACTATAAA AGAGAAAAAA ATAACCTTGA ATGTAGCAGG CAACAAAATG
TATCTAAAAT TTACATCTTC AGTAAGTGGA GAAGTGGGAG TTGAAAAATC TTCTCAGTAC
AAAGCAAATA CAGAGAAGGC TTCTGGTTTT GTTGAGCACA GCAATTCTGG TAAATTGAAT
TTGGAATTAG AGAAGAGAAA AATTAGCAAG AGTTTTGAGA ATAGCACGAG TATGAAAGTT
AATGATAAAA CTGAGATTGT AAGTAATGTA ACAGTTAACA AAAGGGGTGC TGAGATTGCA
GGTGGAACAA AACTGGTGCT TTTGAAGACA CATAATCAGG AAGTTAATGT AACAGTTGGG
GGAGCAGTAA AGAGCAATGG AAAAGCTGAA ATAAATTTAG CTAAGGTAAC TCATTCTTTA
TCAACAGGTA AGATAACTTC CGAACAAAGC ATAGGGTTAA GCATTGATGA GAAAACTTTT
AATAAGATAA AGACCACTAT GCAGGGTGTA TGGCTCGCAG TACCAAATGA TACAAAGTTT
AAGATAGGAG TTGCTGTTGG AGTTATCAAA GGGGCTGTAA ATACTGTAAA GAGTTTAGTT
GATGTTGTAA CTCATCCAAA GCAAATCGCA GAAGGAGCAA GAGAATTAAT TAAACATCCG
CAGGTAGCAT TAAAATATGT AGAGCAATCT ATTGCTAAGG CAAAGGAAGA ATTTGTAAAT
GGTGATGATT ACAAAAGAGG AGAAATGGTA GGGGAAGCAC TGTTTGAAGT AGGGGTTAGT
ATAGCAGGTA CCAAAGGATT AGATAAATTA GCGAAGGCAG CCAAAGTATC TAATAATTTA
GGAAAACTTA AAAAAGTATT TGATGTTACG ACAAAAGTAG CAAAACCAGC TTTTGGTCAT
TAA
 
Protein sequence
MSNKIITNYN LIKNTHNNIK VNSSNHSVSA QTLSTGWIVP DKPSVQSTAA YKVTLSNTSL 
QKASTSSKTH GSNTRNSGGI LGIFSNIKKD ISDTTKSIQN KVNNFVKSTV SNLNNTVKTV
EHKISSVVKS TGEKLETVAK DIKEGLKKAV DVTTTNSISV VGNKTIKEKK ITLNVAGNKM
YLKFTSSVSG EVGVEKSSQY KANTEKASGF VEHSNSGKLN LELEKRKISK SFENSTSMKV
NDKTEIVSNV TVNKRGAEIA GGTKLVLLKT HNQEVNVTVG GAVKSNGKAE INLAKVTHSL
STGKITSEQS IGLSIDEKTF NKIKTTMQGV WLAVPNDTKF KIGVAVGVIK GAVNTVKSLV
DVVTHPKQIA EGARELIKHP QVALKYVEQS IAKAKEEFVN GDDYKRGEMV GEALFEVGVS
IAGTKGLDKL AKAAKVSNNL GKLKKVFDVT TKVAKPAFGH