Gene Athe_2430 details


Gene Information

Locus tag: Athe_2430
Symbol:
ID: 7408054
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 2574182
End bp: 2575462
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 39%
IMG OID: 643716793
Product: transposase IS605 OrfB
Protein accession: YP_002574271
Protein GI: 222530389
COG category: [L] Replication, recombination and repair
COG ID: [COG0675] Transposase and inactivated derivatives
TIGRFAM ID:
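
The start, end, gene length, and protein length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length; the record does not state either convention explicitly, and the variable names are illustrative only.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
start_bp = 2574182
end_bp = 2575462

# With inclusive coordinates, both endpoints count as bases.
gene_length = end_bp - start_bp + 1

# Assumption: a single uninterrupted reading frame whose last codon is a
# stop codon, which does not contribute a residue to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1281 0 426
```

Under those assumptions the computed values match the listed gene length of 1281 bp and protein length of 426 aa.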


Plasmid Coverage information

Num covering plasmid clones: 47
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAAAGG CAGAAAAAAT TAAGCAAACA CGGCAGCAAA CTAAAGAAAG AAGAAAAAAT 
CAGATACCTG TGGTATATCA ACTAAAAATT AATCTTAGTT CTGTTTCTAA AGAGACAAAA
AACAAACTCT CTAAATTGTT TTTAGAAGCG AAATGGCTGT ACAACTACAT AGTTGCTGAC
ATTGAAAACA GACTTAATAG CAATGCAGAC AAACTAAAAG AAGTCGAAAT AAAGGTCGGG
GAAAATTTTG AGAGAAGAAA AATTGAAAAT CTCAGTTCAC AGATGAAGCA GGCACTAAGA
GAAAGAATAA AGCAGAATTT GTACTCTCTT CATGTACAAA AAGAAAACGG ATATAGAACA
GGTAAACTGC AATTCAAGCA TTGTGTTAAT TCAATACCCC TTAAGCAATA CGGGAACACT
TTCAAGTTTG TCAACCAGCA GAAGACCAGA GTTAAAATAC AGGGCATTAA AAAACCTTTG
AGGGTACTGG GTGGGCATCA AATTCCCGGA GATAGCGAAA TAGCAAAAGC AGAACTGGTA
AGAAGACCAA GTGGCATTTA TCTTTTTGTA ACCTGCTACA TAGATAGGGA TAAATACACA
GGAGCGTGGA AACACAGAAA GAACAGAAAA GGTGTAGTAA AACCAAGGGT ATGGCAGACA
TTTGCAACTG ATAGTGGAAT TGATTTCAAA CCCACTGGAT TTATGTTATC AAACGGTCTC
AAACTTGAAT GGCAGATAAA AGAAACCAGG CGACTAAAGA AGTTGCAAAA ACAGTTTTCA
CGTCAGAAAA AAGGTTCTAA AAGATGGTAC AAAACAAAGG AAAAGATTGC AAAGGAGTAT
GAAAAGTTAA CGAACATCAA GTACGACGTA ATTAACAAAA CCTGTTCATT TTTGTACAGG
TACAGGAAGA TATGTTTTCA AGATGACAAC ATCAAGGGTT GGAAGAACGG ACATTTTTCC
AAATCAGTAC ACCACAGCGC AGTAGGAACA ATTAAGAGAA GACTGAGCGA CAGTCTTCGG
GTTTCTACTG CGGTGGTTAA AAGCAACGTA CCAACCACAA AGACATGCAG CAGGTGTGGA
AGTCAACAGG AAATTTCGCT GTCGGACAGA ATTTTCAGAT GTTCAGTGTG TAGCCTGGAA
ATTGATAGGG ATTTAAATGC AGCGATAAAC ATGTTGAAGG AAGTAGGGCT GGGCCGGTCC
GAACTTACGC CCGTGGAGTG GGAGACCGCT GCCAAGATAT TCAGGGGTAA TCCCTATATC
CTGGTAAGTC ATACCACGTA G
 
Protein sequence
MTKAEKIKQT RQQTKERRKN QIPVVYQLKI NLSSVSKETK NKLSKLFLEA KWLYNYIVAD 
IENRLNSNAD KLKEVEIKVG ENFERRKIEN LSSQMKQALR ERIKQNLYSL HVQKENGYRT
GKLQFKHCVN SIPLKQYGNT FKFVNQQKTR VKIQGIKKPL RVLGGHQIPG DSEIAKAELV
RRPSGIYLFV TCYIDRDKYT GAWKHRKNRK GVVKPRVWQT FATDSGIDFK PTGFMLSNGL
KLEWQIKETR RLKKLQKQFS RQKKGSKRWY KTKEKIAKEY EKLTNIKYDV INKTCSFLYR
YRKICFQDDN IKGWKNGHFS KSVHHSAVGT IKRRLSDSLR VSTAVVKSNV PTTKTCSRCG
SQQEISLSDR IFRCSVCSLE IDRDLNAAIN MLKEVGLGRS ELTPVEWETA AKIFRGNPYI
LVSHTT
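
The relationship between the gene sequence and the protein sequence above can be illustrated by translating codons directly. The following is a minimal sketch, not the pipeline used to produce this record: the `translate` helper and the constant names are hypothetical, and the codon-to-amino-acid assignments are written out as the standard table, which translation table 11 shares (table 11 differs in its permitted start codons, not in these assignments). Only the first and last lines of the gene sequence are used to keep the example short.

```python
# Codon table built in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_60 = "ATGACAAAGG CAGAAAAAAT TAAGCAAACA CGGCAGCAAA CTAAAGAAAG AAGAAAAAAT"
last_21 = "CTGGTAAGTC ATACCACGTA G"

print(translate(first_60))  # MTKAEKIKQTRQQTKERRKN
print(translate(last_21))   # LVSHTT
```

The two outputs match the first 20 and the last 6 residues of the protein sequence, and the final codon TAG is a stop codon, which is why the 1281 bp gene yields 426 residues.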