Gene Athe_0473 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0473 
Symbol 
ID7407552 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp543223 
End bp544434 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content39% 
IMG OID643714861 
Productsmall GTP-binding protein 
Protein accessionYP_002572378 
Protein GI222528496 
COG category[R] General function prediction only 
COG ID[COG1160] Predicted GTPases 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0038153 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACACGA CACCGCGCAG TGAAAGGCTT CACATAGCTA TATTTGGAAA GCGAAACGCT 
GGTAAGTCAA GCTTAATCAA TGCCATCACA AACCAGCCAA TTGCGATTGT GTCTGACATG
CCAGGAACTA CTACCGACCC TGTATACAAA TCAATGGAAA TCTTGCCGCT CGGGCCTGTT
GTTTTGATTG ACACAGCAGG AATTGACGAT GAAGGCATAC TTGGCAAGCT CAGGATTGAA
AAGACACTGG AGGTTTTGAA CAAAACAGAT ATTGCTATTT TAGTGGTATC TGACATTGAT
GATTTCACGT ATGAAAAACA GCTTGTAAAA CTATTTGATG AGAAAAAAGT GCCAAGAATT
GGTGTTTTGA ATAAAATTGA CAAAGACCCA AATTACAAAG AAAAACTTTC TTTTTTGCAG
TCAAGTTTGG GAATGCCATT TTTAGCTGTG TCATGTGCCA CTTTAAAAGG CATTGATGAG
CTAAAAAATG CACTTTCAAA GCTTGTTCCA GATGTTGGTG AGGATTTGCG AATAGTAGGA
GATTTGATAA ACCCAGGCGA CTTTGCAGTT TTAGTTGTGC CAATTGACAA AGCTGCCCCA
AAAGGAAGAT TGATTTTGCC TCAGCAGCAG ACAATAAGAG ATATTCTGGA TTCTGATGCA
ATTGCAATTG TGACAAAGGA ATACGAGCTT AAAGAAACTA TCGAAAATCT TGGTAAAAAA
CCTGCAATTG TAATCACAGA CTCACAAGCT TTTTTAAAGG TTGACGCAGA CACCCCACCT
GATATTCCTC TTACTTCATT TTCAATTTTG TTTGCAAGAT ACAAAGGGGA TTTGGTGGAG
TTTGTTGAGG GAGTCAGGAA AATTAAAGAT TTAAAACCTG GAGATACAGT TTTGATTGCC
GAGGCATGTA CACACCACAG GCAATCAGAT GATATTGGAA CTGTTAAAAT TCCAAGGTGG
CTCCGCCAGA TAGCTGGATT TGATATAAAC TTTGAGTGGG TATCAGGCTA CAATTATCCA
AAAGACCTCA CAAAGTATAA GCTTATAATC CACTGTGGTG GGTGTATGAT AACACGAAGA
GAGATGCTAT TTAGAGTAGA ACTTGCTAAA AAACAGGGTG TGCCAATAAC AAACTATGGT
CTAATGATTG CATATGTCCA TGGAATCTTG CCAAGAGCAT TAAAACCGTT TGGGATTGAG
TTTGAATATT AA
 
Protein sequence
MNTTPRSERL HIAIFGKRNA GKSSLINAIT NQPIAIVSDM PGTTTDPVYK SMEILPLGPV 
VLIDTAGIDD EGILGKLRIE KTLEVLNKTD IAILVVSDID DFTYEKQLVK LFDEKKVPRI
GVLNKIDKDP NYKEKLSFLQ SSLGMPFLAV SCATLKGIDE LKNALSKLVP DVGEDLRIVG
DLINPGDFAV LVVPIDKAAP KGRLILPQQQ TIRDILDSDA IAIVTKEYEL KETIENLGKK
PAIVITDSQA FLKVDADTPP DIPLTSFSIL FARYKGDLVE FVEGVRKIKD LKPGDTVLIA
EACTHHRQSD DIGTVKIPRW LRQIAGFDIN FEWVSGYNYP KDLTKYKLII HCGGCMITRR
EMLFRVELAK KQGVPITNYG LMIAYVHGIL PRALKPFGIE FEY