Gene Acel_1971 details


Gene Information

Locus tag: Acel_1971
Symbol:
ID: 4485099
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidothermus cellulolyticus 11B
Kingdom: Bacteria
Replicon accession: NC_008578
Strand:
Start bp: 2242086
End bp: 2244152
Gene Length: 2067 bp
Protein Length: 688 aa
Translation table: 11
GC content: 66%
IMG OID: 639730764
Product: thymidylate kinase
Protein accession: YP_873729
Protein GI: 117929178
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0125] Thymidylate kinase
TIGRFAM ID: [TIGR00041] thymidylate kinase
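
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only and assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon; the variable names are not part of the record.

```python
# Values copied from the Gene Information record.
start_bp = 2242086
end_bp = 2244152
reported_gene_length = 2067      # bp
reported_protein_length = 688    # aa

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
# Assumption: the last codon is the stop codon and adds no residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 2067 0 688
```

Under those assumptions the computed values agree with the reported Gene Length and Protein Length.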


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.270362
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAATGCCG TCGGTACGTC GGATTTGCGC GGCGTGCTGC GGATTCCGTC GTTCCGCAAA 
CTCTGGACCG CGCTCGCGGT GTCGAGTTTC GGCGACTGGC TGGGCTTTCT CGGCCAGACG
GCGCTGGCCG CCAGCTTGGC CAGCGGGCAC ACGTATCTGG CGCAGAATTT CTCGGTCGGG
ACGGTGGTCT TCGTCCGGTT GTTGCCCGCG GTGACCCTCG CGCCGTTGGC CGGAGCATTG
GCTGACCGCT TGGATCGCCG GCTCACCATG GTGATCGCCG ACATCGGACG GTTCGCGCTC
TACGCGTCGA TCCCGATCGT GCACACGCTC TGGTGGCTCT TTGTCGCGAC TTTCCTCATT
GAGACGTTGA GCCTGTTCTG GATTCCGGCG AAGGAAGCGA CGGTCCCGAA TCTGGTGCCT
CGTGAGCGCA TGGGCGACGC CGGCTCGCTG AATCTCCTCG GCGCCTATGG CACGGCCCCG
ATCGCCGCGG CGGTTTTCGC ACTGCTCAGC CTGCTCACCG GCGTCCTTGC CCGTTGGATT
CCGTTGTTCA CGACGAATCG TGTCGACCTC GCCCTGTATT TCGATGCGGT GACGTTCCTC
GTCTCGGCGG CGACGATTTT CAGCCTGCGG GAAATTTCCA CGCACGGGGG ACGGCGGCGG
ACCGCCGACG GCGTCGTCAC GCATCCGTCG GTGTTCCGGT CAATCGTCGA CGGGTGGCGG
TTCATGGGCC AAACACCGGT GGTGCGCGGC CTTGCGGTCG GAATGCTCGG TGCGTTCGGC
GCGGCCGGGG TCGTCATCGG TGTCGCCACG ATTTACGTGC GGGATCTCGG CGGCGGGGGA
GCCGGGTACG GCATGTTGTT CGGGGCGGTT TTTCTCGGGT TGGCCATTGG CATGTTCACC
GGTCCGCGGG TGCTGCGCGG ATTCTCCCGG CGTCGGCTCT TCGGGTTGAG CATCGCCGCA
GCGGGAGTGA TTCTCGCGGC GACCGCGCTG GTCCATAATC TGGTGCTTGT TGTTTTCGGT
GCGCTGCTGC TCGGCTGTTG CGCCGGCATT GCCTGGGTCA CCGGGTACAC CTTGCTCGGC
TTGGAAGTTG AGGACGCCAC ACGCGGCCGC ACCTGGGCGA CGTTGCAGTC GCTCATGCAG
GTGGACATTC TTCTGATGGT TGCGGCGGGA CCGTTTCTCT CTGGCGGCAT CGGCACGCAC
GTTCTGCGCA TCGGCGACGT CACGTTCCCG GTCAACGGCT CGGCGATGAC GCTGCTCTTC
GCAGGCATTG GTGCACTGGT CGTCGGCCTC GTCGCCTATC GGCAGATGGA TGACCGCGAC
GTGCCGTTGT GGCGTGATTT CATCGATGCC GTTTTCGGCT GGCACCCGTT GACCGGGCGG
GGTGTCGCCA CCGGTTTCTT CGTCGCGTTC GAAGGCGGCG AAGGGGCGGG CAAATCGACA
CAGGTGGAAT TGCTGGCCCG CTATCTTGCC GATCGCGGAT ACGAGGTCGT GGTGAGCCGC
GAACCCGGCG GGACGCCGCT CGGTCATCGG CTTCGCGAGA TTCTCCTCGA CCCCCGGGAA
CCGGCTCCCT CCCCGCGCGC CGAAGCGTTG CTGTACGCCG CCGATCGGGC GGAGCACGTC
GCGAAGGTGA TTCGGCCGGC GTTGGCGCGC GGTGCGATTG TCATCAGCGA CCGGTACGTC
GATTCATCCC TTGCCTATCA GGGCGGCGGG CGGGATTTAT CGCCGCGCGA CGTCGAGCAG
CTGTCGCGGT TCGCGACGTC TGGTCTGCGC CCGGATCTCA CCGTGCTGTT GGATGTTCCG
CCGGATGAGG GATTGGCCCG CACGGGACGG CGGGACCGCG CACCCGATCG TCTGCAAGCT
GAGGACGCGG CTTTTCACGA GCGGGTGCGG AACGTGTTTC GGCAGCGTGC CGAAGCCGAT
CCGGAACGGT ACCTTGTCAT CGACGCCCGG TTGTCCGCTG CGGACATTCA TCGGCTCGTC
GTTCAGCGGA TCCTGCCCGT CTTGCCGTTT CCCCCGAAAC GAGCTCCGCG CGACCCGCTG
CCGTCAACCG CCGGAGGGAC GCCGTGA
 
Protein sequence
MNAVGTSDLR GVLRIPSFRK LWTALAVSSF GDWLGFLGQT ALAASLASGH TYLAQNFSVG 
TVVFVRLLPA VTLAPLAGAL ADRLDRRLTM VIADIGRFAL YASIPIVHTL WWLFVATFLI
ETLSLFWIPA KEATVPNLVP RERMGDAGSL NLLGAYGTAP IAAAVFALLS LLTGVLARWI
PLFTTNRVDL ALYFDAVTFL VSAATIFSLR EISTHGGRRR TADGVVTHPS VFRSIVDGWR
FMGQTPVVRG LAVGMLGAFG AAGVVIGVAT IYVRDLGGGG AGYGMLFGAV FLGLAIGMFT
GPRVLRGFSR RRLFGLSIAA AGVILAATAL VHNLVLVVFG ALLLGCCAGI AWVTGYTLLG
LEVEDATRGR TWATLQSLMQ VDILLMVAAG PFLSGGIGTH VLRIGDVTFP VNGSAMTLLF
AGIGALVVGL VAYRQMDDRD VPLWRDFIDA VFGWHPLTGR GVATGFFVAF EGGEGAGKST
QVELLARYLA DRGYEVVVSR EPGGTPLGHR LREILLDPRE PAPSPRAEAL LYAADRAEHV
AKVIRPALAR GAIVISDRYV DSSLAYQGGG RDLSPRDVEQ LSRFATSGLR PDLTVLLDVP
PDEGLARTGR RDRAPDRLQA EDAAFHERVR NVFRQRAEAD PERYLVIDAR LSAADIHRLV
VQRILPVLPF PPKRAPRDPL PSTAGGTP
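
The relationship between the gene sequence and the protein sequence can be checked with a short script. The following is a minimal sketch, not the database's own method: it builds NCBI translation table 11 (the table named in the record), strips the 10-base block spacing, and translates codon by codon. The helper names clean, gc_fraction and translate are hypothetical, and reading the initial GTG as Met is an assumption based on the usual treatment of bacterial start codons.

```python
BASES = "TCAG"
# Amino acids of NCBI translation table 11, in the order TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks of the 10-base block layout."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str, first_codon_is_start: bool = True) -> str:
    """Translate an in-frame DNA fragment and drop a trailing stop codon."""
    seq = clean(seq)
    codons = [seq[i:i + 3] for i in range(0, len(seq) - 2, 3)]
    protein = "".join(CODON_TABLE[codon] for codon in codons)
    if first_codon_is_start and protein:
        # Assumption: an alternative start codon such as GTG is read as Met.
        protein = "M" + protein[1:]
    return protein.rstrip("*")


# First 30 bases and last 27 bases of the gene sequence listed above.
head = "GTGAATGCCG TCGGTACGTC GGATTTGCGC"
tail = "CCGTCAACCG CCGGAGGGAC GCCGTGA"

print(translate(head))                              # MNAVGTSDLR
print(translate(tail, first_codon_is_start=False))  # PSTAGGTP
print(gc_fraction(head))                            # 0.6
```

Both translated fragments match the corresponding ends of the listed protein sequence. Passing the full gene sequence to gc_fraction would give a value to compare against the reported GC content.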