Gene Acel_2139 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_2139 
Symbol 
ID4485607 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp2419355 
End bp2420932 
Gene Length1578 bp 
Protein Length525 aa 
Translation table11 
GC content67% 
IMG OID639730941 
Productmetal dependent phosphohydrolase 
Protein accessionYP_873897 
Protein GI117929346 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0617] tRNA nucleotidyltransferase/poly(A) polymerase 
TIGRFAM ID[TIGR00277] uncharacterized domain HDIG
[TIGR02692] tRNA adenylyltransferase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCAGCC AGCCTGTTAC CGCGATGACC GCTGTCAGCC TCCTGGCCGA AGCGCGCGCC 
AAGGCGCTGT CCGATCTTGC CGAGGTCCTG CCGATGCTCG ACGAGCTCGG CCAGCGGTTC
GCGCATGGCG GGCATGAGCT GGCGCTTGTT GGTGGGTCGA TTCGTGACGC CCTGCTCGAG
CGTCCCGTCG TTGATCTTGA CTTGGCGACC AGCGCCCGGC CCGATGAGGT GCTGAAACTT
GCGAAGGGCT GGGCGGAGAG CCAATGGGAG ACCGGGATCG CCTTCGGCAC GGTGGGCCTA
CGGAAGGACC GCTACCACCT GGAGATCACA ACGTACCGGG CCGAGCGGTA CGATCCGACG
TCCCGGAAGC CGGCGGTGGA GTACGGCGAA TCCCTGCTGG ATGACTTGAA GCGGCGGGAC
TTCACGGTCA ACGCGATCGC TCTGACGCTG CCCGAACACG AGCTGGTCGA CCCCTTCGAC
GGCATCCGGG ACCTGCTGCG TCGTCGGATC ACTACCCCAG GGCGTCCGGA GGATTCCTTC
AGCGACGATC CGCTGCGGAT GCTGCGGGCC GCGCGGTTTG CCGCCCAGCT CGGTTTCACC
GTCGACCCGC ACGTGGTTGC CGCGATGACC GCGATGGCGG CACGGCTTGA CGTGGTCTCC
GCTGAGCGGA TTCGCGACGA ATTCACCAAA CTGCTCCTTG CGCCGGAGCC GCGGCGGGGG
CTGGTTCTGC TCGTCGATAC CGGATTGGCG GACCGCTTTC TGCCGGAGCT GCCCGCGTTG
CGACGGCTTG AGGCGGACCG CGAGTACCGG CACAAGGACG TCTACGAGCA CACGCTCACG
GTCCTCGACC AGGCCATTGC TCTCGAGGAC GGCGAACCGG ACCTGGTGCT GCGGCTTGCC
GCGCTGCTGC ACGACATCGG AAAGCCGGCG ACCCGCCGCA AGGAGCCCGA TGGCCGGGTC
TCCTTCCACC ACCACGAAGT CGTCGGCAAG AAACTCGCCA AGGCCCGGCT GACCGCGCTG
AAATTTCCCA AGGACGTCGT GAACGACGTG AGCCGCCTGG TCGAATTGCA CTTGCGTTTC
CACGGCTACG GCACCGGGGA GTGGACGGAC TCCGCGGTCC GCCGGTACGT GCGGGACGCC
GGCCCGCTGC TCGACCGGCT GCACAAACTC ACCCGGTCGG ACTGCACGAC CCGCAACAAG
AAGAAGGCCG CGGCGCTGCA AGCGGCGTAC GACTCGCTGG TGCAGCGGAT CGAGGCGTTG
CGCCGGCAGG AGGAAATCGA CGCGATCCGG CCCGAGTTGA ACGGCCACGA AATCGGGGAA
ATCCTTGGGA TTCCACCCGG CCCGGAACTC GGTCGTGCGT ACCGTTTCCT GCTGGAGTTG
CGCCTGGAGC GGGGACCGAT CGGTAAGGAG AACGCGGCTG CGGTGCTGCG GGAATGGGCC
GCCGAGCACG GGATCACCCC GCGGACGACA GCCGCACCTC CGCCGCCTCG GGACGACGCA
GGCGCATCCT CGCCCGACAC GGCGGCTTCA TCGGGCGAGA CCCGTGAAGC CTCGTCAGCC
GGTGATCAGA CCAGCTGA
 
Protein sequence
MSSQPVTAMT AVSLLAEARA KALSDLAEVL PMLDELGQRF AHGGHELALV GGSIRDALLE 
RPVVDLDLAT SARPDEVLKL AKGWAESQWE TGIAFGTVGL RKDRYHLEIT TYRAERYDPT
SRKPAVEYGE SLLDDLKRRD FTVNAIALTL PEHELVDPFD GIRDLLRRRI TTPGRPEDSF
SDDPLRMLRA ARFAAQLGFT VDPHVVAAMT AMAARLDVVS AERIRDEFTK LLLAPEPRRG
LVLLVDTGLA DRFLPELPAL RRLEADREYR HKDVYEHTLT VLDQAIALED GEPDLVLRLA
ALLHDIGKPA TRRKEPDGRV SFHHHEVVGK KLAKARLTAL KFPKDVVNDV SRLVELHLRF
HGYGTGEWTD SAVRRYVRDA GPLLDRLHKL TRSDCTTRNK KKAAALQAAY DSLVQRIEAL
RRQEEIDAIR PELNGHEIGE ILGIPPGPEL GRAYRFLLEL RLERGPIGKE NAAAVLREWA
AEHGITPRTT AAPPPPRDDA GASSPDTAAS SGETREASSA GDQTS