Gene Acel_1030 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_1030 
Symbol 
ID4484571 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp1137274 
End bp1138857 
Gene Length1584 bp 
Protein Length527 aa 
Translation table11 
GC content71% 
IMG OID639729805 
ProductTraR/DksA family transcriptional regulator 
Protein accessionYP_872789 
Protein GI117928238 
COG category[T] Signal transduction mechanisms 
COG ID[COG1734] DnaK suppressor protein 
TIGRFAM ID[TIGR02420] RNA polymerase-binding protein DksA 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00152227 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.0090317 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGGCGAT CGCAGGGACC ACGCGGTGGC GGGACGTCGT CGTGCCGCGT ACGCTGCGGG 
CAGCAGCGAC GATTCAGGCT CCGGAGTCTG GTGATTCGAT CTCCGGCGTG CCGCGTTGCT
GCGCGCAGCG GAACCGACGT ATCCTCTGCT GACCTGCGTG ACGATCAGGA ATCAGAGAGG
TTCGAGGGGG CCTCGATGGC AGGACAACGG CGTACGACGA CGGCGCCCGC CAGCCGGCGC
ACTACCACCA CGGGGACGCG CACCGGGGTC AAGCGCACCA CCGCCGGTGG CGCGAAGCGG
AGCGTGTCGG CGAACGCTTC GGTGGCGGGT CGGAAGGCCA CGACACAGGC GACCGCGAAA
AGGGCCGCGG GGAAGGCCAC CGCGGCGAAG CGCGCCGCGG CGGCGAAGGC CACGGTGACG
AAGACCGCGG CGGTGAAGGC CGCCACGAAG AAGGCCACGG CTGTGAAGAC CGCGGCGACG
AAGACCGCGG CGGTGAAGGC CGCGGCGAAG AAGGCCGCGG CAAAGAAGGC CGCGACGAAG
AAAGCCGCGG CAAAGAAGGC CGCGACGACG AAGACGTCTG CGGTGAAGAC CTCGGTAGCA
ACCAAGGCGT CTGCCGGGGC GAGAGCAGCT GCCCGGGCGT CGGCACCGGC CAAGGCCGCG
ACGACCGCGG CCAAGAAGGC CGCGACGAAA AGGGCTACGG CGCCGGCTAA GCCCGCGGCG
AAGAAGGCTG CGGCGCCCGC GAAGACCCCG GCGAAGCAGG CGGCTGCTTC CAAGAAGGCG
GCGGCCACGG CGGCGAAGGC ACCGGCGAAA GTGACCGCTC CCCGGAAGGC CGCAACAGCT
GCCCAGCGGG GCGGAGCGGC AGCCAAGGCG CCGGCGAAGG CCGCGCGGAA GGCACCGGCT
CCACCCACGA GCGTGCCACC GGCAACCGTC ACCTCGCCGT CCGGCCCGAC GCCAGCCGAG
ACCGCCGTTC GCCATGAGGC GGAGGTGCCT GGCGGCGTGC TCTCGTCACC GGCAGCGGCC
CAGCCCGCGG AACCAGCCGG TGCGGAGTCG TCCGCGTCTG CCGTCGAACC GCCGGCTGCC
GAACCGCCGG TCACGGCACC GTCCGCCGGC GTCGACGCCG CAGCAGCTCC GGCCGAGTCG
GAGACCCCCG CCAACTCGAC AGCCACGGCG CTCCCATCCG GAGCCGGCGA CGTCGGAGCC
GGCGAGATCG CGGATGAGTA CACCTGGACA GCGGCTGAGC TCGACGAGAT TCGTGCGCAG
CTCGAAGCGG AGATTGTCCG GTTGCGCCGG GAGATCGAAG TCGCGGAGTC GGGGCTCGCG
GAGCGGATGC GGGACGGCGG CGACGGCGCT GGTGACGACC AGGCGGACGC CGGCACGAAG
ACGTTCGAGC GGGAGCACGA GATGTCCCTG GCCAATAACG CCCGGGATCT GCTCGTGCAG
ACCGAGCACG CACTCGCCCG CATCGCAGAT GGCACGTACG GCCGTTGCGA GAACTGCGGC
AATCCCATCA ACAAGCTCCG GCTGCAGGCG AATCCGCGTG CGACGCTATG TGTGTCCTGC
AAGCAACGGG AGGAGCGTCG CTGA
 
Protein sequence
MRRSQGPRGG GTSSCRVRCG QQRRFRLRSL VIRSPACRVA ARSGTDVSSA DLRDDQESER 
FEGASMAGQR RTTTAPASRR TTTTGTRTGV KRTTAGGAKR SVSANASVAG RKATTQATAK
RAAGKATAAK RAAAAKATVT KTAAVKAATK KATAVKTAAT KTAAVKAAAK KAAAKKAATK
KAAAKKAATT KTSAVKTSVA TKASAGARAA ARASAPAKAA TTAAKKAATK RATAPAKPAA
KKAAAPAKTP AKQAAASKKA AATAAKAPAK VTAPRKAATA AQRGGAAAKA PAKAARKAPA
PPTSVPPATV TSPSGPTPAE TAVRHEAEVP GGVLSSPAAA QPAEPAGAES SASAVEPPAA
EPPVTAPSAG VDAAAAPAES ETPANSTATA LPSGAGDVGA GEIADEYTWT AAELDEIRAQ
LEAEIVRLRR EIEVAESGLA ERMRDGGDGA GDDQADAGTK TFEREHEMSL ANNARDLLVQ
TEHALARIAD GTYGRCENCG NPINKLRLQA NPRATLCVSC KQREERR