Gene Acel_1371 details

Gene Information

Locus tag: Acel_1371
Symbol:
ID: 4485859
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidothermus cellulolyticus 11B
Kingdom: Bacteria
Replicon accession: NC_008578
Strand:
Start bp: 1525654
End bp: 1527222
Gene Length: 1569 bp
Protein Length: 522 aa
Translation table: 11
GC content: 68%
IMG OID: 639730155
Product: ErfK/YbiS/YcfS/YnhG family protein
Protein accession: YP_873129
Protein GI: 117928578
COG category: [S] Function unknown
COG ID: [COG1376] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
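
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single untranslated stop codon; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1525654
end_bp = 1527222
gene_length_bp = 1569
protein_length_aa = 522

# Assumption: start and end coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not translated,
# so the protein has one residue fewer than the number of codons.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a whole number of codons:", remainder == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks agree with the values listed in the record.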


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.104167
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACAGGGC AACCTCACGA CGACGGCTGG GAGATCGGCG GAGGTGACCG CCGCATCCCC 
GCCGACGTGC TGCCCAAGCA TCGCGCGCCG AGGTCGCGGG TTCGTCCCGC GTTGCGGCAT
GGCCTGCTCG GCTTGCTGGT TGCCGGCGGT CTGGCGATTG CGACCGCAGC TGCTCTCGGC
GGGTGTCAGC AGCGGGCTCC CGCCGGCCTT GCCGCAGCGG GCGGCAACCA GACCAGCGCG
CCGACGTCGC GGACGTCCGT GTCGGCCCTG CCGGTGGATT TCACCGTGTC GCCGGCGGAC
AAGGCGACCG GCGTCCCGCT GAACGCGGAC GTGACGGTGA GCGTCCAGCA CGGGTCGTTG
ACGCTCGTAT CGGTGGCGGA CGGCGCCGGT CACCGGCTGG ATGGCGGCAT GGACGCCGAT
CACCACACCT GGCGGAGCGC TGAGGCGCTG CAGCCCGGCA CGACGTACCA CGTGGTCGCC
GAAGCGGAGG ACGACGCCGG GCAGGCGGCG ACGTTCAGCT CGACGTTCTC GACCATGCAG
CCGTCGAAAG TTCTCGGGAT GCGGATTGCC CCGCTGGAAG GGGAGACCGT CGGGGTCGGC
ATGCCGATCA TTATTTACTT CACGGCGCCG GTGACGGACA AAGCGGCGGT CGAGCGGCAC
TTGTCTGTCG AGGCGTCGAT GCCGGTTCTT GGTGCGTGGC ACTGGTACGG CGACACCGAG
GTGCACTACC GGCCGACGGT GTACTGGCCG GTCGGTGACC AGGTGACCTT GCACGCGGAT
CTCGCCGGAG TGGACGCCGG TCGCGGTGTG TGGGGCACCG AGAACCGCAC GGTGCATTTC
ACGATCGGCG ATTCGCACAT CAGCGTGGTG GACGCCAGGA CGCACATGAT GACGGTGCGG
ATCAACGGCC AGGTGGCGCG GGTTATTCCG GTGAGCACGG GGCGGGACAA ATACCCCACG
ACGTCTGGGG TGCACGTCGT TTTGGAGAAG GCACAAAAAG TCATCATGGA CTCGGCGACC
GTCGGGATTC CCAAGGGCGA TCCGGATTAT TACTACGAGG TCGTCTATTG GAATGTGCGG
ATTTCGTGGT CCGGGGAATT CGTGCACGCC GCGCCGTGGT CCGTCGCCGA CCAGGGTCGG
GTGAATGTCA GCCACGGCTG CGTCAACGTA AGCCCGGCGA ACGCCGAGTG GTTCTACAAC
CTGTCCCAAC GCGGTGACAT CGTCCAAGTT GTGGGCACAC CACGCGGGTT GGAGAGCGGC
AACGGCTGGA CCGACTGGAA CATGCCGTGG AGTCAGTGGG TGGCGGGCAG TGCCTTGCCG
GCGAATGAGA ATCCGCGGCT CTCTGATGCG GACGTCGTCG CCGCGGAGGG GGACGGTTTC
GGGTACTACA GCCCGACCCC AGGGGTGAGC AACAACCGGC ACTGGGTGCC GCCGGCTCCG
GCAACCCGAC CGGCGACGCC GACGCCGAGT TCGACGCCGA GTCCTACGCC GAGTTCGCAC
GCCACCGGCA GTGCCAAGCC GACGCCCACG GTTTCCGCCG GCTCGCCCAC GTCGTCGGGT
AGCGGTTAG
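
The gene sequence is displayed in space-separated blocks of ten bases, so it has to be joined before any per-base statistic such as GC content can be computed. The sketch below shows one way to do that; `clean_sequence` and `gc_percent` are hypothetical helper names, and the sample is only the first line of the sequence above, so its GC fraction need not match the value reported for the whole gene.

```python
def clean_sequence(block: str) -> str:
    """Join a space- and newline-grouped sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used as a short sample only.
sample = "GTGACAGGGC AACCTCACGA CGACGGCTGG GAGATCGGCG GAGGTGACCG CCGCATCCCC"
seq = clean_sequence(sample)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full 1569 bp sequence through the same two functions would give a figure comparable to the GC content field in the record.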
 
Protein sequence
MTGQPHDDGW EIGGGDRRIP ADVLPKHRAP RSRVRPALRH GLLGLLVAGG LAIATAAALG 
GCQQRAPAGL AAAGGNQTSA PTSRTSVSAL PVDFTVSPAD KATGVPLNAD VTVSVQHGSL
TLVSVADGAG HRLDGGMDAD HHTWRSAEAL QPGTTYHVVA EAEDDAGQAA TFSSTFSTMQ
PSKVLGMRIA PLEGETVGVG MPIIIYFTAP VTDKAAVERH LSVEASMPVL GAWHWYGDTE
VHYRPTVYWP VGDQVTLHAD LAGVDAGRGV WGTENRTVHF TIGDSHISVV DARTHMMTVR
INGQVARVIP VSTGRDKYPT TSGVHVVLEK AQKVIMDSAT VGIPKGDPDY YYEVVYWNVR
ISWSGEFVHA APWSVADQGR VNVSHGCVNV SPANAEWFYN LSQRGDIVQV VGTPRGLESG
NGWTDWNMPW SQWVAGSALP ANENPRLSDA DVVAAEGDGF GYYSPTPGVS NNRHWVPPAP
ATRPATPTPS STPSPTPSSH ATGSAKPTPT VSAGSPTSSG SG
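
The gene sequence starts with GTG while the protein starts with M, which is consistent with GTG acting as an initiation codon under translation table 11. The sketch below is a minimal illustration of that relationship, not the pipeline that produced this record. It assumes that table 11 shares the standard codon-to-amino-acid assignments and that ATG, GTG, and TTG are accepted as start codons (a subset chosen for this example); `translate_cds` is a hypothetical helper.

```python
from itertools import product

# Standard codon assignments in TCAG order; assumed to apply to table 11 as well.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the initiation codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(cds: str) -> str:
    """Translate a CDS, reading the initiation codon as M and stopping at the first stop."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [cds[i:i + 3] for i in range(0, len(cds), 3)]
    if not codons or codons[0] not in START_CODONS:
        raise ValueError("CDS does not begin with an accepted start codon")
    protein = ["M"]  # the initiation codon is read as methionine
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGACAGGGC AACCTCACGA CGACGGCTGG"))  # MTGQPHDDGW
```

The output matches the first ten residues of the protein sequence listed above.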