Gene Acel_0669 details

Gene Information

Locus tag: Acel_0669
Symbol:
ID: 4485431
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidothermus cellulolyticus 11B
Kingdom: Bacteria
Replicon accession: NC_008578
Strand:
Start bp: 721052
End bp: 722572
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 70%
IMG OID: 639729437
Product: Xaa-Pro aminopeptidase
Protein accession: YP_872428
Protein GI: 117927877
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0006] Xaa-Pro aminopeptidase
TIGRFAM ID:
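
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 721052
end_bp = 722572

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# A CDS should contain a whole number of codons.
codon_count, remainder = divmod(gene_length, 3)

# Assumption: the last codon is the stop codon and yields no residue.
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length (1521 bp) and Protein Length (506 aa).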


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.519514
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGTTCC ATGGACACAT GACTGCCGAT CCCGCTCCGG CGACGTCAAC TGACCGACGT 
CACCCACCCG CTCCGGACGC TGCGCCGACC ACGTCAACCG AGCCCCGGCC GCCCGCAACG
GCCAGCCACG ACAGCGAGCC GTCCGCAGCG CTGGTCGAGT TCATGCGGCA AGGCTGGGCG
CCTAGGACCA GTGACGCCGG TCCCGTCGCG GCGGTTGCGC GGTACGCCGC CCGGCGCCGC
CGCGCCCTCC AGCACGCCCT GCCGGGACGG ACAATCGTCG TCCCCGCCGG CGCGCCAAAG
ACCCGGAACA ACGACGTCGC GTATCCGTTC CGCGCGTCCA GCGACTTCTG GTGGCTGGTC
GGCGCTAACC TGCCCGACGC CGTCCTCGTC CTGGACGACA CCGACGCCGT GCTCTACGCA
CGGGCGCCGA AATCCCGGCA TGACAGCGAC GAATTCTTCC GGGACCGGCA GTACGGCGAA
TTGTGGACCG GCCCGCAACC CAGTCTCCGC GACCTGGAAC GTCAGTTGGA CATTGCCACG
GCACCGCTGG AGGAATGCAT CGAACGTCTC CAGCGAAGCG AGGAGAGCCG GACCGTCGTC
CTGCGCGGTC ACGACCCGCG GGTGGACGCC GTCGTGCCGC CTCACCCCGA CGACCGGTCG
CTCGCGACGC TGCTGGCCGA GCTCCGGCTG CTCAAGGACG ACTGGGAAAT TGCCGAACTC
ACCACGGCTG CCGACGCAAC AATTCGCGGT TTCGCCGACG TCGCGCGGGA AATTCGGGCG
GCCGCCGAGC GAGACGTGGA CATCAGCGAG CGCTGGGTGG ACGGCACATT CTGGCGGCGG
GCCCGCGCCG AGGGGAACGA CGTCGGTTAC CCCACCATCG CAGCCGCCGG TCCGCACGCC
TGCGTCCTGC ACTGGACCCG CAATGACGGC CTGCTGCACA AGGGCCAGCT CCTCCTGCTC
GACGCCGGGG TGGAAACACG CCACGGCTAC ACCGCGGACG TCACCCGCAC CATGCCGATC
AGCGGAGCAT TCACCGAGGC GCAGCGGCAG GTCTACCAAC TCGTGCTGGC CGCCCAGGAA
GCCGCATTCG CCGCCCTCAA GCCGGGCGCG GCTTTCCGTG ATTACTACCG GGCCGCGGCG
GCGGTGCTGG CTCAGGGCCT CGCCGACTGG GGACTGCTCC CCGCGTCCGC CGCGGATATC
GACGGACCGC ACGGGGACCT GCATCGCCGC TACACCCTGC ACAACCCCGG GCACATGCTC
GGACTCGACG TGCACGACTG CGCCGCCGCC CGCCGCAACC GCTACCACGA CGGATTTCTC
GAACCCGGAC ACGTGCTCAC GGTGGAGCCC GGCCTGTATT TCCAGCCCGA TGACCTCACC
GTGCCGCCGG AACTTCGGGG CATCGGCGTG CGCATCGAGG ACGACGTCGT CATAACCCCC
GGCGGATATC GGAATCTCAC CGCGGCCCTG CCGCGACACC CCGATGAGCT CGTCCGCTGG
CTTGCCCAGC CGACGCCGTA G
 
Protein sequence
MVFHGHMTAD PAPATSTDRR HPPAPDAAPT TSTEPRPPAT ASHDSEPSAA LVEFMRQGWA 
PRTSDAGPVA AVARYAARRR RALQHALPGR TIVVPAGAPK TRNNDVAYPF RASSDFWWLV
GANLPDAVLV LDDTDAVLYA RAPKSRHDSD EFFRDRQYGE LWTGPQPSLR DLERQLDIAT
APLEECIERL QRSEESRTVV LRGHDPRVDA VVPPHPDDRS LATLLAELRL LKDDWEIAEL
TTAADATIRG FADVAREIRA AAERDVDISE RWVDGTFWRR ARAEGNDVGY PTIAAAGPHA
CVLHWTRNDG LLHKGQLLLL DAGVETRHGY TADVTRTMPI SGAFTEAQRQ VYQLVLAAQE
AAFAALKPGA AFRDYYRAAA AVLAQGLADW GLLPASAADI DGPHGDLHRR YTLHNPGHML
GLDVHDCAAA RRNRYHDGFL EPGHVLTVEP GLYFQPDDLT VPPELRGIGV RIEDDVVITP
GGYRNLTAAL PRHPDELVRW LAQPTP
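
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a sketch only, not the tool that produced the record: it builds the codon table from the standard NCBI ordering (translation table 11 uses the same codon assignments as the standard code for elongation and differs only in which start codons are permitted), and it stops at the first stop codon. The function name `translate` and the constants are hypothetical helpers introduced here.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G, with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, ignoring whitespace, up to the first stop."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence as formatted above
# (each full line holds 60 bases, so line starts are in frame).
first_block = translate("ATGGTGTTCC ATGGACACAT GACTGCCGAT")
last_block = translate("CTTGCCCAGC CGACGCCGTA G")
print(first_block, last_block)
```

The two fragments translate to the first ten and last six residues of the protein sequence shown above. The sketch does not handle alternative start codons or ambiguous bases.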