Gene Athe_1697 details


Gene Information

Locus tag: Athe_1697
Symbol:
ID: 7409207
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 1783912
End bp: 1784976
Gene Length: 1065 bp
Protein Length: 354 aa
Translation table: 11
GC content: 38%
IMG OID: 643716068
Product: peptidase M24
Protein accession: YP_002573564
Protein GI: 222529682
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0006] Xaa-Pro aminopeptidase
TIGRFAM ID:
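
The coordinates and lengths listed above can be cross-checked against each other. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1783912
end_bp = 1784976
gene_length_bp = 1065
protein_length_aa = 354

# Assumption: coordinates are 1-based and inclusive.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # span matches the gene length
print(gene_length_bp % 3 == 0)                       # whole number of codons
print(expected_protein_length == protein_length_aa)  # codons minus stop
```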


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTTAGCA CAAGAGTTGA AAAACTGTTC AAAAGAGATG AGAGTATCGA AGCAGTTTTT 
GTGTCCAAAA AAGAAAATGT CAGATACCTT AGCAATTTCA AAGGCGATGA AAGCTTCCTG
CTTATTACAA GAGAAGGAGC AAAGTATCTT CTGACAGACT TTAGGTATAC CGAGCAGGCA
AAAAAAGAAG CAACTGAATT TGAAGTTGTT GACTACAAAG GAAAACTTTA TGATACTATA
AAAGACTTGA TGGTATCGCA TAACATTTCA AAGCTTTTTA TTGAAGGGTA TCATCTGCCG
TTTTTGTTTG TGAGTGAGAT GAAAGAAAAG CTTGAACACA GGGTATGTGC ACTTTCGTTT
TCTTTGGATG AAATAAGAGC TGTTAAAGAT GATGAAGAGA TAGAGAAAAT AAAAAAAGCT
GTTGAGATTG CTGACAGAGC ATTTGAGCAC ATCTTAAAGT TCATAAAGCC GGGTATTTCA
GAAAATGACG TGGTTGCAGA GCTCAACTAC TTTATACTGA AAAACGGTGC AAAAGGCTTT
TCATTTGAAC CTATAGTTGC CTCTGGGAAA AGAAGCTCTT TGCCCCACGG CGTCGCAACA
GACAAAAAGA TTGAAGCTGG CGATACTGTT ACAATTGACT TTGGATGCAA CTTTGACGGC
TACATGTCTG ATATGACAAG GACGGTATTT GTAGGGAAAG TGGAAAGTCA GATGGTAAAG
GTATACCATA TAGTAAAGGA AGCTCAGCAA AAGGCAGAAG AGTTTATAAA AGAGGGCTTA
AAGGCAAACG AAGTTGACAA GATTGCCCGT GACTATATAG GCTCTTTTGG TTATATGGAA
AAGTTTGGAC ACTCTTTGGG ACATGGTGTT GGACTTGAAA TTCACGAACT TCCAAGGCTT
TCACCAAAGT CTGAGATGGT TTTAGAAGAA AATATGGTTG TAACGATAGA GCCGGGCATT
TATATTGAGG ATTTTGGAGG TGTGAGGATA GAAGATATAG TTGTTGTGAA AAGTGGTGGA
TGTGAGATTT TGACAAAGTC GACAAAGGAG CTAATTGTAA TTTAG
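
The gene sequence is printed in space-separated blocks of 10 bases, so it needs cleaning before any computation. The sketch below strips the grouping whitespace and computes a GC fraction; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the sequence is used for brevity. How the record's own 38% figure was calculated is not stated, so this is one plausible definition (G plus C over total length), not necessarily the database's method.

```python
def clean_sequence(text: str) -> str:
    """Drop the whitespace used for 10-base grouping and line wrapping."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGGTTAGCA CAAGAGTTGA AAAACTGTTC AAAAGAGATG AGAGTATCGA AGCAGTTTTT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full 1065 bp sequence through the same two functions gives the whole-gene value to compare with the GC content listed in the record.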
 
Protein sequence
MVSTRVEKLF KRDESIEAVF VSKKENVRYL SNFKGDESFL LITREGAKYL LTDFRYTEQA 
KKEATEFEVV DYKGKLYDTI KDLMVSHNIS KLFIEGYHLP FLFVSEMKEK LEHRVCALSF
SLDEIRAVKD DEEIEKIKKA VEIADRAFEH ILKFIKPGIS ENDVVAELNY FILKNGAKGF
SFEPIVASGK RSSLPHGVAT DKKIEAGDTV TIDFGCNFDG YMSDMTRTVF VGKVESQMVK
VYHIVKEAQQ KAEEFIKEGL KANEVDKIAR DYIGSFGYME KFGHSLGHGV GLEIHELPRL
SPKSEMVLEE NMVVTIEPGI YIEDFGGVRI EDIVVVKSGG CEILTKSTKE LIVI