Gene Acry_1148 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcry_1148 
Symbol 
ID5160076 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidiphilium cryptum JF-5 
KingdomBacteria 
Replicon accessionNC_009484 
Strand
Start bp1278571 
End bp1280031 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content70% 
IMG OID640553062 
Productamidohydrolase 
Protein accessionYP_001234279 
Protein GI148260152 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID[TIGR02967] guanine deaminase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.318929 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGATCC CGCCCAGCCC CGGCCGTCTC GCCATCCGCG CTCAGTTGCT CGGCTACGAC 
GGCAATCCCT TCGTGTCGGA TCCCGCGGAC TGCGTGCGGC ATGAGAGCGA CGGGCTGATC
CTCGTCGCGG ACGGGCGGAT CTCCCATGTC GGTCCCTACG TCGCCGATCT CGTGCCCGAG
GGCGTCGAAC TGCATGAGTA TCGCGATGCG CTGCTGATGC CGGGCTTCAT CGACGCCCAT
GTCCATTACG CGCAGACGCC GATGATCGGC GCCTATGGCA AGCAGCTGCT CGACTGGCTG
GAGACCTATG TCTTTCCCGT CGAGCAGCGC TATGCCGATC CCGATTTTGC CCGCGCCATG
GCCCGGCTCT TCTTCGCGCA GGAACTTGCC GCCGGCGTGA CCACCACGCT GTCCTACTGC
ACGGTTCATC CCGGCTCGGT CGACGCCTAT TTCGAGGAAG CGGCAAGGCT CGGCCTGCGC
GCCGGCGCCG GCAAGGTCCT GATGGACCGC AACGCGCCCG AGCCGCTGCG CGACACCGCA
CAGCGCGGGT ACGACGACTC CAGGCGCCTG ATCGACCGCT GGCATGGCCG CGGCCGGCTG
TTCTACGCCG TCACCCCGCG TTTCGCGCCG ACCAGCACGC CGGCCCAGCT TGAGGCGGCC
GGCGCGCTGT TCGCCGAGAC CGACGGCGTG TGCATGCAGA CCCACCTCTC CGAAAACCTC
GCGGAGCTTG ATTGGGTGCG CGCCCTGTTC CCCGATGCCC TGGACTACCT CGATGTCTAT
GATCGCGCGG GGCTGGTGGG TCCGCGCAGC CTGTTCGGCC ATGCCATCCA TCTTTCACCC
CGCGAATGGG ACCGTCTCGC CGGGGCGGGC GCCGCCGTCG TTCACTGCCC CACCTCGAAC
CTGTTCCTCG GCTCCGGCCT GTTCGACCTG CGCCGGGCGC TGATCGCCGG CAATCCGGTC
CGCACCGCGC TGGGGTCGGA TATCGGCGCC GGAACCAGCT TCTCGCCGCT CGCGACGCTG
AACGAGGCGT ACAAGGTCGC GGCCCTGCGG GGCGAGGCGC TCTCCGCCCA CCGGGCCTTC
TACCTCGCGA CCCTCGGCTC GGCGCGAGCC CTGTACATGG ACGACAGGAT CGGTCGCCTC
GCGCCGGGGT ACGAAGCCGA TTTCGCGGTG CTCGACCTCG CCGCCACGCC CCTCCTGCGC
GAGCGTCTGC GTTTCGCCGA CACGCTGGAG GAGGCGCTGT TCGTGCTGAT GACGCTGGGC
GGTGCGGGAT GCGTTCGGGC AACCTACGCG GCGGGCCGCC TCGTGCACGA CCGCACCCGG
CCCGATGCGT CAGCTCAGGC GGGCGAGGGC TGTTGCGACA CCGTCGCCGT AGGCCGGATC
GGCGCGGCGG CAATTGGCGA CGTGCCGCTC CTGGATGTGC CGCGAGGCAT CGCCGAGCGC
GCGGGCGGTG TTGTCGAATA G
 
Protein sequence
MMIPPSPGRL AIRAQLLGYD GNPFVSDPAD CVRHESDGLI LVADGRISHV GPYVADLVPE 
GVELHEYRDA LLMPGFIDAH VHYAQTPMIG AYGKQLLDWL ETYVFPVEQR YADPDFARAM
ARLFFAQELA AGVTTTLSYC TVHPGSVDAY FEEAARLGLR AGAGKVLMDR NAPEPLRDTA
QRGYDDSRRL IDRWHGRGRL FYAVTPRFAP TSTPAQLEAA GALFAETDGV CMQTHLSENL
AELDWVRALF PDALDYLDVY DRAGLVGPRS LFGHAIHLSP REWDRLAGAG AAVVHCPTSN
LFLGSGLFDL RRALIAGNPV RTALGSDIGA GTSFSPLATL NEAYKVAALR GEALSAHRAF
YLATLGSARA LYMDDRIGRL APGYEADFAV LDLAATPLLR ERLRFADTLE EALFVLMTLG
GAGCVRATYA AGRLVHDRTR PDASAQAGEG CCDTVAVGRI GAAAIGDVPL LDVPRGIAER
AGGVVE