Gene Apar_0440 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0440 
Symbol 
ID8413289 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp509083 
End bp510081 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content48% 
IMG OID645022008 
Productadenosine deaminase 
Protein accessionYP_003179462 
Protein GI257784245 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG1816] Adenosine deaminase 
TIGRFAM ID[TIGR01430] adenosine deaminase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTCCT GTGCTCTTAT CGATTTGCAT GTCCATCTTG ATGGATCAAT CCCCCTTCCT 
GCCGCAGCCC AACTTGCAGC AGAAGCGGGA CTTAATTTCT CTTTGGATGA ACTCCAAGAA
AAGATGCAAG TCCCCGCTCA TTGTCAGGAT CTTAACCAAT ATCTTGCAAC GTTTGAGTTG
CCCTTAAAGC TCATGCAGTC AGAGCAAGGC ATACGTGCTG TTGCAAAGGC ATTTCATAAG
CAACTTGATG CAGAGGGTAT TCTCTATGCA GAACCCCGCT TTGCACCAGG AAGCCTTACG
GCGGAAGGTC TTTCTCAGCA AGAAATCCTT GAGGCTGCCC TTGCTGGTAG AGCGGATTTC
TTTGCAGAGA ATCCACAGTC AGAGCTTCAC ACGGCGTACA TCCTTTGCGC CATGCGTGGC
ACAGGTGAAG AGCTTAAACG TAAAAATGAA CAATCAATCG ATTTGGCTGT AGCATACCTT
GGAAAGGGTG TTGTTGCGGC AGACTTAGCG GGAGCAGAAG CACTCTTTGC CACAGAGAAT
TTCTCGTCAC TTTTTGCTGA AGCGCAAAGA AAAGATGTTC CTTTTACTAT TCACGCAGGA
GAAGCCGCTG GTCCAGAGAG CATCAAGGCC GCACTTCGTC TTGGCGCACA ACGCATTGGT
CATGGTGTAC GCTCCCTGGA AGATGTGAGT GTTATCCAGG ACCTCAAAGC TGCAAATGTT
ACACTTGAGA TTTGTCCTAC CAGCAACCTT CAGACACGCA TCTTTGAGTC AATAGAGCGC
TTCCCTCTTG AACAGCTGCT TGATGCTGGT CTAACGGTCA CCATCAACAC TGACAACATG
ACCGCTTCCA ACACTACCCT CTCGCACGAA TTTGAGCTTT TGCAGCAGTA CTGTGGTCTA
GACAAAAATA CCGCACGTGA GCTTGCTGAA AATGCTGCAC GTGCGGTATT TAGTGATTCT
AGCGAGAAGG ACTGTCTACT TGCCTACCTT AGGCAATAG
 
Protein sequence
MSSCALIDLH VHLDGSIPLP AAAQLAAEAG LNFSLDELQE KMQVPAHCQD LNQYLATFEL 
PLKLMQSEQG IRAVAKAFHK QLDAEGILYA EPRFAPGSLT AEGLSQQEIL EAALAGRADF
FAENPQSELH TAYILCAMRG TGEELKRKNE QSIDLAVAYL GKGVVAADLA GAEALFATEN
FSSLFAEAQR KDVPFTIHAG EAAGPESIKA ALRLGAQRIG HGVRSLEDVS VIQDLKAANV
TLEICPTSNL QTRIFESIER FPLEQLLDAG LTVTINTDNM TASNTTLSHE FELLQQYCGL
DKNTARELAE NAARAVFSDS SEKDCLLAYL RQ