Gene Apar_0236 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0236 
Symbol 
ID8413084 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp273176 
End bp274531 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content48% 
IMG OID645021804 
Productputative chlorohydrolase/aminohydrolase 
Protein accessionYP_003179259 
Protein GI257784042 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID[TIGR03314] putative selenium metabolism protein SsnA 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGCTTA TTGTGAACGG TCGTGTTATT ACGCGCGACG AAGCTCATCC GTACATTGAA 
GATGGTGCCG TAGCAATTGA AGGTCAGAAG ATTGTTGCCG TTGATACGCG TGCGAAACTT
GAAGCCGCCT ATCCCGATGC AGAAAAGCTT GACGCGCATG GCGGCGTCAT TATGCCAGGT
CTTATCAACT GCCATACTCA TATCTATTCT GGCTTGGCAC GTGGTCTTGC TATCAAAGAC
TGTAACCCTC ACAACTTCCT TGAAAATCTT GAGCAGCAGT GGTGGAACAT CGACCGTCAT
TTGACGCTTG ATGGAACTCG CGCTTGCGCT TACGCAACTA TCCTTGATTC ACTGCGAGAT
GGTGTTACTA CTATCTTTGA CCACCATGCA AGTTTCTGCG AGATTCCTGA TTCACTTTTT
GCTATCAAAG ACGTCGCAAA AGAGCTTGGT ATTCGTGCTT GCCTTTGCTA TGAGACATCT
GATCGTGATG GAGAGGCCAA GCGTGATGAG TCTATCGCTG AAAATGCTGC CTTTGCTAAA
TGGGCTGCAG ATGAAGATGA CGATATGATT GCTGCTATGT TTGGTGGACA CGCCCTCTTC
ACACTTTCCG ATGAGACGCT TGATAAGATG GTTGAGGCCA ACAACGGTCT TACTGGTTTC
CATATTCACG TTTGCGAGGG CATGGATGAT GTGTATGATT CCGCCCTTAA TCACGGAACC
ACTGCGGTAC ATCGCCTGCT TGATCATGGT CTTTTGGGCG AGAAAACCAT GCTTGGTCAC
TGCATCCACG TGACTCCTGC TGATATGGAC ATTCTTGCTC ATACTCATAC CAGTATTGTC
AACAATCCTG AGTCCAATAT GGGTAATGCT GTTGGGTGTG CACCGGTGCT TGAGTTCTTC
AAGCGCGGTA TTGACGTATG CATGGGCACC GATGCATATA CGCACGATAT GCTTGAATCG
CTTAAGGTCT TTTTACCTAT GCAGCGTCAT AACGCTTGCA TGCCAAATGT TGGTTGGGTT
GAAGCTATGA CCATGCTCTT CAAGAACAAT GTCAAGATGG CCGAGAAGTA CTTCTCAACA
AAGTCCAACG GCAAGCCTTT GGGTAAGCTC GCACCTGGTG CTGCTGCTGA CATTGCAATC
TTTGATTACA AGCCATTCAC GCCGTTCTCT GACGAGAACA TCGACGGTCA CATGCTTTTT
GGTTTTGAGG GCAAGAACTG CCGTACTACT ATTGTCAACG GTAAGGTTCT CTATAAGGAC
CGCGAATTTG TTGCGTTTGA TGAGGACAAG ATTAATGCTT GGACTATGGT TGAGGCCAAG
AAGCTTTGGG GAGAGCTCAA CGGCCGCCAG TATTAA
 
Protein sequence
MLLIVNGRVI TRDEAHPYIE DGAVAIEGQK IVAVDTRAKL EAAYPDAEKL DAHGGVIMPG 
LINCHTHIYS GLARGLAIKD CNPHNFLENL EQQWWNIDRH LTLDGTRACA YATILDSLRD
GVTTIFDHHA SFCEIPDSLF AIKDVAKELG IRACLCYETS DRDGEAKRDE SIAENAAFAK
WAADEDDDMI AAMFGGHALF TLSDETLDKM VEANNGLTGF HIHVCEGMDD VYDSALNHGT
TAVHRLLDHG LLGEKTMLGH CIHVTPADMD ILAHTHTSIV NNPESNMGNA VGCAPVLEFF
KRGIDVCMGT DAYTHDMLES LKVFLPMQRH NACMPNVGWV EAMTMLFKNN VKMAEKYFST
KSNGKPLGKL APGAAADIAI FDYKPFTPFS DENIDGHMLF GFEGKNCRTT IVNGKVLYKD
REFVAFDEDK INAWTMVEAK KLWGELNGRQ Y