Gene Apar_0955 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0955 
Symbol 
ID8413826 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp1075121 
End bp1076725 
Gene Length1605 bp 
Protein Length534 aa 
Translation table11 
GC content44% 
IMG OID645022543 
Product4-phytase 
Protein accessionYP_003179975 
Protein GI257784758 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00228685 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGAGC ACGTATTTAA TAATCTTTCC CGTAAATCAT TTTTACGTGG ATCTTTGGCG 
GCTGCCGTTG CCGCCGGTTC TGCTTCACTA CTTTCTGCTT GTGGTGGATC TGCTAGCGGA
GATGGTGAAA AAAAGGTATT ACGCTTTGGC GTAAACAATC CAAAAGTTAC CTTTGATACT
CAAAAAACCT CTGGCTCTGT TGGTGTATCT GAAGCAGTTG CAGAATCTTT ACTGGTGCTT
AACCCAGACA CAAAAGAGAT TGAGCCAAAT CTGGTAACTG GTCTTCCTAC TGTTTCTGAT
GATGGTCTTA CTTACTCATT TGAGCTTAAA GATGGCGTCA AATTCCACAA CGGAGAAACC
CTTAAATCAT CTGACGTTAA GTACACCTTA ACGCGCATGT TCTTGCCTGC TACCAAGGCA
ACCTCAATTG ACTCTTATGC TTACATTGAG GGTGCTAAAG ACATTATTGC TGGTAAAACT
GAAGAGCTTT CTGGCGTTGT TATTAAAGAC GACCGTCACT TTGACATTAA GCTAACCCAA
CCGTATTCAA CCTTCAATGC AATCATGGCT CAATTCTATG CTGTCATCTA CCCAGAGAAG
GCCTGCAAAG AAGCTGGTGA AGCGTGGGGT ACTGAGACCA ATTTCATTGG CACTGGTGCA
TACAAGCTTG TCTCAAACGA TAGCGCTACA GAAGTTGTTC TTGAGGGATT TGCTGACTAC
CATGAAGGTA AACCTGGCCT GGATGAGCTT AGATTTGTTT ACATTGACGA CGCTAACACC
CGTGTTCTTA ACTACAAGAA CAACGATGTT GACCTGGTCT TTATTTCTCA GTCTCTTATT
CAGCAGTATC AAAACGATGA ATCTATCTCC AAAGAAATTG TCAATTACAC ACCTGCATCC
ACGCAGTTTG TAAACTTGAA TCTTCAGAGT CAGAACCTCA AAGATGTTCG TGTTCGTCAG
GCGCTCTCAA TGGCAATTGA TAGAGACACC ATTTGTAGCA CCATTCTTTC TAATGTTGCT
AAACCTGCAA AGTCATTTAT TCCATCTTCT GAGACTGGTT ACAATGAGTC CGCTCCAGAG
TTTGAGTACA ACGTAGACAA AGCAAAACAG CTTCTTGCAC AAGCAGGTGT TTCTAACCTT
ACTCTTAACG CACAGGTTAG AAGCCAAGAC CAGAACCTTA TGGTTGCTAT ACAGGATGCA
TGGTCCAAGA TTGGCGTTAC TTGTAACGTA AGCGTTATTG ACTCTGGTGT TTGGAGCGAC
GCACGCGCCA ACGGAGAGCT TGAGGTTACC TTGGTTACCT GGTCTACCCT CTCCTTCCAA
GGCATTGAGC ACATGGGCTC CTACTTCCGC TCTGACCGCG CTGCAAAGAA GTCTTCCTTC
TACAACAGTT CCGAGTTTGA CAACCTTGTT GACCAAGCTC GCCTCACAGT CAATGACAAT
GACAAAGTTC TTGAGCTTAC CCGTCAGGCA GATAGTCTCA TGACACACAC AGACTACGCT
TGCCTGCCTA TTGACTGGCC ACAGATGCCC TACGTCCTAA AGCCAGAATT TACGGGCCTC
AGCGTTCTGG TCAATCCTCA CTTTGATAAG GTTAAGAAGA AGTAA
 
Protein sequence
MKEHVFNNLS RKSFLRGSLA AAVAAGSASL LSACGGSASG DGEKKVLRFG VNNPKVTFDT 
QKTSGSVGVS EAVAESLLVL NPDTKEIEPN LVTGLPTVSD DGLTYSFELK DGVKFHNGET
LKSSDVKYTL TRMFLPATKA TSIDSYAYIE GAKDIIAGKT EELSGVVIKD DRHFDIKLTQ
PYSTFNAIMA QFYAVIYPEK ACKEAGEAWG TETNFIGTGA YKLVSNDSAT EVVLEGFADY
HEGKPGLDEL RFVYIDDANT RVLNYKNNDV DLVFISQSLI QQYQNDESIS KEIVNYTPAS
TQFVNLNLQS QNLKDVRVRQ ALSMAIDRDT ICSTILSNVA KPAKSFIPSS ETGYNESAPE
FEYNVDKAKQ LLAQAGVSNL TLNAQVRSQD QNLMVAIQDA WSKIGVTCNV SVIDSGVWSD
ARANGELEVT LVTWSTLSFQ GIEHMGSYFR SDRAAKKSSF YNSSEFDNLV DQARLTVNDN
DKVLELTRQA DSLMTHTDYA CLPIDWPQMP YVLKPEFTGL SVLVNPHFDK VKKK