Gene Apar_0178 details


Gene Information

Locus tag: Apar_0178
Symbol:
ID: 8413026
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 207146
End bp: 208333
Gene Length: 1188 bp
Protein Length: 395 aa
Translation table: 11
GC content: 47%
IMG OID: 645021750
Product: protein tyrosine/serine phosphatase
Protein accession: YP_003179205
Protein GI: 257783988
COG category: [T] Signal transduction mechanisms
COG ID: [COG2365] Protein tyrosine/serine phosphatase
TIGRFAM ID:
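
The coordinates and lengths listed above are consistent with one another, which the following minimal Python sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(207146, 208333)
print(length)                  # 1188
print(protein_length(length))  # 395
```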


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAAAGT TGCAGAAACG GTTTACTGCG TTTTTGAGTA TGTTTATGGC TCTGGTGCTC 
TCGTTTATGT TGGTAGGGTG CAACGTGGCG CCTTCTCAGC AAGAGAACAA GCCTGCGGAG
CAAACGCAGC AGAAAGAGCT TACAACTGGG GCGCTTGCTG CTACGCATGA GACCAAGTTT
GGTGGTATCT ATCTGGATAT TACCATCGAG GACTTTAATA AGATGGGCTT TGAGTTTGGT
GATAGCGTAG ACGTTACGTT TAGCAACGGC TACAAGCTTA TCGATATTCC GTACTATAAC
GGCTACTATA CCAAGACTGG TGAAGCTTTG ATTAGCGGAT ATCCCGGATA TCCTCATATT
GACGTATGCG TGAACAACGG AGATCCTCTG TGGGAGACCG CTGGTCTTAA AGAGGGCGAT
ACCGGCACCG TAACTCTGCA CGAGAAGCAG AAATACGCAA CAGTTCAGAA GGCTCTTGGC
GCAACATACA CTAAAAATCG CTCTGACTAT GCAAGTGACG AGGTCTTTGC CAACTTCCGC
GCTATGAGGG GCGGAAATAT GGCAGAGAAC GTAATGTATC GTTCGGCTTC ACCTATTGAT
AACCAGAACA ATCGTGCTCC TTATGCTGCC GAGCTTGCTC AAAAATGCGG CGTTCAATAC
ATTTTGGATC TTGCTGATAG CAATGAAGAG ATTGAGGGCT ATTACCAGAA AGCCGATTAC
GATGTTTCTT GGCATAAGGG ACTATATGAG GCCGGAAACG TTACGGCTCT GGACTTGAAT
GCAAATTATC GTAGTAAAAA GTATGCTGAG CGTCTGGTTG CAGGCTTGCG CGAGATGATT
AAGCATGAGG GACCATATCT TACTCACTGC ACGGAAGGCA AGGATCGTAC GGGCTTTACC
TGCGCTTTGC TTGAAGGCCT TTGCGGTGCA TCGTATGAGG AAATGCGCGA TGACTATATG
GCTACCTATG ACAACTATTA CGGTATTAGC GAGAAGAACG ACAAAGCTCG CTACGATGCT
GTCGTAGACG TTAAATTCAA CGACATTGCT CTTTGTGTTG GTGGACAGCC TCTGGGTGGC
TCGCTTGACG GTTTAGATTA TGCAGCTGGT GCTCGTAAGT ATCTGACCGA CGCTGGAATG
ACTGATGCCG AAGTCGATCA GCTTATTGCA AAACTGACAA AGAAGTAG
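
GC content is the percentage of G and C bases in a sequence. The sketch below shows one way to compute it, assuming the spaces and line breaks in the listing above are formatting only. The function name `gc_percent` is hypothetical, and the example runs on just the first two 10-base blocks rather than the full gene.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq."""
    # Drop whitespace and anything that is not an unambiguous base.
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc_count = sum(b in "GC" for b in bases)
    return 100.0 * gc_count / len(bases)


# First 20 bases of the gene sequence: 8 of them are G or C.
print(gc_percent("ATGCAAAAGT TGCAGAAACG"))  # 40.0
```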
 
Protein sequence
MQKLQKRFTA FLSMFMALVL SFMLVGCNVA PSQQENKPAE QTQQKELTTG ALAATHETKF 
GGIYLDITIE DFNKMGFEFG DSVDVTFSNG YKLIDIPYYN GYYTKTGEAL ISGYPGYPHI
DVCVNNGDPL WETAGLKEGD TGTVTLHEKQ KYATVQKALG ATYTKNRSDY ASDEVFANFR
AMRGGNMAEN VMYRSASPID NQNNRAPYAA ELAQKCGVQY ILDLADSNEE IEGYYQKADY
DVSWHKGLYE AGNVTALDLN ANYRSKKYAE RLVAGLREMI KHEGPYLTHC TEGKDRTGFT
CALLEGLCGA SYEEMRDDYM ATYDNYYGIS EKNDKARYDA VVDVKFNDIA LCVGGQPLGG
SLDGLDYAAG ARKYLTDAGM TDAEVDQLIA KLTKK
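
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch of that step, not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are permitted), and it does not handle alternative start codons or ambiguous bases. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGCAAAAGT TGCAGAAACG GTTTACTGCG"))  # MQKLQKRFTA
# The last 18 bases end in the stop codon TAG.
print(translate("AAACTGACAA AGAAGTAG"))                # KLTKK
```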