Gene Apar_1090 details


Gene Information

Locus tag: Apar_1090
Symbol:
ID: 8413963
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 1232791
End bp: 1234344
Gene Length: 1554 bp
Protein Length: 517 aa
Translation table: 11
GC content: 49%
IMG OID: 645022679
Product: PSP1 domain protein
Protein accession: YP_003180109
Protein GI: 257784892
COG category: [S] Function unknown
COG ID: [COG1774] Uncharacterized homolog of PSP1
TIGRFAM ID:
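
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal sketch under two assumptions: that the coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1232791, 1234344)
print(length, protein_length(length))  # 1554 517
```

Both values match the Gene Length (1554 bp) and Protein Length (517 aa) reported for Apar_1090.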


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.220883
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCACCG TTGTCCCGGT GAAATTTGAA TACGCCGCAC GCGACCTTTG GTTTGATCCA 
AAGGAGTCTG GCGTTCTTGA AGGCGACCAC GTCATTTGTC AGACTGAGCG TGGTCTAGAG
ATTGGCCTTG CTATTGCTGA TCCTCGTGAA ATTTCTCAGG AGGAGTTTGA ATACAAGACA
AACAAAGCTG AGCTTAAAGA TGTTGTTCGC GTAGCAACTG AAGAAGATCT TTCTCGCGCA
GAAGAACTTG CCACTAAGGG CGAGGCTGCG CTTCCCATAT TCAGGAAGTT TGTTGTTCAA
GAAGAACTTG AGATGAAGCC TATTGGCGTT GAGTATCTTT TTGATGGCGA GAAGGTCGTT
TGTTACTTCT CAGCTGATGA TCGTGTGGAC TTTAGACAGC TTGTCCGTGA GCTTTCTCAT
GAACTTCATG AGCGTATTGA TATGCGTCAG ATTGGCGTCA GAGAAGAGGC GGCGGTCATT
GGTGGCTATG GTCACTGTGG ACAAGAGCTT TGCTGTAGAA GGTTTGGTCT TTCTTTTGAG
CCAGTTTCTA TTCGTATGGC AAAAGAGCAG GATTTGCCAC TGAACTCCAC TAAGATTTCT
GGTGCTTGTG GCCGCTTAAT GTGCTGCTTG CGCTATGAGT TTGAAGCGTA TCGTGATTTT
AAGAACCGTG CTCCAAAGCG TAATGCTGTT ATTGAGACTC CTTTGGGTAT GGCAAAAATT
GTTGAGTACA ACACACCAAA AGAAGAGATT GCACTTCGTC TTGAGAGCGG CAAAGTAGTC
CGTATTCCTC TTGCTGACAT GGATGCATCT CCTGCAGCAC AGCAGAAGTC GGATGAGCTA
GGTTGTTCTT GCAGACCAGA CAGTGTTTCT CGCGCTGCAC TTGAGCGTCT TGAATCTGTT
GAGGTTCAGA TGGCCCTGGC AGAACTTGAT AAAGCAAACG GTTTGATAGT TGATGAAGAG
CCAGAAATCA ATCCTGACCT CTTTGTTACT GAGGCTCCAA GACGCAAACG TGAGCGCTTT
GAGCAGAGCA ATGCAAATAG GCAGGAAAAT GCGCAGAAGT CTAGCGTAAG AAACACTCGA
GAAGAACAGG GTACTCAAAG CTCTTCTCGC ACCCGTCGTG TAAGAACCTC AAAGAATGCT
CAAGTAAACC CTGGTTCCAG TACTTTTGAT GCCGCAACTG ATACACTGCG TCGCACGCGT
CGCCGTCATC ACGTAACAGA CGATGGTGTC ACAAACACTC AGACACCCCA GAAGAACGAG
CAGAACTCAG CTTCTGCTCA GAAACAGCCT CAGGGCAAGC GTTTTAGAAG GACAGATGCT
CCTCAGAGTG CACAGAGTGC CGATGAGACT CAGGCTCCGG TCCGTACAAA GAGAACTCGC
CGTCCAGGCG ATAGGGCGGG CATGAAGGCC CAAAGTTCTT CACAGAGTGC TTCTGGCTCT
GGCAGTGCAC CTAATGCTGG ATCTGTAGAC GCTCAGTCAC AGAATGCTAC GCGTCGTAGA
CATAGAAGAG CTGGGCGAGA TAATCGTAGC CGCGGTTCAA AGGACAATGC GTAA
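
The GC content field (49%) can be recomputed from the gene sequence above. The sketch below assumes GC content is defined as the percentage of G and C bases over all bases, and it strips the spaces and line breaks used to display the sequence in blocks of ten. The function name `gc_content` is hypothetical, and only the first displayed row is used here for brevity; pasting the full sequence gives the value for the whole gene.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace in the input."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First displayed row of the Apar_1090 gene sequence (60 bases).
first_row = "ATGCCCACCG TTGTCCCGGT GAAATTTGAA TACGCCGCAC GCGACCTTTG GTTTGATCCA"
print(f"{gc_content(first_row):.1f}% GC in the first 60 bases")
```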
 
Protein sequence
MPTVVPVKFE YAARDLWFDP KESGVLEGDH VICQTERGLE IGLAIADPRE ISQEEFEYKT 
NKAELKDVVR VATEEDLSRA EELATKGEAA LPIFRKFVVQ EELEMKPIGV EYLFDGEKVV
CYFSADDRVD FRQLVRELSH ELHERIDMRQ IGVREEAAVI GGYGHCGQEL CCRRFGLSFE
PVSIRMAKEQ DLPLNSTKIS GACGRLMCCL RYEFEAYRDF KNRAPKRNAV IETPLGMAKI
VEYNTPKEEI ALRLESGKVV RIPLADMDAS PAAQQKSDEL GCSCRPDSVS RAALERLESV
EVQMALAELD KANGLIVDEE PEINPDLFVT EAPRRKRERF EQSNANRQEN AQKSSVRNTR
EEQGTQSSSR TRRVRTSKNA QVNPGSSTFD AATDTLRRTR RRHHVTDDGV TNTQTPQKNE
QNSASAQKQP QGKRFRRTDA PQSAQSADET QAPVRTKRTR RPGDRAGMKA QSSSQSASGS
GSAPNAGSVD AQSQNATRRR HRRAGRDNRS RGSKDNA
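
The protein sequence above is the translation of the gene sequence under translation table 11. A minimal sketch of that translation follows, assuming the amino-acid assignments of NCBI table 11 (which match the standard code; table 11 differs only in its permitted start codons, and this gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical, and only the first displayed row of the gene is translated here.

```python
from itertools import product

# Codon order follows the NCBI layout: each position cycles through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First displayed row of the Apar_1090 gene sequence (60 bases, 20 codons).
first_row = "ATGCCCACCG TTGTCCCGGT GAAATTTGAA TACGCCGCAC GCGACCTTTG GTTTGATCCA"
print(translate(first_row))  # MPTVVPVKFEYAARDLWFDP
```

The output matches the first two blocks of the protein sequence (MPTVVPVKFE YAARDLWFDP).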