Gene Apar_0588 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0588 
Symbol 
ID8413442 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp655992 
End bp658871 
Gene Length2880 bp 
Protein Length959 aa 
Translation table11 
GC content44% 
IMG OID645022160 
Productphage tape measure protein 
Protein accessionYP_003179609 
Protein GI257784392 
COG category[S] Function unknown 
COG ID[COG5412] Phage-related protein 
TIGRFAM ID[TIGR02675] tape measure domain 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.739984 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.209865 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGGAA CAGTAGTAAG AGGCTCTGTT CTGCTTACTC CTAAGTTCGA CAACCTCGGA 
GCTAATGTAA AGCGTGCTTT AGGTAGTGGC TATAAGTCAG CGGTATCTGT GCATACAAAT
GCTGGACGGC AAGCGGCGCA AAACTATGCA GGCGGCTTTG GTGGTGCAAC CGGAGCCATT
ATGGGAGTCG TTTCGAGCAT TACTACACGC GCCCTAGATG CTATTTCTGG CTCAATTGCA
TCAGCAGTAA ATCGTGTAGA TACGATTGCA AACTTTCCAA AAATTATGTC ATCTATTGGC
TACTCAGCTG ATGATGCGCG AGCAACTATT CAGCGTCTTT CAGACGGTAT TGATGGACTT
CCGACATCTC TTGATGCCAT TGTTGGCTCT GTACAGAAGA TTGCTCCTGT ATCTGGCTCG
CTTGCTACGG CAACAGATGT TGCTTTGGCA TTTAACAACG CTCTTTTGGC TGGTGGCAAG
AGTCAAGAGG TAATGAATTC TGCTTTTGAG CAGTATTCTC AGATGCTTTC AACTGGCAGA
GTTGACATGC AGTCATGGAA GATTCTCGCG CAAGCCATGC CAGGACAGCT GAATCAAATT
GCCAAAGCTC TATTAGGCGC TAATGCAAAT CAAGCAGACC TCTACAAGGC TATGCAAAGC
GGCACAATTA CATTTGACCA ATTCAACGCT GCAATTGTAA GCCTTAACAA TGAGGGTCTT
CCTGGGTATG CCTCATTTGC GGAACAAGCA AAAATCTCAA CAGAATCAAT TGGCACTGCC
TGGACAAATG TACAGAACCG CATTAATAAG GCAGTTGCAA AAATCATTGA TCACATTGGT
CAAGCTAACA TTGCCGGCGC AATCAACAAC TTCTCAAGTA GTTTTTCTGG CATCGCTGAT
ACGATAATTA CCTATCTTGA TCCTGTCATT TCCACTGTTG GTTCGTTTAT GAACCAGCTT
CAAAATAACG GGGCAATCAC ATCGTTTGGC AATACTTTAA ATGCTCTAAA AGACGTATTT
GATAGCACTA TCGGACTTAT TGGCGACCTC GTAACAACGT TTACTGGTCT CGATAACTCA
GAGAATGCTT CACGTAGCGC AGCAGATTTG CTTAAGGCCG CTGTTGATGG TGTTAAATCT
GCCATAGAGT TTGCACGAGA CGCAGTTCAA GGCTTAAGAG ACAATCTTAC GGTTGTGGCG
CCTGTCATCG TTGCTGTAGC CACCGCCTTG ATTGCTTATG AGACGATTAA AGCGGTACGC
TCAATAGCAG ATGACTTTGG ACTTTTAAAA AGTGCTGCTT CACTTGCCTT TGATGCTATC
AAGGGCGGAG AAGGCGCCCT ATCAACGCTT TCTATTTTTG GTGAACTTGC CGGCGAAGGC
GGAGTACTTG CAAGTATCTT CGGAACGATT TCAACAGCTA TTAGTGGTGT TGGAACAAGC
CTTCTTGCAC TCGTAGGATC TATCCCTGTT ATTGGTTGGA TTGTTATTGC AATTACCGCG
GTTGTTGCGG TTATAACGTG GCTTTGGAAT ACAAATGAGG ACTTCAGAAA CGCTGTAATC
AACATCTGGA ACACTATTTG TAGTGCTGTT AGTGGTGCTG TAAGTGCTAT TGGAGAGTTC
TTAGGTGGTG TTTTAGGCGG CATTGTTGAG GGAGCTAAGG CTGTTTGGAG CGGTCTTTCT
GGTGCCGTTA TAAGTGCCTG GGATGGCATA GTTACTTTCT TTACTGTCGA CTTACCTAAT
ACGTTTAATC AGTTTGTGTC ATTTTTATCT GGTATTCCTG CTGCAATCGG AGCATTTTTT
GAGGCATTGC CAGGACGTAT TTTGTATGGA TTGACGTTTG CGATTGTTTT TGTTGTTGCG
TTTTTTGCGA GTGTCGGCGC CAAGATTAGT GAATTTGGAG CAACTGTTAT TCAGCGGCTT
ATTTCATTCT TTACCGTTGA TATTCCAAAC GCAATTTTAA GCTTTGTACA GTCTGTTACA
ACGTTTTTTA CGGTCGATGT ACCTAATGCC TTTAATCAAT TTGTGACATT TGTGCAAGAG
TTACCTGGAA GAATTCAAGA AGCACTCGCC GAAATGCTCG TTAATGTTGT CTTATGGGCG
CTCGACGTGT ATACGCAAGC TTGTGAGGCT GGCTCTAATT TCTTAAGTGG TGTTAGTCAA
TTCTTCTCGC AACTACCAGG TCAGATTTGG TCGTGGCTTA TGGGAGCTAT AGCGTCTGTT
TCAAGCTTTG TCTCAAATAT TGCTTCACAG GCGGTATCAG CTGGTAATGG CTTTTTAAAT
GGAATCTCTA GTGGCTTTAA TTCAGCAATA AGCTTTATCA GTAGTATTCC AGGACAAATT
ACTAGCTTTT TTGCTGGCTG TGGAAGCTGG CTTATTAACT CTGGTCGTGC ACTTCTAGAT
GGTTTTGCAC AAGGTATTAG AAACGCTGTA AGTACTGTTA CAAACGCTGC GTCCGATGCA
CTTAGTGCCG TTCGTAAGCT TTTTCCATTC TCGCCTGCGA AGAAGGGACC ATTCTCAGGT
CATGGCTACA CGACATATTC TGGTCGTGCT CTTATGCGTG ACTTCGCAAA GGGAATCAAG
GGAAGTTCAT CGCTTGCTGA GACAGAGGCA ATGAGTGCAC TGTCAAGTGT GCATGACGTC
TTTAACAACG CTCGTCCTTT GAGCTTTTCA GCAGTTGCTG ACGCTAATAC AAACGGTATT
TATCGTGCGG CTTTTGAGCT TGACTCAAGG CAGCAACGTG CTAATGCAAC AACACTTGCA
GACATCTATG ACTTTATGCG CAACGGTGAG CTTGGACAGG TTATTGATGA GAATTCTAAT
AACATCGGTG ACCGAGACTT TGCGAGAGCG GTTCAAAAGG CGGTGAGAAC AAATGCATAA
 
Protein sequence
MAGTVVRGSV LLTPKFDNLG ANVKRALGSG YKSAVSVHTN AGRQAAQNYA GGFGGATGAI 
MGVVSSITTR ALDAISGSIA SAVNRVDTIA NFPKIMSSIG YSADDARATI QRLSDGIDGL
PTSLDAIVGS VQKIAPVSGS LATATDVALA FNNALLAGGK SQEVMNSAFE QYSQMLSTGR
VDMQSWKILA QAMPGQLNQI AKALLGANAN QADLYKAMQS GTITFDQFNA AIVSLNNEGL
PGYASFAEQA KISTESIGTA WTNVQNRINK AVAKIIDHIG QANIAGAINN FSSSFSGIAD
TIITYLDPVI STVGSFMNQL QNNGAITSFG NTLNALKDVF DSTIGLIGDL VTTFTGLDNS
ENASRSAADL LKAAVDGVKS AIEFARDAVQ GLRDNLTVVA PVIVAVATAL IAYETIKAVR
SIADDFGLLK SAASLAFDAI KGGEGALSTL SIFGELAGEG GVLASIFGTI STAISGVGTS
LLALVGSIPV IGWIVIAITA VVAVITWLWN TNEDFRNAVI NIWNTICSAV SGAVSAIGEF
LGGVLGGIVE GAKAVWSGLS GAVISAWDGI VTFFTVDLPN TFNQFVSFLS GIPAAIGAFF
EALPGRILYG LTFAIVFVVA FFASVGAKIS EFGATVIQRL ISFFTVDIPN AILSFVQSVT
TFFTVDVPNA FNQFVTFVQE LPGRIQEALA EMLVNVVLWA LDVYTQACEA GSNFLSGVSQ
FFSQLPGQIW SWLMGAIASV SSFVSNIASQ AVSAGNGFLN GISSGFNSAI SFISSIPGQI
TSFFAGCGSW LINSGRALLD GFAQGIRNAV STVTNAASDA LSAVRKLFPF SPAKKGPFSG
HGYTTYSGRA LMRDFAKGIK GSSSLAETEA MSALSSVHDV FNNARPLSFS AVADANTNGI
YRAAFELDSR QQRANATTLA DIYDFMRNGE LGQVIDENSN NIGDRDFARA VQKAVRTNA