Gene Apar_1141 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_1141 
Symbol 
ID8414014 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp1288173 
End bp1289453 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content44% 
IMG OID645022730 
Productpreprotein translocase, SecY subunit 
Protein accessionYP_003180160 
Protein GI257784943 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0201] Preprotein translocase subunit SecY 
TIGRFAM ID[TIGR00967] preprotein translocase, SecY subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000440606 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0147779 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGTAAAG GAATCGCCAA CGCATTTCGT ATTCCGGAAC TAAGGGGAAA GATTCTCCTT 
ACGGTACTGA TTCTTCTTCT GTACCGTTTT GGTGCATATC TTCCTGTGCC CGGTGCTCCA
TTCCAGCAGA TGCTTTCTGC ATATCAGAAT GGTCTTGCTA AGAATGGTGC AATGGCCGTG
CTTAACCTAT TTTCGGGTGG CGCTCTTTCT CGCATGTCTG TTTTCAGTCT GGGCATTATG
CCTTACATTA CGGCTCAGAT CATCTTCCAG ATGATGCAGT CCGTTATTCC ATCTCTTGGT
GAGCTTGCTA AAGATGGAGA GAGCGGTCAG CGTAAGATTA CGCAGTACAC GCGTAATCTT
ACGGTTGGCT TAGCTCTGCT CAATGCAATT GGATACCTGT TCTTGTTTAA GAGCTATGGT
GTATCTTTTG CATCTATGGA GGGCGTGCCT GAGGCACTTG AAAACTTCCT GGTAGTTTTT
ACCATGCTAG TCGGTGCCAT TATTATTATG TGGCTCGGTG AAGTTATTAC TCAGCAGGGT
GTTGGCAACG GAATGAGTTT GATTATCTTT GCAAACATTA TGGCTGGACT TCCAACAGCC
CTTATCTCTT CAGTAACTAC TCGTGGAAAT ATTGTTCTTA CCGTTGTTAC TGTTGTGGTT
ATGCTCGCTA TTATTCCGCT CATTGTTTAC ATTGAGCGTG GTCAGCGTCG TATTCCTATC
AACTATGCCA AGCGCGTTGT TGGTCGTCGT ATGATGGGTG GTCAGTCAAC TTACCTTCCA
ATTAAGGTAA ATACTGCAGG CGTTGTACCA ATTATCTTTG CATCTGCAAT CCTATATCTT
CCTGCTCAGG TTGCTGTGTT CTTCCCAGGA GTAGAGTGGA TGCAGAAGTT TGCTACTTCA
CTTTCTTCTG GTTGGGTTAA CTGGGTTCTC TCAGTTATAT TCATTGTCTT CTTTGCTTAT
TTCTACACTT CAATGGTGTT CAATCCAGAG GAGACAGCAG ACCAGCTTAA GAAGCAGGGT
GGTTTTATTC CTGGTGTTCG ACCAGGTACT GCCACTTCAA CGTATATCAA GACTGTTATT
GACAGAGTTA CCCTCCCCGG TGCTATCTTT ATGGCAGTTC TTGCAATCGT TCCAACTATT
ATTTTCTGGT TCACAGGCGA TTCGTTGATA CAGGCATTTG GTGGAACTTC TGTCCTGATT
ATGGTCGGTG TTGCAATGGA TACACTCTCT TCCATTGAGT CCCATCTTAA GATGCACAAT
TATGAGGGCT TCTTTAAGTA G
 
Protein sequence
MSKGIANAFR IPELRGKILL TVLILLLYRF GAYLPVPGAP FQQMLSAYQN GLAKNGAMAV 
LNLFSGGALS RMSVFSLGIM PYITAQIIFQ MMQSVIPSLG ELAKDGESGQ RKITQYTRNL
TVGLALLNAI GYLFLFKSYG VSFASMEGVP EALENFLVVF TMLVGAIIIM WLGEVITQQG
VGNGMSLIIF ANIMAGLPTA LISSVTTRGN IVLTVVTVVV MLAIIPLIVY IERGQRRIPI
NYAKRVVGRR MMGGQSTYLP IKVNTAGVVP IIFASAILYL PAQVAVFFPG VEWMQKFATS
LSSGWVNWVL SVIFIVFFAY FYTSMVFNPE ETADQLKKQG GFIPGVRPGT ATSTYIKTVI
DRVTLPGAIF MAVLAIVPTI IFWFTGDSLI QAFGGTSVLI MVGVAMDTLS SIESHLKMHN
YEGFFK