Gene Apar_0142 details

Gene Information

Locus tag: Apar_0142
Symbol:
ID: 8412988
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 161820
End bp: 162893
Gene Length: 1074 bp
Protein Length: 357 aa
Translation table: 11
GC content: 50%
IMG OID: 645021712
Product: basic membrane lipoprotein
Protein accession: YP_003179169
Protein GI: 257783952
COG category: [R] General function prediction only
COG ID: [COG1744] Uncharacterized ABC-type transport system, periplasmic component/surface lipoprotein
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
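
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 161820
end_bp = 162893

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and its final codon is a stop codon,
# which does not contribute an amino acid to the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1074 357
```

Under these assumptions the computed values agree with the listed Gene Length (1074 bp) and Protein Length (357 aa).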


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAAGA ACGTTTCTCG TCGTAGTTTT GTGACTGGCG CAACCGGCGC TGGCGCACTT 
GCAGCACTTG CAGGTCTTGC AGGCTGCAAC CAGAAGAAGG ACGACGGCGG CCAGAAGTCA
TCAGGTGAGA AGAAAAAGAA GCTCACCATG GTTACTGATA CTGGTGGTGT AAACGACCAG
TCCTTTAACC AGTCCGCTTG GGAGGGCATG ACTGAGCTCA AGGACAAGAA CGGCTGGGAT
GTCAGCTACC TTGAGTCCAA GCAGGATTCC GATTACGCAA CCAACCTTGA TAAGGCTGTT
GACGCTGAGT CTGACCTTGT TTGGGGTATT GGTTTTGCTA TGGCAGATGC AGTTGTTAAG
GCTGCTAAGG ATAACTCCAA CACCCAGTTT GCCATTATCG ACAACGCTAA TGATTCTGAG
GTTGCTAACT TGACCGGCGT AATGTTCCGC GCTCAAGAGC CTTCCTTTGT TGTTGGTTAT
ATTGCTGCTC GTACTACAAA GACTGGCAAG GTTGGCTTTG TTGGCGGCAT TTCTTCCGCA
CTTATCGACC AGTTTGAGTA TGGTTACCGC GGCGGCGTTG CTTATGCAAA CAACGAGAAC
AACACTAACG TTGAGATTTC TGCACAGTAT GCAGAGTCCT TCTCCGATGC TGCAAAGGGC
AAGGCTATTG CAAACTCTAT GTATTCCGAT GGTTGCGACG TTGTCTTCCA CGCTGCAGGC
GGCGTAGGAA CTGGCGTTAT TGAGGCAGCT AAGGACGCAA ACAAGCTTGC CATTGGTGTT
GACCGCGACC AGGCATATCT TGCTCCAGAG AATGTTTTGA CCTCCGCTCT TAAGCGCGTC
AATGTTGCTA TCGTTGAGAT TTCCGAGAAG ATTTTGAAGA ACGGCCAGAA GGGTGGTAGC
ACCATTTCTC TGGGCGCTTC CGAGGACGCT GTCGGAATTG CTGAGGATCA CCACCTCATG
GCAGACGATA CCTACAAGGC AGCTACTGCC CTTTTAGACA AGATCAAAGC TGGCTCTGTT
GTTCCTCCTG CAAGCAAGGA CGATTTGGAC AAGTACGTTA AGTCTTTGAG CTAG
 
Protein sequence
MNKNVSRRSF VTGATGAGAL AALAGLAGCN QKKDDGGQKS SGEKKKKLTM VTDTGGVNDQ 
SFNQSAWEGM TELKDKNGWD VSYLESKQDS DYATNLDKAV DAESDLVWGI GFAMADAVVK
AAKDNSNTQF AIIDNANDSE VANLTGVMFR AQEPSFVVGY IAARTTKTGK VGFVGGISSA
LIDQFEYGYR GGVAYANNEN NTNVEISAQY AESFSDAAKG KAIANSMYSD GCDVVFHAAG
GVGTGVIEAA KDANKLAIGV DRDQAYLAPE NVLTSALKRV NVAIVEISEK ILKNGQKGGS
TISLGASEDA VGIAEDHHLM ADDTYKAATA LLDKIKAGSV VPPASKDDLD KYVKSLS
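
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first 30 bases and on the last line of the gene sequence. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code for elongation codons (table 11 differs mainly in the start codons it permits). The helper `translate` and the variable names are hypothetical and written only for this illustration.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
# Assumption: these assignments apply to translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to format the sequence blocks.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
first_block = "ATGAACAAGA ACGTTTCTCG TCGTAGTTTT"
# Last line of the gene sequence (it starts in frame, since each full line holds 60 bases).
last_line = "GTTCCTCCTG CAAGCAAGGA CGATTTGGAC AAGTACGTTA AGTCTTTGAG CTAG"

print(translate(first_block))  # prints: MNKNVSRRSF
print(translate(last_line))    # prints: VPPASKDDLDKYVKSLS
```

Both results match the start and the end of the listed protein sequence.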