Gene Apar_1264 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_1264 
Symbol 
ID8414143 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp1419596 
End bp1421656 
Gene Length2061 bp 
Protein Length686 aa 
Translation table11 
GC content40% 
IMG OID645022856 
Productconserved repeat domain protein 
Protein accessionYP_003180280 
Protein GI257785063 
COG category 
COG ID 
TIGRFAM ID[TIGR01167] LPXTG-motif cell wall anchor domain
[TIGR01451] conserved repeat domain 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGA GTTTTCTTAA GAGAATCTCG GTTTTTCTAA CCATGATTAT GGCATGCTTG 
GTTATGTTTG TTAACACAGC TCAGGCATTT GATCGCCAGA AAGACGATAG TGATCTTACT
CAGATTAACA TCTATCAGTT CACTATGACC AGGGATAACA TGACTGTTCT CTCACAAGGG
GATGGGGTTA AACGCGTGGA GATTCCTGCG TCAGATACAG GTAATCTTCC TTCAACAGCT
TTTGTTATGC AGCCTACTAA GGATGGCTCT AACCATCAGT ATAATCAGCC ACTTTCATTG
AAGTTTTCTA ATGCTGGCAC CGTTGATGGT GAATCCGTTG ATGTTTATGT AACGGTTAAT
TCTTTGGACC TTACCCTTAA AAATACAAAT GCAGACTACA ATAATCCTAA TAAGACTGAT
GTTCCGTTTT TGACTGTTGA TGAAAACTGG GGAACAAAGT CTTTCTCGCT TATGGATTAT
ATTGATGTGA ATCACCCTAG TTATACAGCA GACATGCTTG GATCCTATGC AATTAACGCT
AATGTAACTA TGGAATTAAG GTACTCTGAT GGAACGCCGT GCAACCTTAA ACTTGTCATG
CAGCCAAGTG ATATTGACGT TTTAAATGGC GGCACAAATG AGACTTTTTC TCTGGTAAAT
GCAGAGAGCA CCGTTGACAG TATTGTTATG AGCAATAGAA ATGTTCTTAC AGAAACTACA
AATGGTAATA AAATAACGTG GAATCCAACT CGTCCAACTT CTGGAAACGA TCAAGAAAAA
AATCTTGCGG GTTTTGCTGT TAAGTCAAAG TCCAATTCAT TAACTTTTGA GTCTACAAGT
GCTGCTACAA GTGGTAGCCT TTTTGGTGCT TATACTGAAG TGATAAGTCC AGCTCCTGTA
AAAGCGGTTG ATCCAGAGCA GGCTCCTGCT AAGGCTGGGG AAGAAATTAC TTACACTGGA
ACATTTACTT TACCAAGACA AGGCATTGAT ACTATCGGCA AGATCAAGTC GATGAGTATG
GTTGATACGT TTGATGAGCG TCTTGACTAT CAGAGCCTTA GCGTTTCGTT TGATGGACAG
ACTCTTACTG AAGGCACCGA TTACACCGTT TCTGTCGATG GTCAAAAGGT GACTGTAGAC
ATTGATGCTC ATTTACTTAC CAAAGAAAAT GGCGGTAAAA AGTTTGTCAT TACGTATAAA
ACTCTAACTA ATTCAAAGAT AGAGACTGAC AGTTCAAATA TTGATAATGA GCTTACCCAG
GTTGTTGACG GTAACATTGC TCACTCTAAT AAAGTAACCA CAGAGCTTCT TTATGAGAAG
ACTCATGAGT ACGTTAGTGG CACCCCTAAT AAGGAACTTC CACAAGAGGT TCTGGATTTA
CTTCCTGGTA AGCAGACCAG AATTCCAAAC GGCACAACTG TTACACCTGA TCAACCACTT
GGTGGAGTAA CTCGTGTTGA AACTTCTGAT GGAACTTGGG TGTTCATTGG TTACGATCAC
GATTCTGAGA TTATTGATCA CAAGAACGCA CACTTCATTG GTGTTTGGGT GATTTTGCCT
CAGCCAAAGA AGGACGTTCT TGATAGTGAG GGTAATTCTA TTGATGGTAA TAAGGTAACT
GCAGGACAAG TACTCACTTA TTCTGTGACA TATACCAATA CCACCAATAC TGCTCGTGAT
GTTACGGTTA CTGATGTTAT TCCAGAGCAC ACAACTTACG TTGATAATTC TGCTGATAAC
GGTGGAGTTT ATGATAAGGC TACTCGTACT GTAACCTGGA CGAAAAATGT TGCACCTGGT
GAGACCCTCA CGGTTACTTT CCAAGTTAAG GTTAATAAGG GCGTTAAGGA TATTACTGTT
GTGAATACTG CTCACGTCAG TGATGGTCTC ATTGACACCG ATACTAACAC TACAAAAAAT
CCTGTTATAC CTAAGCCACG TAAGTCTCGT GTTCCAAATA CTGGTGACAA CACAATGCGT
GATGTAATTA TTGTTGCTGG TTTAGGTGGA ATAGCTCTTC TTATAGTTAT TGTTTTAAAA
CTTCGCTCTT CGAGAAAGTA A
 
Protein sequence
MKKSFLKRIS VFLTMIMACL VMFVNTAQAF DRQKDDSDLT QINIYQFTMT RDNMTVLSQG 
DGVKRVEIPA SDTGNLPSTA FVMQPTKDGS NHQYNQPLSL KFSNAGTVDG ESVDVYVTVN
SLDLTLKNTN ADYNNPNKTD VPFLTVDENW GTKSFSLMDY IDVNHPSYTA DMLGSYAINA
NVTMELRYSD GTPCNLKLVM QPSDIDVLNG GTNETFSLVN AESTVDSIVM SNRNVLTETT
NGNKITWNPT RPTSGNDQEK NLAGFAVKSK SNSLTFESTS AATSGSLFGA YTEVISPAPV
KAVDPEQAPA KAGEEITYTG TFTLPRQGID TIGKIKSMSM VDTFDERLDY QSLSVSFDGQ
TLTEGTDYTV SVDGQKVTVD IDAHLLTKEN GGKKFVITYK TLTNSKIETD SSNIDNELTQ
VVDGNIAHSN KVTTELLYEK THEYVSGTPN KELPQEVLDL LPGKQTRIPN GTTVTPDQPL
GGVTRVETSD GTWVFIGYDH DSEIIDHKNA HFIGVWVILP QPKKDVLDSE GNSIDGNKVT
AGQVLTYSVT YTNTTNTARD VTVTDVIPEH TTYVDNSADN GGVYDKATRT VTWTKNVAPG
ETLTVTFQVK VNKGVKDITV VNTAHVSDGL IDTDTNTTKN PVIPKPRKSR VPNTGDNTMR
DVIIVAGLGG IALLIVIVLK LRSSRK