Gene Apar_1356 details


Gene Information

Locus tag: Apar_1356
Symbol:
ID: 8414247
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 1529252
End bp: 1531177
Gene Length: 1926 bp
Protein Length: 641 aa
Translation table: 11
GC content: 45%
IMG OID: 645022959
Product: hypothetical protein
Protein accession: YP_003180371
Protein GI: 257785154
COG category:
COG ID:
TIGRFAM ID:
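
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration and assumes that the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1529252
end_bp = 1531177

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue to the protein.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")    # 1926 bp
print(protein_length, "aa")  # 641 aa
```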


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.347792
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCATTA CACATGACTA TGTTGACTAT CTAGACGAGC AAATTGATAT TGCTCCTGCC 
GGTAGCCAAG AGGAGCTTCA GGCTGCTCAA ACTATTGCTG AAGAAATGAA AATTCACGGA
CTTGAAGCAA CCATTGAAGA ATTTGATGCT AAGTCTATTA AGAGTCTTGG CTATTCAGTC
TATCTAATTT TCCTCTTCAT TGGCATTATT TTTGCAGGAA CTAGTAACGT AGCACTCATT
GTTTTTGGTG TTTTGCTTGT TGTTGGTTTT GGTGCACTCA CTTGTCTTAA GTATTTTGGT
AATGACGTTC TCGGTACATT TGGATCTTCC ATTAGAAGTC AGAATGTTGT CGCTAAGCAT
GAGGCTACCG GTGAGCTTGT TGCAAAAGGC AATCGTCCAA TCGTAATCGT TGCTCATTAT
GATTCTCCAC ATACCAACTT CCTTGTTGAG TCTCCTGTTG CGAAGTACGT ACCTCTTGCA
CAAAGGTATG CACGTTGGTG TGTAGTTGCT GTATTTGTTG CTACTTTTGT TCAGATTCTT
CGTTTCCTTC CTGATTCAGT TCGTATCTTT TTCTGGATTG TTGGTATTCT TGCAGCTCTT
CCGCTTGTTG CGCTATCTAT TGCAACTATC GCAGAGCGTT TTGCACCATG TACTATCGGC
GCAAACAACA ATAAGTCTTC AGTTGCGGCT CTTCTTGGCA TTCTCGAAAA CGTTCGTCCT
ACTGGACATC GTCCTGAGGT TATTCATCAC TTTGCAGGAG ATGCTGCTGC GCTTATCCCA
GAGCCTGATG AGGTGGAAGG CGTTCGTCAT GGCGAGGAAG TTCTCAATTC ATTGGGCATT
CTTCCTAAGG ATTGCGAAGT AAGCTACGTT GCTTATGACA CTACTGGTGC CAGTCAGACC
GCATCTCTTG ATGATGTTGC GGAAGCTGTA AATGCCACAA CAGAAGAAGA ACAGGCTGAG
GATGCTGCAT CTGACAATAC TGTTGTTTAT GACTCTGTTG ATGATGAACT TAATGCAACC
ATGAAGCAGG TTCATGAGTC TGCAGATGCA AACGCAACAT TAGTACAGCC TGGAGAAGCG
CATGAGCTCC ATTCTAAAAA CGATTTTGCT CGTCGCGCGT CACTTTTCGA CCTACCCGAT
CCTTCTGGCG ATGCTGTTGA TCCACTGGCT CCATCTTCTG AACCAGCTCC TCATTATGTT
CCAGCTTCAA CACCTGCACC TACTCCTGAA ACAGAAGATG CGGAGGGTCC ATTTGACACC
ATTTCAGCTG ATGAGAGTCT AACGGAGACA CAAGACGCAA AGACTCCTGA GGCCAAGCGT
CGTTCTTTCA GGCTTTTTGG ACGCAACGAT GGGCCATCAG ATGACTGGAA GGGAGGAGCA
ACTCCATCTG CAGAAAATCG TGAAGAAGAT GACTCTGAGG ATGTATCTGC TATTTCTGAG
GACGATCTTC GTAATGCCGT TCTTTCACTT TCTGATGATG AGCTCATTTC GCATGACATT
TGGTTTGTTG CGCTTGGTGC ATCTGATTTT GATCACGCAG GCATGAGAGA GTTCCTTGCA
AAGCACAGAA CTGATATTCG TGGTGCTTTC CTCATCAACC TTGATTGTGT TGGCGCTGGT
TCGCTCTCTA TTCTTAAGAA TGAGGGAATT GGTAACGTTC GTCGTGCTGA TCGTAGAATG
ACTCGACTCC TTTCTACTAT TGCGACTGAT CTTCATATTG ATGTTGAGCA GAGTTCATTT
GACTGGGGAA CCACTGATGC AACTCCTGCA ATGCAGAATT CAGTTCGTTC TGTTACTTTG
ATGGGAATGA ATGAGGACGG TCTTCCTGCG TTTAGCCGCA CTGCCTCTGA TGTTCGTGAG
AACGTTAGTG CTGATCAGTG TGCTGACGCT GCAGCCCTTG TTACCGAGCT TATTAGACGC
TCATAA
 
Protein sequence
MAITHDYVDY LDEQIDIAPA GSQEELQAAQ TIAEEMKIHG LEATIEEFDA KSIKSLGYSV 
YLIFLFIGII FAGTSNVALI VFGVLLVVGF GALTCLKYFG NDVLGTFGSS IRSQNVVAKH
EATGELVAKG NRPIVIVAHY DSPHTNFLVE SPVAKYVPLA QRYARWCVVA VFVATFVQIL
RFLPDSVRIF FWIVGILAAL PLVALSIATI AERFAPCTIG ANNNKSSVAA LLGILENVRP
TGHRPEVIHH FAGDAAALIP EPDEVEGVRH GEEVLNSLGI LPKDCEVSYV AYDTTGASQT
ASLDDVAEAV NATTEEEQAE DAASDNTVVY DSVDDELNAT MKQVHESADA NATLVQPGEA
HELHSKNDFA RRASLFDLPD PSGDAVDPLA PSSEPAPHYV PASTPAPTPE TEDAEGPFDT
ISADESLTET QDAKTPEAKR RSFRLFGRND GPSDDWKGGA TPSAENREED DSEDVSAISE
DDLRNAVLSL SDDELISHDI WFVALGASDF DHAGMREFLA KHRTDIRGAF LINLDCVGAG
SLSILKNEGI GNVRRADRRM TRLLSTIATD LHIDVEQSSF DWGTTDATPA MQNSVRSVTL
MGMNEDGLPA FSRTASDVRE NVSADQCADA AALVTELIRR S
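
The protein sequence corresponds to a codon-by-codon translation of the gene sequence under translation table 11. The following is a minimal sketch of that relationship, not the method the database itself uses. It builds the codon table by hand instead of relying on a library, and it translates only the first line of the gene sequence plus the final six bases. The names BASES, AMINO, CODON_TABLE and translate are hypothetical helpers introduced for this illustration. Table 11 assigns the same amino acids to codons as the standard code and differs from it only in which codons may act as start codons.

```python
from itertools import product

# Codon order follows the usual NCBI layout: first, second and third base
# each cycle through T, C, A, G, with the third base changing fastest.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"  # codons starting with T
    "LLLLPPPPHHQQRRRR"  # codons starting with C
    "IIIMTTTTNNKKSSRR"  # codons starting with A
    "VVVVAAAADDEEGGGG"  # codons starting with G
)
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    usable = len(dna) - len(dna) % 3    # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First line of the gene sequence, copied from the record.
first_line = "ATGGCCATTA CACATGACTA TGTTGACTAT CTAGACGAGC AAATTGATAT TGCTCCTGCC"
print(translate(first_line))  # MAITHDYVDYLDEQIDIAPA

# Last six bases of the gene sequence: the final residue, then the stop codon.
print(translate("TCATAA"))    # S*
```

The first result matches the first 20 residues of the protein sequence, and the second matches its final residue followed by the stop.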