Gene Apar_1075 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_1075 
Symbol 
ID8413948 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp1216906 
End bp1218147 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content47% 
IMG OID645022664 
Producthypothetical protein 
Protein accessionYP_003180094 
Protein GI257784877 
COG category[S] Function unknown 
COG ID[COG3883] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.855575 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTTAA AGAAACAGCA AAGCACGACC ATCGAGGAGC AGAGTGCTGC TCTTTTTAAT 
CGTCGTGATG CTCTTAAGAT TTTTGCCGGC ATGGGCATTG CTGCCGCTGC CGTTACTGCC
TCACTGATTA ACACGCGCCC TGCTCTTGGT GTTACACAGT CTCAGATTGA TGCTGCTCGT
AGTGACTACG ATGCTGCTCA GAAGCTGCTT GATGAGATTT CTAACGAAGT TTCTAATATG
CAGGCAAGTC TGAATGACAC CAACTCTCAG ATTGGTGCTA TTTCTGGTCA GATTGCAGAT
AAGCAGAATC AAATTGATGC TAAACAGAAA GAGATTTCTG AAAAAGAAAC TCAGATTACC
GAGAAGAGAA AAGCACTTGG CGCACGTATG TCCAGCAACT ATAAAGCTGG CCCAGCTGGC
GCTTTAGAGA TGATTCTTTC TTCTGCAAGC TTTGAGGAGC TTACTTCTAA TATTTACTAT
CTCGATAAGA TTTCTGAGTC TGATCAGAAG ATGATTGAGG AAGTTAAGAA CTTGAAGGCT
GCCCTTGAGT CAGATAAGGC TGCTCTTCAG TCCGAGAAGA CTGAGCTTGA GAATAAGAAG
ACTGAACTTG AAAATCTTCG TACTTCTCAG GAGTCTCAGC TCAATGAGAT TTCTGCTCGT
CAGGCAGACG CTGCAAACGT TGTTTCTAAT CTTGATGACA ACGTTAAAGA GCTTATTGCT
CAGCGTGATT CTGAGCTTCT TGCTGCCCAG CAGGAGGCAG AGCGTGTTGC TGCTCAACGT
GCTGCTGCGT CAAGTAATAG TGGCGGCGGT AGCAGCTATT CTGGCGGTGG TGGCGGAGGA
GGAACCTCTT CTGCTGGTTC TGGCTCTGCT GCTGCAGTTG TTAATGCTGC AAGTTATACC
GGATCTACAG GCGCAGGTTT CTGTGCTGCT TGGGTTTCTA ATGTCTTCTC AAACGCAGGC
GTTGGCACCT TCTATGGCAA TGCATGCGAT ATGTATTACT CCTGGTGTTA CTCTACCGAC
CAGAGCGCTA TTGAGCCTGG CATGATTATT GCTGTTCCTA CTCTTGGTGG CTCTGCTGCA
GCTTTGATTT ACGGTCACGT TGGTATCTAC ATTGGTGGCG GCATGGTCAG ACACTGCCTC
TCTGGTGTTG TAAGAAGCCA GAGCTTAAGC TCTTGGATTT CTCAACTTGG TGCTTTGAGT
ACTCCAAGGT GGGGCTGGCT TGGTGGCGTT GTTCTGAGCT AA
 
Protein sequence
MSLKKQQSTT IEEQSAALFN RRDALKIFAG MGIAAAAVTA SLINTRPALG VTQSQIDAAR 
SDYDAAQKLL DEISNEVSNM QASLNDTNSQ IGAISGQIAD KQNQIDAKQK EISEKETQIT
EKRKALGARM SSNYKAGPAG ALEMILSSAS FEELTSNIYY LDKISESDQK MIEEVKNLKA
ALESDKAALQ SEKTELENKK TELENLRTSQ ESQLNEISAR QADAANVVSN LDDNVKELIA
QRDSELLAAQ QEAERVAAQR AAASSNSGGG SSYSGGGGGG GTSSAGSGSA AAVVNAASYT
GSTGAGFCAA WVSNVFSNAG VGTFYGNACD MYYSWCYSTD QSAIEPGMII AVPTLGGSAA
ALIYGHVGIY IGGGMVRHCL SGVVRSQSLS SWISQLGALS TPRWGWLGGV VLS