Gene Apar_0567 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0567 
Symbol 
ID8413421 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp642899 
End bp644074 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content44% 
IMG OID645022139 
Productintegrase family protein 
Protein accessionYP_003179588 
Protein GI257784371 
COG category[L] Replication, recombination and repair 
COG ID[COG4974] Site-specific recombinase XerD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.563798 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00485057 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAACCGCT CAATAAAAGT TCGATTGAAT TCGAACGGCA TTTGGTGCTG TAGGCTTTAC 
TTAGGAAGAA ATCTTAACGG CAAAATCATT CAGCCTTATG CAAGCTTTCC TACGGCAAAG
ACGCAGAAAG AAGCTGAAGA ATTAGCCACT ATGTGGGCGT CTCATATTAC GTCTGACGGC
AAAGTTAAAA GCACACAGCT TACCGATTTA CTTCTCGAGT ATGTTTCTAT TAAACGCAGG
AATGGCGCGA GCCCTAACAC TACAAGGCAG CATGAAGGCT TCATTAGAAA CCATATCAAT
GGAAGGCTTG GTAAAGAGGA TGTAAGAAGT ATTACATCCT CTTTATTCAC TTCATTTGAG
CAGGATTTAT TGAAGAAGGG TTTGTCTCGA AACAGTGTAA TTAACCTGCA TCAATTCTTG
AGAGGTGCAT ACAATTACTT TGTTTCAGCT GGCATATGCG ACTATAACCC GCTTATTAAC
GTGGCTAAGC CGTCCAGGGA AGTCCATGAA GCTGTATCCA TTGAAGAATG GGGTTTTGCT
GGGATAAGTA CCCTTATTAA TTCCAGGATT ACTACAGCCA TTCAAGAGGA TGAGTTTAAT
TCCCGTGTTG TTTGTGCCTT TGCTGCTTGG CTGTCTTTAG TAACTGGTAT GCGCTGCGGT
GAAGTTTGTG CTATTAGATA CAGTGATGTA AACATGCTAT ATAAGCATAT CCACGTATCT
GGTACCGTCA TTGAAGAGTC TTACAGAAAG CCTTATAGAC GAGAATCAAC CAAGGGTAAG
AGATCTAGAA ACATAGCCAT TACAGACTCG GACATCAGTT TTATTAGTGA CTACATGAAG
CTTCAGCAAG CTCATATTGC CTTTGTGGAG TCTTCTACAC CGTTAATTAG CCTTGATGGC
TCTTACATGC GTCCAACGAG CGTTTCGAGG TCATTTACAC GCATGAGACG CACTCTCCAG
CTACCTCAAG GCATTACCTT CCACTCACTC AGACATACTC ACGCGTCTTG GTGTTTGGCA
AGTGGCGTTG ACTTAAAGAC TCTTTCAGAG CGTCTTGGCC ACGCAGACCC AGCAACAACT
TTGAGGATTT ATTCTCATTT GCTGCCTGGA CGTGACCGGG GAGCGGCAGA AGCGTTTGGA
GACGCGCTTA GGACCATTGA ACAAGGAGCG TTCTAA
 
Protein sequence
MNRSIKVRLN SNGIWCCRLY LGRNLNGKII QPYASFPTAK TQKEAEELAT MWASHITSDG 
KVKSTQLTDL LLEYVSIKRR NGASPNTTRQ HEGFIRNHIN GRLGKEDVRS ITSSLFTSFE
QDLLKKGLSR NSVINLHQFL RGAYNYFVSA GICDYNPLIN VAKPSREVHE AVSIEEWGFA
GISTLINSRI TTAIQEDEFN SRVVCAFAAW LSLVTGMRCG EVCAIRYSDV NMLYKHIHVS
GTVIEESYRK PYRRESTKGK RSRNIAITDS DISFISDYMK LQQAHIAFVE SSTPLISLDG
SYMRPTSVSR SFTRMRRTLQ LPQGITFHSL RHTHASWCLA SGVDLKTLSE RLGHADPATT
LRIYSHLLPG RDRGAAEAFG DALRTIEQGA F