Gene Apar_0546 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0546 
Symbol 
ID8413400 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp632060 
End bp633271 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content43% 
IMG OID645022119 
Productintegrase family protein 
Protein accessionYP_003179568 
Protein GI257784351 
COG category[L] Replication, recombination and repair 
COG ID[COG4974] Site-specific recombinase XerD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.849888 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000145945 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAATATTG CAGTCAAAAG ACATGGAACT TTCTGGCAAG CTCGTGTACG TTTTCGTGGA 
GCTGATGGAA CCATCCAAGA AAAAAGTAAA TCTCTCGGAA TTCCCTGTGC GGCTGGAAGG
GGCAAAAAAG TTGCTCGAGC AGCTGCTGAG AAATGGGTTC AAGATGCAGG GTTTGTTGAG
GTTGTCGAAC AGAATCAAGC AACAAGGCTT GATTGTTCGG CATATACGTA TTGTCTTAAT
TACTTTAAGA GCCTTGTTGC CACACAACAA ATTGAACGTC GTACTTATAC GTCTTATAAG
AATAACGTCC GATATATTGA TTTGTTCTTT GGAGAAAAGC GACTACAAGA GATTACTGTC
ACGGATGTCG AGTTATATGT GTCTTGGCTT TATGATTCTG GCTATGCAGC TAATACGGTT
AAGAAAGCAT TTAACTCCTT CCGTGCTTGT ACGCGCCATG CTGTAGCAAT TAGAGATCTG
CAATACGATC CATGTGCGGC AATTAAAGCG CCAAAAGGTT ATCTTGCACC GCCAAATCCG
CTAAACGAAC CTTCACGCAA GAAGCTTCAA GTCATGCTTT CTGCTCTTGA GCTTTCTCCG
ATGGTACTTG CGACATACCT GGCATATTAC ACAGGTATGA GACGTGAGGA GTGTTGTGGA
CTTCAATGGA AGGACGTGAA GTTTAAAGCT GAAGATGTCA CAGCACATCT CTGTCGAGCC
ATCTCATACG ATGGCGGCAA AACCTACATT AAGGGTTTAA AAAATGGTAA AGATAGAACG
GTACCTATTC CAGCTCCGCT TGTAGACATT CTTAAGCAGT GGCGTTCTAA ATACATTGAA
GATTGTATGT TGATGGGAAT TGCGTTTAGT GAAGAGATGT ACGTTCTAGG TGACTTCTCT
GGCGAATATC TTAGACCAGA GCGAGCGACT GCATGGTGGA AGAGTCACTC TGAGGAGTGG
GGTCTTCTTG GAACGCAGGG GAGAAGACCA GTCTTTCATG ATCTGCGACA CACATATGCA
ACGATTGCAG TTAGAACTAT GGACATTAAG AGCGCACAAG ACATTCTTGG ACACAGCGAT
ATTAATATGA CAATGCGTTA TGCAGATACA GACTTGGAGC AAATTCAAAA GGCAGGGAAA
ATTATTGGAG AGGCTCTTAA CGATGCTCAC AAAGACGGTG CAGAAGTACT ACAACTTAGG
CGAGCGATAT AA
 
Protein sequence
MNIAVKRHGT FWQARVRFRG ADGTIQEKSK SLGIPCAAGR GKKVARAAAE KWVQDAGFVE 
VVEQNQATRL DCSAYTYCLN YFKSLVATQQ IERRTYTSYK NNVRYIDLFF GEKRLQEITV
TDVELYVSWL YDSGYAANTV KKAFNSFRAC TRHAVAIRDL QYDPCAAIKA PKGYLAPPNP
LNEPSRKKLQ VMLSALELSP MVLATYLAYY TGMRREECCG LQWKDVKFKA EDVTAHLCRA
ISYDGGKTYI KGLKNGKDRT VPIPAPLVDI LKQWRSKYIE DCMLMGIAFS EEMYVLGDFS
GEYLRPERAT AWWKSHSEEW GLLGTQGRRP VFHDLRHTYA TIAVRTMDIK SAQDILGHSD
INMTMRYADT DLEQIQKAGK IIGEALNDAH KDGAEVLQLR RAI