Gene Apar_0474 details


Gene Information

Locus tag: Apar_0474
Symbol:
ID: 8413323
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 540884
End bp: 542515
Gene Length: 1632 bp
Protein Length: 543 aa
Translation table: 11
GC content: 44%
IMG OID: 645022042
Product: DNA repair protein RecN
Protein accession: YP_003179496
Protein GI: 257784279
COG category: [L] Replication, recombination and repair
COG ID: [COG0497] ATPase involved in DNA repair
TIGRFAM ID: [TIGR00634] DNA repair protein RecN
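
The coordinates and lengths listed above are mutually consistent, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Protein length, assuming the CDS includes one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(540884, 542515)
print(length)                     # 1632
print(protein_length_aa(length))  # 543
```

Both values match the Gene Length and Protein Length fields of this record.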


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.51796
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTGATG AACTTCGCGT GCAAAATGTT GCACTCATTG ATGATGCCTC TTTTGCTCCC 
GCCTCTGGCT TAACTGTTTT AACGGGAGAG ACAGGTGCTG GTAAAACTGC TCTTTTATCC
TCTATTAAGC TCTTAGTTGG TGAGCGTGCA GATGCTTCTG CAGTCAGAGA AGGAACGGAT
GCTCTTCGGG TTGAAGCACG TTTCTTTACC TCTTCAGAAG ATCAAGAGGG AATTGTGGTC
TCTAGAAAAG TCTCTGCTGA TGGTAGAGGT AGAGTAGAGA TTGATGGTCA CATGGCATCA
GTTAAAGAGT TGGCTGGGGG TATTGGAACC TCAATTGATC TTTGCGGACA GCATGAACAC
CAAAGACTTC TTGATGTAAA AAATCATGTC AGCATGTTGG ATGCATGGAT TGGGCCAGAT
ATTCAGTCTT ACCAGGCAGA GTATAAAGAC GCTCTACATG CTTACTATGT TGCAATTGCC
GAGCTCCAAC GAGTTATTGA AGTAAGCCAG TCAAGTAACG CAAAAATTGA TGAAGCATCA
TTTCTTGTCC AAAAAATTGA TGAGGTTTCT CCTAAAGAAG GAGAGTTAGA AGATTTAGAG
GAACAACTTC CTCGGTTTGA GCATGCAGAA GCACTCCTTC AAGCTGCGGG AGGAACACAT
GAGTTACTCT CAAGTGATGA GGGTGTTATT GATTCACTTT CAGAGGCAGT TCAGATTCTT
CAAAATGCTT CAACGTTTGA TGCTACGTTA GGCTCCTATG CAGAGTCTCT TTCAAGCGCA
CTTATAGAGA TTGAAGATGT TTCCTCAGAG CTTAGAAGCT ATGTTGGGTC TCTTGATTTT
GACGAAGAAG CCCTTGAAGA GATGCAAAGC CGTATGGCTC GTCTCCAAGG TCTTATGAGA
ACGTATGGTC CTGGTATGAA AGATGTATTT AAGAAATACC AGGCGGCTCA AAATCTTCTT
GAGGTTACAA GAGATACAAA AAAGCTTGTT CGTGAAGCAC AGGCAGAGGT TGATAGAGCA
AAAGATACTC TTAGCGTAAA GGCTAACGCG CTTAAAAAGG TTCGTGTCGC TGCGGCTCCA
AAATTGTGTG ATGCAGTTAA TCTGCAAATG AGTCATTTGC AGATGGGTTC AGCTCAGATT
GAATTGAATT TTGAAGATCT TCCTTTTGAG CAGTGGAATA AAGTCGGCTC TACAAAGATT
GAGCTTATGT ATAAGCCCTC AGCTCAGATG ACAGCTCGCC CTCTTAGAAA GATTGCTTCT
GGTGGTGAGG TTTCTCGCGT TATGCTTGCG TGCAAAGTGG TCCTTGGAGA GTCTGATGAC
TGCGATACGC TTGTATTTGA TGAAGTTGAT GCTGGCGTTG GCGGAACTAC TGCTGTTGCT
CTTGCAGAGG TTTTAGCGCA GCTTGCTAAA ACACATCAGG TGATTGTGGT AACTCATTTG
CCACAAGTTG CTGTTCTGGC TTCAAAACAC TATGTAGTGT CTAAGACAGA AGAACATAAC
GCCCTTCCTA TAACCTCGCT TACAGAGGTT TCTGGTACTC AGAGAGAAGA AGAGATTGCA
CGTATGCTAT CAGGCTCAAT AACTGAGGCT TCTTTGGCTC ACGCGCATGA ACTACTCACA
GAAACTTACT AG
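
The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C bases among all bases and that the blanks and line breaks in the sequence block are layout only. The helper name `gc_percent` is hypothetical.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace in the input."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence block, pasted with its spacing intact.
first_line = "ATGCTTGATG AACTTCGCGT GCAAAATGTT GCACTCATTG ATGATGCCTC TTTTGCTCCC"
print(round(gc_percent(first_line), 1))
```

Passing the full 1632 bp sequence instead of the first line gives the value to compare against the GC content reported for this gene.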
 
Protein sequence
MLDELRVQNV ALIDDASFAP ASGLTVLTGE TGAGKTALLS SIKLLVGERA DASAVREGTD 
ALRVEARFFT SSEDQEGIVV SRKVSADGRG RVEIDGHMAS VKELAGGIGT SIDLCGQHEH
QRLLDVKNHV SMLDAWIGPD IQSYQAEYKD ALHAYYVAIA ELQRVIEVSQ SSNAKIDEAS
FLVQKIDEVS PKEGELEDLE EQLPRFEHAE ALLQAAGGTH ELLSSDEGVI DSLSEAVQIL
QNASTFDATL GSYAESLSSA LIEIEDVSSE LRSYVGSLDF DEEALEEMQS RMARLQGLMR
TYGPGMKDVF KKYQAAQNLL EVTRDTKKLV REAQAEVDRA KDTLSVKANA LKKVRVAAAP
KLCDAVNLQM SHLQMGSAQI ELNFEDLPFE QWNKVGSTKI ELMYKPSAQM TARPLRKIAS
GGEVSRVMLA CKVVLGESDD CDTLVFDEVD AGVGGTTAVA LAEVLAQLAK THQVIVVTHL
PQVAVLASKH YVVSKTEEHN ALPITSLTEV SGTQREEEIA RMLSGSITEA SLAHAHELLT
ETY
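
The protein sequence corresponds to the gene sequence translated with translation table 11. A minimal pure-Python sketch of that translation follows. It assumes the amino acid assignments of table 11 (identical to the standard code for all 64 codons) and does not handle the alternative start codons that table 11 allows, which is sufficient here because this gene starts with ATG. The function name `translate_cds` is hypothetical.

```python
from itertools import product

# Codon order follows the usual TCAG layout; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(sequence: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate_cds("ATGCTTGATG AACTTCGCGT GCAAAATGTT"))  # MLDELRVQNV
print(translate_cds("GAAACTTACT AG"))                     # ETY
```

The two outputs match the first ten and the last three residues of the protein sequence shown above.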