Gene Nmul_A2544 details

Gene Information

Locus tag: Nmul_A2544
Symbol:
ID: 3786270
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 2912837
End bp: 2913793
Gene Length: 957 bp
Protein Length: 318 aa
Translation table: 11
GC content: 57%
IMG OID: 637812635
Product: tyrosine recombinase XerC
Protein accession: YP_413225
Protein GI: 82703659
COG category: [L] Replication, recombination and repair
COG ID: [COG4973] Site-specific recombinase XerC
TIGRFAM ID: [TIGR02224] tyrosine recombinase XerC
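
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Both are assumptions, although they agree with the values in this record. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2912837
end_bp = 2913793

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # prints "957 318"
```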


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.774585
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGAAAGG AGCAACACCC GGAATCCCTA ACCGGTCTGA GCGATCCCGG CAAGATGCCC 
GAAAAGGGGC AGGCCGAACT TGCCTCCGCC TATCTCGCTT ATCTTTCCGT CACGCGGCGA
CTGTCGCCGC TTACCTGCGA AAGTTATGGC CGCGATCTCG ATGTACTGAT GAATCTCTCG
CGGGGAATCG CGCTGGAGCA TTTGCAAATC CATCATATCC GGCGTTTTCT TGCTCAGTTG
CATGCAAACG GATTTTCCGG CAGGAGCCTG GCGAGAATGC TGTCTGCATG GCGAGGGCTC
TACAATTACC TGGCGCGACA TCATGGTTAC GCGTGTAACC CCTGCGCCGG GGTGCGGGCG
CCGAAGTCCC CCAGGAGCCT GCCTCGCACC CTGTCCCCTG ATGAGGCCTT GAAGCTGCTG
GAGTTTGATA CTCCCGATCT GGTTGCCTTA CGCGATAAAG CCATGTTCGA ACTCTGCTAT
TCTTCGGGAT TGCGGCTGGC GGAACTCGCG AACCTGAAGC CGGAGGATTT AAGTCTGGCG
GAGGGGATCG TGCGCGTTAC CGGCAAAGGG AATAAAACGC GGGATGTGCC GGTAGGAAGC
AAAGCCATGC AAGTAGTGAG GGAATGGATA AAACAACGCG CCACGCTCGC CAAGCCAGGG
GAAACAGGTC TATTCCTGTC CCGCCACGGG AGGAATATCA GTCGCCGGTC GATCGATCAG
CGTCTGAAAA TCCAGGCAGT AAAGCAAGGA ATCAGCGGAC GCATCCACCC GCACGTGCTG
CGGCATTCAT TTGCATCACA TGTACTGCAA TCCAGCGGCA ACCTGAGAGC GGTGCAGGAG
ATGCTGGGGC ATGCCAGCAT CAGCACCACG CAGGTATATA CCCATCTCGA CTTCCAGCAT
CTATCGAAGG TCTATGATGC GACCCACCCC CGCGCACGAA AGAAGAAAGA GTCTTGA
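
A GC content figure such as the one in this record is usually the share of G and C bases in the sequence. The sketch below shows one way to compute it from a sequence formatted in space-separated blocks of ten, as above. The function name `gc_content` is hypothetical, and how the database itself rounds the value is not stated in the record.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (spaces and line breaks between the 10-base blocks) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Usage: the first row of the gene sequence above (60 bases).
first_row = "ATGGGAAAGG AGCAACACCC GGAATCCCTA ACCGGTCTGA GCGATCCCGG CAAGATGCCC"
print(f"{gc_content(first_row):.1f}%")
```

Passing the full 957 bp sequence instead of the first row gives the value for the whole gene.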
 
Protein sequence
MGKEQHPESL TGLSDPGKMP EKGQAELASA YLAYLSVTRR LSPLTCESYG RDLDVLMNLS 
RGIALEHLQI HHIRRFLAQL HANGFSGRSL ARMLSAWRGL YNYLARHHGY ACNPCAGVRA
PKSPRSLPRT LSPDEALKLL EFDTPDLVAL RDKAMFELCY SSGLRLAELA NLKPEDLSLA
EGIVRVTGKG NKTRDVPVGS KAMQVVREWI KQRATLAKPG ETGLFLSRHG RNISRRSIDQ
RLKIQAVKQG ISGRIHPHVL RHSFASHVLQ SSGNLRAVQE MLGHASISTT QVYTHLDFQH
LSKVYDATHP RARKKKES
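
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below is a minimal illustration, not the database's own pipeline. It uses the standard codon assignments, which translation table 11 shares for amino acids; table 11 differs in the set of permitted start codons, which does not matter here because the gene begins with ATG. The function name `translate` is hypothetical.

```python
# Standard codon assignments, listed in T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO_ACIDS
    )
}


def translate(sequence: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Whitespace is ignored. Alternative start codons of table 11 are not
    handled, because this gene starts with ATG.
    """
    dna = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# Usage: the first row of the gene sequence yields the first 20 residues.
first_row = "ATGGGAAAGG AGCAACACCC GGAATCCCTA ACCGGTCTGA GCGATCCCGG CAAGATGCCC"
print(translate(first_row))  # prints "MGKEQHPESLTGLSDPGKMP"
```

The printed residues match the first two blocks of the protein sequence above (MGKEQHPESL TGLSDPGKMP).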