Gene Nmul_A0064 details


Gene Information

Locus tag: Nmul_A0064
Symbol:
ID: 3785788
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 69640
End bp: 70902
Gene Length: 1263 bp
Protein Length: 420 aa
Translation table: 11
GC content: 55%
IMG OID: 637810133
Product: Phage integrase
Protein accession: YP_410765
Protein GI: 82701199
COG category: [L] Replication, recombination and repair
COG ID: [COG0582] Integrase
TIGRFAM ID:
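
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon. Both are assumptions that happen to agree with the values in this record, not something the record states. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the final codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information fields of this record.
start_bp, end_bp = 69640, 70902

length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 1263, matches "Gene Length"
print(protein_length(length_bp))  # 420, matches "Protein Length"
```

If either assumption were wrong (for example 0-based coordinates), the computed values would be off by one from the listed fields, so the check also acts as a quick sanity test on the record.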


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTAAGC TAACAGACAT GGAGATCCGC AACTGGATCA AGGCGGGCGA ACGGTTTGAA 
GGCCGTGCTG TAGGTGGTGG CCTATATCTG AGTTTTCGTG AAGGTTATGC CATTCCTATC
TGGCGTTTCA GATACCGTTT TTGTGGCAAG CGCCGCGTTA TGAATATTGG CAGTTACGGT
ACGCTATCCC TGGCGGATGC CAGGGATGAA GCCAAGAAGC TGTCCGCTCG CGTTGCTTTG
GGCTACGACG TGGCCGGAGA AAAACAGCAG CGCAAGGGCG AAGCCATTGC CAGGATAGAA
GCGGAGAAGA ATGCGTACAC CGTGGCGCAG CTGGCCGACG AATATTTTGA AAGGATGATT
GCAGGGCGAT GGAAGCACCC AAACATCGTA CGATCCAGGA TCGAGAAAGA CATCAAGCCC
GCGATTGGCA GCTTGAAGGT TGAGGATGTA AAGCCCAGGC ATATTGATGA TGTGCTCAAG
GCTGTAATGA AACGGGGTGC GCCTTCCATA GCGAACGATA CACTGCGCTG GCTTAAGCGC
ATGTTCAACT ATGCTATCAA GCGCCACATC ATCGAATACA ATCCCGCGGC TGCATTTGAT
CCAGGTGACG CTGGCGGCAA GGAGAAAAGC CGGACGCGCT GGTTGACCAG CGAGGAGCTG
GTCACGCTCT TTGAAGCAAT GCGGCAAGCA CCTGGTTTCA GTGTGGAGAA CGGCTTGAGC
ATCAAACTGC TATTACTGCT TGCGGTGCGA AAGGGTGAGC TGATCGGCGC CAGGTGGTCT
GAGTTCGACC TGGATAAAGC TGTCTGGTAT CTGCCTGCCG AACGCACGAA AACCGAATCT
GCCATTGACA TACCCTTGCC TCCGATTGCA GTAGAGTGGC TGCGCGAGTT GCAGCGCCTG
GCGGGTGTTA GCAAGTGGGT GTTGCCGGCT CGCAAGATGC AGGATCGGAT GATTCCACAT
ATTGCGGAAA GCACGCTGAG CGTGGCTCTG GCAAAGATCA AGCACGGCCT GGAACCCTTC
ACTATCCATG ATCTACGCCG TACTGCGCGT ACGCATTTCG AAGCCCTGGG CGTTGCCCCT
CACATTGCCG AGCGTTGCCT GAATCACAAG ATCAAGGGGA TCGAGGGCAT CTATAACCGG
CACGACTACT TTGAAGAACG CAAGGCGGCA CTGGAGGCCT GGGCGGGACT GTTGCTCCAG
ATCGAGCGGG GCGAGGCTGA TAAGGTTGTG CCGATCAGGC GTGCCGTAGC AACAAAACAA
TAA
 
Protein sequence
MAKLTDMEIR NWIKAGERFE GRAVGGGLYL SFREGYAIPI WRFRYRFCGK RRVMNIGSYG 
TLSLADARDE AKKLSARVAL GYDVAGEKQQ RKGEAIARIE AEKNAYTVAQ LADEYFERMI
AGRWKHPNIV RSRIEKDIKP AIGSLKVEDV KPRHIDDVLK AVMKRGAPSI ANDTLRWLKR
MFNYAIKRHI IEYNPAAAFD PGDAGGKEKS RTRWLTSEEL VTLFEAMRQA PGFSVENGLS
IKLLLLLAVR KGELIGARWS EFDLDKAVWY LPAERTKTES AIDIPLPPIA VEWLRELQRL
AGVSKWVLPA RKMQDRMIPH IAESTLSVAL AKIKHGLEPF TIHDLRRTAR THFEALGVAP
HIAERCLNHK IKGIEGIYNR HDYFEERKAA LEAWAGLLLQ IERGEADKVV PIRRAVATKQ
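
The gene and protein sequences above are printed in space-separated blocks of 10 characters. As a minimal sketch of how the two listings relate, the code below strips that spacing from the first line of the gene sequence and translates it codon by codon. It uses the amino acid assignments of the standard genetic code, which translation table 11 shares (the two tables differ in their permitted start codons). The helper names `clean` and `translate` are hypothetical, and only the first 60 bases are embedded to keep the example short.

```python
from itertools import product

# Amino acid assignments for codons in TCAG order (standard code; table 11
# uses the same assignments). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(block_text: str) -> str:
    """Remove the spaces and line breaks used for display, and upper-case."""
    return "".join(block_text.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from this record.
first_line = "ATGGCTAAGC TAACAGACAT GGAGATCCGC AACTGGATCA AGGCGGGCGA ACGGTTTGAA"

print(translate(clean(first_line)))  # MAKLTDMEIRNWIKAGERFE
```

The printed residues match the first 20 residues of the protein sequence listing (MAKLTDMEIR NWIKAGERFE). Passing the full cleaned gene sequence to `translate` should work the same way.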