Gene Nmul_A0107 details

Gene Information

Locus tag: Nmul_A0107
Symbol:
ID: 3786374
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 113563
End bp: 115197
Gene Length: 1635 bp
Protein Length: 544 aa
Translation table: 11
GC content: 42%
IMG OID: 637810177
Product: recombinase
Protein accession: YP_410808
Protein GI: 82701242
COG category: [L] Replication, recombination and repair
COG ID: [COG1961] Site-specific recombinases, DNA invertase Pin homologs
TIGRFAM ID:
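
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: the start and end positions are 1-based and inclusive, and the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 113563
end_bp = 115197

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1635, matching "Gene Length"
print(protein_length_aa)  # 544, matching "Protein Length"
```

Under these assumptions the reported gene length (1635 bp) and protein length (544 aa) agree with the start and end coordinates.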


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGAAAAG CCTATTCATA CATTCGAATG TCCACAGATA AACAGCTCAG TGGAGATAGT 
CTTCGGAGAC AACGTGAAGC CTCAGAAAAA TACGCCAAGG CTCACGGTCT AGAGTTGATC
GAGACGTTAG ATGGGAAAGA TTTGAGAGAT ATCGGCGTGT CCGGATTCAA AGGTGCTAAC
TCTAGAAGGG GGGTTCTTTC AGTTTTTCTT GAGCATCTTA ACGATGGGAA GATTGACGCT
AATAGCATTC TCTTAATCGA AAGCTTCGAT CGGTTATCCC GTGCGGACGT CCTTGATGCA
TTCTCGTTGT TTACCAATAT TTTAAACAAG GGTATCGAGA TCATAACGCT GTCTGACAAT
CAGCGGTATA CGAGAGATTC AGTAAGGGTA AACCCTGGGC AGCTTTTTCT AACTATTGGA
ATAATGACCA GAGCAAATGA GGAATCTGTA ACTAAGTCTA AACGAGGTCT ATCAGTTTGG
AACAATAAAA GAGACAACGC GCAGAAGAAG CCGATAACAT CACGTTGTCC AGCTTGGTTA
ACTTACTCTT CAGAGAGTGA AAAATTTGAG GTAATCGAAG AGCGTGCAAG GGTAGTTAGA
CTCATTTTTG AGCTCTGCGC AAAGACAAGC GGAACATGGT CGATTTCGAG ATACCTTAAT
CGTCACAATA TCCCTGTTTT TGGTGATGCT CAATTTTGGC AGAAGTCGTA CGTGAACAAA
ATTTTATCGA ACAGAGCGGT GTTGGGGGAA TTTCAGCCTA ATTCACGGGA GAGCGGTCAA
CGTATACCAG TAGGAAAAGT GATCGAAAAT TACTATCCGG CCATAATTTC GGCTACTGAG
TTCCACTTAG CTTGGGCCGC GCGTGACAGA AGAAATAAGA CTGGCAGCGG CAGAAAGGGG
ATAAATTTTT CAAATCTATT CTCCGGTCTG GTTTATTGCG GTAACTGCGG TTCAAAAATG
AGCGTTAGAA ATCGGGGTGT ACCGCCAAAA GGTGGTAAAT GGCTTGTTTG CCAAAATCAT
TTGGCAGGGG CTGGGTGTGA AATGCGCGAA TGGAAACTCC CTCTAGTAGA AGATGCGATA
TTCAAGCATT TATACGAGGT TGATTTCAGC GAATTACTGG GCAACAAATC TGCAGGTTTT
GACATCGAGA AAGAGCTATT TGCATTACAG GTAGAGCAAA AAGAGCTTGA GTCAAAAATG
GATCAAGCTA TTGAAATGTC AGCGGCTCAA GAGTTGAATG AACAATCTCG ATTCAGGTAT
GTGAAGCTGA TCAATCAGCT TGAGAGTGAT ATTAACGAGA AAAAAAAATG TATTTCAAGC
AAAGAGAAGG AATTAGAAGT ATTCAAAAGT CAGCAAAGCC TCCTTCATAC AAAAGAATTA
AAGGAGACTC TCTTGCTGTT AGAAAAAAAT AAAGATGATT ACTACTTTCG CTCCTCTGTT
AATCAGTTGC TGACCAGAAC TATAGCTCGA ATTGATTTGA TTACAGATCA TGGAGGTCCC
TTACCTTGGG AAATTGAACC TGATGACAAG ATCGTAGGAA ATTTCCGTGC GTTGTATCCA
AAACTCAGCA GTTTATCTAT CGATGAATTA GTTCTTCAGA GGGTTTTCCT AGACTACATA
ACACTGATCT GCTAA
 
Protein sequence
MRKAYSYIRM STDKQLSGDS LRRQREASEK YAKAHGLELI ETLDGKDLRD IGVSGFKGAN 
SRRGVLSVFL EHLNDGKIDA NSILLIESFD RLSRADVLDA FSLFTNILNK GIEIITLSDN
QRYTRDSVRV NPGQLFLTIG IMTRANEESV TKSKRGLSVW NNKRDNAQKK PITSRCPAWL
TYSSESEKFE VIEERARVVR LIFELCAKTS GTWSISRYLN RHNIPVFGDA QFWQKSYVNK
ILSNRAVLGE FQPNSRESGQ RIPVGKVIEN YYPAIISATE FHLAWAARDR RNKTGSGRKG
INFSNLFSGL VYCGNCGSKM SVRNRGVPPK GGKWLVCQNH LAGAGCEMRE WKLPLVEDAI
FKHLYEVDFS ELLGNKSAGF DIEKELFALQ VEQKELESKM DQAIEMSAAQ ELNEQSRFRY
VKLINQLESD INEKKKCISS KEKELEVFKS QQSLLHTKEL KETLLLLEKN KDDYYFRSSV
NQLLTRTIAR IDLITDHGGP LPWEIEPDDK IVGNFRALYP KLSSLSIDEL VLQRVFLDYI
TLIC
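
The relationship between the two sequence blocks can be spot-checked by translating the first and last lines of the gene sequence and comparing them with the ends of the protein sequence. This is a minimal sketch, not the database's own method. It assumes that the codon-to-amino-acid assignments of translation table 11 match the standard genetic code (table 11 differs from it in the permitted start codons), and it relies on each full 60-base line ending on a codon boundary, so the last line starts in frame. The helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# First and last lines of the gene sequence, copied from this record.
first_line = "ATGCGAAAAG CCTATTCATA CATTCGAATG TCCACAGATA AACAGCTCAG TGGAGATAGT"
last_line = "ACACTGATCT GCTAA"

# Standard codon table in T, C, A, G order; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into 10-base groups."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0, ignoring any trailing partial codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


print(translate(first_line))  # MRKAYSYIRMSTDKQLSGDS
print(translate(last_line))   # TLIC*
```

The first result matches the opening 20 residues of the protein sequence, and the second matches its final four residues followed by the stop codon.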