Gene Nmul_A1271 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1271 
SymboluvrC 
ID3784286 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1461466 
End bp1463286 
Gene Length1821 bp 
Protein Length606 aa 
Translation table11 
GC content55% 
IMG OID637811356 
Productexcinuclease ABC subunit C 
Protein accessionYP_411966 
Protein GI82702400 
COG category[L] Replication, recombination and repair 
COG ID[COG0322] Nuclease subunit of the excinuclease complex 
TIGRFAM ID[TIGR00194] excinuclease ABC, C subunit 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCAGGCAG AACCGGCTTT CGATGCCAAG GCGTTTTGTG CGGGCCTTCC ATCCCAGCCT 
GGAGTCTATC GCATGATGAA CTCAGCCGGG CAGGTGATTT ATGTCGGCAA GGCGATTGAT
CTCAAGAAGC GTGTTTCTTC CTATTTTCAG AAGAATGGCC TGGCTCCCCG TACGCAACTC
ATGGTTTCCC AGATCGCGGG AATCGAGACC ACCGTTACGC GATCCGAAGC AGAGGCCCTG
TTACTCGAAA ACAACTTAAT AAAGAGCTTA AATCCTCGCT ATAACATACT ATTCAGGGAT
GACAAATCAT ATCCCTACGT GATCTTGAGC GGCCACAGGT TTCCCCGGCT TGGATTTCAT
CGCGGTCCGC TCGACAAGAA ACATCATTAT TTTGGTCCCT TCCCGAATGC GGGAATGGTG
CGAGAGAGTA TCCAGTTGCT GCAAAAGGTA TTCCGGATTC GCACTTGCGA AGACAGTGTT
TTCAGTAATC GTACACGCCC CTGCCTGCTC TACCAGATCA AACGCTGTAG CGGACCCTGC
GTCGATCTGG TCAGCGAAGA AGTTTACGCG GAAGACGCAA GGGATGCGGA GCTATTTCTC
CAGGGCAAGC AAACGGAGGT TCTGAAAAGC ATTACGAGAA AAATGCATGA GGCCGCCGAG
GAACAGGAAT ACGAGCAAGC GGCGCTGTTT CGCGACCAGA TTCAGTCTCT GCGAAAGATT
TGCGAAAGGC AATTCGTGGA TAGCGGGCGA GCGCTGGATG CCGATATCGT CGCCTGCGTG
GCAGAGAATA ACGGCGGCGG ACGGGTGTGC GTCAACCTTG CCATGGTCAG GGGAGGACGC
CACCTCGGGG ACAAGAGTTT TTTTCCGCAA AACGCGGAAG GATACGATCT CGCTACGGTG
GCGGAAGCAT TCCTGGCCCA GCACTACCTC AATCGCAGCA TCCCAGATCT GATCATCGTG
GGCGAGAGAG TTCCGCGGGA ATCTCTCCAG GCTCTGCTCA CTCAGCAGGC TGGCCATAAA
GTGATCATCA ATGTGAATCC GATAGGTAGC CGGCGTGTTT GGCTGGAAAT GGCGACAGAA
AATGCCGCCC TTGCCCTGGA ACAAATGCTG GGCCGCCAGG CGAGCCAGGA GGAGCGGCTG
CTAGCCTTGC AGCAGGCGCT GGATATGACC GGGTTGAGCC GGATCGAATG TTTCGATATC
AGTCATACAA TGGGCGAAGC CACCATCGCT TCCTGCGTGG TTTATGACAA CTTCGGCATG
CGCAATAGCG AGTACCGCCG CTACAATATT ACTGATATCA CGCCGGGAGA CGATTATGCG
GCCATGCGCG ATGTTCTGTC ACGGCGCTAC CATAAGATTG CGGAAGGCGA GGGAAATCTG
CCTGATCTGA TCCTGATCGA CGGGGGAAGA GGGCAGATCA ACGCTGCTCT CGAGGTCATG
GTAGAGCTGG GGTTGAATGA TGCCAACCTG GTAGGCGTGG CAAAAGGCGA GGAGCGCAAG
CCGGGACTGG AGCAATTGAT TTTCCCAGGG GTGAAAAAAC CACTACAATT ATCAAAGGAT
CATCCCGGAT TGCATCTCAT CCAGCAGATT CGGGATGAAG CGCATCGCTT TGCAATTTAC
GGTCATCGCG CAAAACTCGG CAAGGCCCGC GTCAGTTCAA GCCTGGAGCA GATCGCCGGT
ATCGGCGCCA AGCGCCGGCA AAGTTTGCTG GCAAGGTTTG GCGGCCTGAA AGGCGTGCGC
ACTGCGAGCA TCGAAGAATT GCAGCAAGCT GACGGCATCA GCCGCGCGCT CGCAGAGAAA
ATTTACAGGG AACTGCATTG A
 
Protein sequence
MQAEPAFDAK AFCAGLPSQP GVYRMMNSAG QVIYVGKAID LKKRVSSYFQ KNGLAPRTQL 
MVSQIAGIET TVTRSEAEAL LLENNLIKSL NPRYNILFRD DKSYPYVILS GHRFPRLGFH
RGPLDKKHHY FGPFPNAGMV RESIQLLQKV FRIRTCEDSV FSNRTRPCLL YQIKRCSGPC
VDLVSEEVYA EDARDAELFL QGKQTEVLKS ITRKMHEAAE EQEYEQAALF RDQIQSLRKI
CERQFVDSGR ALDADIVACV AENNGGGRVC VNLAMVRGGR HLGDKSFFPQ NAEGYDLATV
AEAFLAQHYL NRSIPDLIIV GERVPRESLQ ALLTQQAGHK VIINVNPIGS RRVWLEMATE
NAALALEQML GRQASQEERL LALQQALDMT GLSRIECFDI SHTMGEATIA SCVVYDNFGM
RNSEYRRYNI TDITPGDDYA AMRDVLSRRY HKIAEGEGNL PDLILIDGGR GQINAALEVM
VELGLNDANL VGVAKGEERK PGLEQLIFPG VKKPLQLSKD HPGLHLIQQI RDEAHRFAIY
GHRAKLGKAR VSSSLEQIAG IGAKRRQSLL ARFGGLKGVR TASIEELQQA DGISRALAEK
IYRELH