Gene Nmul_A2506 details


Gene Information

Locus tag: Nmul_A2506
Symbol:
ID: 3786631
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 2863682
End bp: 2865982
Gene Length: 2301 bp
Protein Length: 766 aa
Translation table: 11
GC content: 58%
IMG OID: 637812597
Product: DNA internalization-related competence protein ComEC/Rec2
Protein accession: YP_413187
Protein GI: 82703621
COG category: [R] General function prediction only
COG ID: [COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily)
TIGRFAM ID: [TIGR00360] ComEC/Rec2-related protein
            [TIGR00361] DNA internalization-related competence protein ComEC/Rec2
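
The coordinates and lengths listed for this gene are mutually consistent, which a short sketch can check. It assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon; the function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Nmul_A2506.
length_bp = gene_length(2863682, 2865982)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

Under these assumptions the result matches the listed Gene Length (2301 bp) and Protein Length (766 aa).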


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGTGTG GAATTCTGCT GCTTTCCTGC CGTGCCCCTG TTTTTGCTCG GATCGGCAAA 
ATGCTTTGGG TTTGCCTTTT CCTGGGAGCA GGTTTTTACT GGGCGACTGC TTTTGCGCAA
TGGCGCTTGA ACGATGCGCT CCTGCCCGAG TGGGAAGGGA GAGATATCCA GGTAATTGGC
GTGGTTGCGG AGTTGCCGCA AACCGGCAGG ACCAGTGTGC GCTTTACTTT CGATGTTGAG
CAGGTGTTGA CGGAAGGTGC AATCGTGCCT GCCCGTCTTT CTCTTTCCTG GTATAAGGAG
CGCGGAAGCG GCTTTATGCC AGGACCTTCA ACACCTCCCC TCAATGCCGG GGAACGCTGG
CGGCTTACTG CACGGCTCAA GCGCCCCCAT GGAACCGCCA ATCCGCATGC TTCCGATTTC
GAGCAGTGGG CGCTGGAGCG CAACATTGGG GCCACAGGCT ATGTGCGCAA GGATGACGAA
AATATACGCC TGGAGAAAGT GGTGGGACGT CCCGCTTATC AGGTCGAGCG TCTGCGCCAG
GATATCCGCG ATAATTTCCT TGCCGCGCTG CCCTATCAGG CTTATGCCGG TACCCTCGTT
GCGCTGGCGG TGGGGGATCA ACGGGCGATT CCACACGAGC AGTGGCAGGT ATTCACGCGG
ACGGGGATAA ACCATCTCGT CAGCATATCC GGCCTTCATA TCACCATGCT GTCGGGGCTT
GTGTTTACGA TGGCATACTG GCTATGGCGC AGAAGCTATC ATTTGACGCT GCGGTTGCCA
GCCAGGAAAG CCGCAGTCAT AGCCGGGCTG GTGGCCGCGC TCGGTTATAC GCTGCTGGCC
GGTTTTGCGA TTCCGGCACA ACGCACGCTC TATATGCTCA CCGCGGTCGC CGTTACCCTA
TGGCTTGATC GTTCCATTGC CATGACCACA GTCCTGGCCT GGGCATTGCT TGCAGTGGTG
GCTCTCGATC CATGGGCAGT GCTTTCACCC GGCTTCTGGT TATCCTTTGG CGCCATCGCG
GTCATCATGC TGGTATCTGT CGGCCGCGTT GGCAGGCCGC ATTGGCTGAG TAGTTGGACA
ACGGTGCAGT GGGCCATAAC CCTGGGCCTG ATACCTCTTT TATTGGCGAT GTTTCAGCAA
ATATCGCTGG TTTCCCCAAT CGCCAATGCG GTGGCGATTC CACTGGTGAG TCTTGTCGTG
GTGCCGCTGG CCCTGCTCGC AACCCTGCCG CCCCTCGATT TTCTGTTGGT ACCGGCACAT
GCAGCGCTGG ATGGTTGCAT GACCGTAATG GAATGGCTGA GCAATGCGCC TCAGGCCGTC
TGGAGCCAGC AGGCGCCGCC TTTCTGGGCA GTAGCCGTGG GGACGGCGGG CATTTTCTGG
ACGCTGTTGC CTGGGAAATG GGGTTGGCAG CTCGGCCTTG GTACCGGTTT CCCTGCTCGT
TGGCTTGGTC TCGTCGCGTT GCTGCCGCTG TTTCTGCTGC CTCTCCCAAA ACCGGAGCAG
GGAGAGTTGT GGCTGACCGT GCTGGATGTT GGGCAGGGAC TGGCGGTCCT GGCGCGTACG
GAAAATCACA CGCTGCTCTA CGATACGGGC CCTGCCTTTA CGTCCGAAGC CGATAGCGGC
AGCCGCACCA TCGTTCCATT TCTCCGCGGG GAAGGGATAA GGCATCTCGA TGCAATGATC
GTCACACACG CCGATTCGGA TCATAGCGGC GGAGGGTTGT CCGTGCTGCA AGCAGTACCC
GTCGAATGGC TCGTCTCGTC GTTGGGCGAG GATCATCCCA TACAACAGGC TGCTCTCAAC
AAGCGCCGGT GCAAGGCGGG GCAGTCGTGG GAATGGGATG GCGTTCGCTT CGACATGCTT
CATCCCCTGG AAGAGAGCTA CAATGATCTT CGCCTCAAGA GTAACGCCTT GAGTTGCGTG
TTGAAAATTA CCACATGGCA CGGAAGCGTG CTTCTGCCGG GGGATATCGA GAAAAAATCC
GAATATCAAT TGCTCAAGCG TTATGGGGAG GCGCTTTCCT CAACGGTATT GATCGCTCCT
CATCATGGCA GCAAAACTTC TTCCACCGAG GAGTTCGTGC GGAAGGTGAA TCCGCGGTGG
GTAGTATTTA CCGTAGGTTA TCGCAACCGT TTTGGCCATC CAAAAGAGGA AGTTGCGGAG
CGGTATCGAG CGGTGGGAAG CAGATTGTTG CGCAGCGATA TCGATGGTGC CATATTGCTG
CGCTTTGGCG GTAACGACCT TGCCGTGGAG CGGTGGCGCG TTCGGCGGAT GCGTTATTGG
CACGATTCAG CAACGCTTTA A
 
Protein sequence
MACGILLLSC RAPVFARIGK MLWVCLFLGA GFYWATAFAQ WRLNDALLPE WEGRDIQVIG 
VVAELPQTGR TSVRFTFDVE QVLTEGAIVP ARLSLSWYKE RGSGFMPGPS TPPLNAGERW
RLTARLKRPH GTANPHASDF EQWALERNIG ATGYVRKDDE NIRLEKVVGR PAYQVERLRQ
DIRDNFLAAL PYQAYAGTLV ALAVGDQRAI PHEQWQVFTR TGINHLVSIS GLHITMLSGL
VFTMAYWLWR RSYHLTLRLP ARKAAVIAGL VAALGYTLLA GFAIPAQRTL YMLTAVAVTL
WLDRSIAMTT VLAWALLAVV ALDPWAVLSP GFWLSFGAIA VIMLVSVGRV GRPHWLSSWT
TVQWAITLGL IPLLLAMFQQ ISLVSPIANA VAIPLVSLVV VPLALLATLP PLDFLLVPAH
AALDGCMTVM EWLSNAPQAV WSQQAPPFWA VAVGTAGIFW TLLPGKWGWQ LGLGTGFPAR
WLGLVALLPL FLLPLPKPEQ GELWLTVLDV GQGLAVLART ENHTLLYDTG PAFTSEADSG
SRTIVPFLRG EGIRHLDAMI VTHADSDHSG GGLSVLQAVP VEWLVSSLGE DHPIQQAALN
KRRCKAGQSW EWDGVRFDML HPLEESYNDL RLKSNALSCV LKITTWHGSV LLPGDIEKKS
EYQLLKRYGE ALSSTVLIAP HHGSKTSSTE EFVRKVNPRW VVFTVGYRNR FGHPKEEVAE
RYRAVGSRLL RSDIDGAILL RFGGNDLAVE RWRVRRMRYW HDSATL