Gene Noca_1887 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_1887 
Symbol 
ID4596388 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp2012221 
End bp2014539 
Gene Length2319 bp 
Protein Length772 aa 
Translation table11 
GC content77% 
IMG OID639776485 
ProductDNA internalization-related competence protein ComEC/Rec2 
Protein accessionYP_923084 
Protein GI119716119 
COG category[R] General function prediction only 
COG ID[COG0658] Predicted membrane metal-binding protein
[COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily) 
TIGRFAM ID[TIGR00360] ComEC/Rec2-related protein
[TIGR00361] DNA internalization-related competence protein ComEC/Rec2 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCCGAGG TCCAGCGCCG CCGACCGCCG GACCTGCGGA TGCTGCTGCT CGGCGGCGTG 
GCGTGGGGCG GGGCTCTGGC AGCCCGTGCG CTCGGATGGT GGTTGCTCCC GGCGGCGTTG
GGGATCGGCC TGGTGGCCTG GCCGCTCGCC CACGGCCGCG GACGGGCGGC GGTGCGCACA
ACCGCGGCGG TCGTCGTCGT CGCCGCCGCG GTGGGTGCGG TCGCCGTGCT GCGACTCGAT
CAGGTCGCCC ACAGCCCGGT CGCGCGCCTC GCCCAGGAGG GCGCGGCAGT GCGCCTGGTC
GGCACCGTGA CCTCGGATCC GCGGCGCACC GAGGGCCGGT TCGGCGAGGT CGTGACGGTG
CGCCTCGACG TCCGCGAAGT GACCGGACGC GGAGCGACGT ACGCGCTCGT TGCCCCGGTG
CTGGTGCTGG CCGACGACGA CTGGGCGGAC CTGCCGCTGG GCGCCACGGT CACGGGCAGC
GGCCGGCTCT CGGTCGCTGA GGGCGGCGAC CTCGCGGCGG TGCTCGGTGC CCGGGGCCCG
CCGGAGGTGG TCGACGACCC GGACGTGTGG TGGGATGCCG CCGCCGCGGT CCGCGGCTCG
ATCCGCGGGG CCGTCGCCCA CCGGCCCGAC ACCCAGCGGG CGCTGGTCCC CGCGCTGGTC
GACGGCGACG ACGCCGCCGT CGACGAGGCG CTCGCCGAGG ACTTCCGGAC CACCGGGCTG
ACCCACCTGC TCGCCGTCTC GGGCACCAAC CTCACCCTGG TGGTCGGGTT CGTGCTCGTC
CTCGCCCGCT GGTGCGGGGT CCGTGGCCGC CTGCTGTATG CCGTGGGCGC GCTGGGGATC
GTCGGCTTCG TGCTCCTGGC CCGCACCGAG CCGAGCGTGC TGCGGGCCGC GGTGATGGGA
GCGGTCGCGC TGTTCGCGAT GGGTCCCGAG GGTCGGGAGC GCGGTTCCCG GGCGCTCGGC
GTCGCGGTGC TCGCGCTCCT GCTGGTCCGC CCGGATCTCG CCGCGTCGGC GGGCTTCACG
CTCTCGGTGC TCGCCACGGC CGGCATCCTG CTGCTCGCAC CGGGCTGGCG CGACGCCCTC
GCGCGCTGGC TGCCCCGCTG GGTGGCCGAG GCGATCGCGG TGCCGGCCGC GGCCCAGCTG
GCCTGCACCC CGGTCGTCGC AGCGATCTCC GGCCAGGTCA GCCTGGTCGC CGTCGCTGCC
AACCTGGCCG CTGCCCCTGT CGTGGGCCCG GCGACCGTGC TCGGGCTCGG CGGCGGCCTG
GCGGGCCTGG TGTGGGAGCC GGCCGGCCGG CTGCTCGGGA CGCTCGCCGG CTGGTGCGTC
GGCTGGCTCG TCGTGGTCGC GACCCGGGGT GCCCGCCTGC CGTCCGCGGC GCTCGGATGG
GGGACCGGGG CGGTCGCGCT GGCCCTACTG ACCCTGGTGG TGGTGGCGAT CGCGCTCGCC
GGCCCACTGC TCCTGCGCCG CAGGTCCACC GGGCTGGGGT GCTGCGCGCT GCTCGTCGCG
ACCGTGCTGG TGCGGCTGCC GACGCCGGGC TGGCCGCCCG ACGGCTGGGT CCTGGTCGCG
TGCGACGTCG GCCAGGGTGA CGCGCTGGTG CTCAACGCCG GGCCGCACGC CGCGGTCGTC
GTCGACGCCG GACCGGACCC GACCCTGGTG GACGCCTGCC TCGACCGGCT CGACATCGAC
AGCGTGCCGT TGGTGGTGCT CACCCACTTC CACGCCGACC ACGTCGACGG CCTGCCCGGG
GTGCTCGACG GTCGCACGGT CGGCGCCATC GAGACGACCC GGCTGCTCGA CCCGCCGGTG
GGGGTCACCG AGGTCGCCGA CGCGCTGGCC GGGACCGGGC TGGCGACCGG GCCCGCGCCG
TACGGTGCCA CCCGGCACGT GGGTGCGGTG TCCTTCCAGG TGCTGTGGCC ACCGGCGGAC
TCCCCGACCG TCGGCCCCGG CGACGGGAGC ACCGCCAACG AGGCCAGCGT GGTGCTGCTG
GTGGAGGTGC GCGGCGTGCG GATCCTGCTG GGCGGCGACA TCGAGCCCGA GGGCCAGGCC
GCGCTCGCAC GGGCGCTGCC GGGGCTACGG GTGGACGTGC TCAAGGTGCC CCACCACGGC
AGCCGCTACC AGGACGAGGA CTGGCTGCTG AGCCTCGGCG CGCGGGTGGC GCTCGTCTCG
GTCGGGGCCG ACAACGACTA CGGCCACCCC GCCGCGGAGA CCCTCGCGCC CCTCGAGGCG
GCCGGCGTAC GGGTGCTGCG CACCGACCGC GACGGCGACC TCGCGGTGCT CGCCGACCGC
GACGGCTCGG GCGCCGGGCT GTCGGTGGCC ACGCGTTAG
 
Protein sequence
MAEVQRRRPP DLRMLLLGGV AWGGALAARA LGWWLLPAAL GIGLVAWPLA HGRGRAAVRT 
TAAVVVVAAA VGAVAVLRLD QVAHSPVARL AQEGAAVRLV GTVTSDPRRT EGRFGEVVTV
RLDVREVTGR GATYALVAPV LVLADDDWAD LPLGATVTGS GRLSVAEGGD LAAVLGARGP
PEVVDDPDVW WDAAAAVRGS IRGAVAHRPD TQRALVPALV DGDDAAVDEA LAEDFRTTGL
THLLAVSGTN LTLVVGFVLV LARWCGVRGR LLYAVGALGI VGFVLLARTE PSVLRAAVMG
AVALFAMGPE GRERGSRALG VAVLALLLVR PDLAASAGFT LSVLATAGIL LLAPGWRDAL
ARWLPRWVAE AIAVPAAAQL ACTPVVAAIS GQVSLVAVAA NLAAAPVVGP ATVLGLGGGL
AGLVWEPAGR LLGTLAGWCV GWLVVVATRG ARLPSAALGW GTGAVALALL TLVVVAIALA
GPLLLRRRST GLGCCALLVA TVLVRLPTPG WPPDGWVLVA CDVGQGDALV LNAGPHAAVV
VDAGPDPTLV DACLDRLDID SVPLVVLTHF HADHVDGLPG VLDGRTVGAI ETTRLLDPPV
GVTEVADALA GTGLATGPAP YGATRHVGAV SFQVLWPPAD SPTVGPGDGS TANEASVVLL
VEVRGVRILL GGDIEPEGQA ALARALPGLR VDVLKVPHHG SRYQDEDWLL SLGARVALVS
VGADNDYGHP AAETLAPLEA AGVRVLRTDR DGDLAVLADR DGSGAGLSVA TR