Gene Noca_1886 details


Gene Information

Locus tag: Noca_1886
Symbol:
ID: 4596387
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 2011314
End bp: 2012216
Gene Length: 903 bp
Protein Length: 300 aa
Translation table: 11
GC content: 78%
IMG OID: 639776484
Product: helix-hairpin-helix repeat-containing competence protein ComEA
Protein accession: YP_923083
Protein GI: 119716118
COG category: [L] Replication, recombination and repair
COG ID: [COG1555] DNA uptake protein and related DNA-binding proteins
TIGRFAM ID: [TIGR00426] competence protein ComEA helix-hairpin-helix repeat region
            [TIGR01259] comEA protein
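
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2011314
end_bp = 2012216

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the remainder should be zero.
codons, remainder = divmod(gene_length, 3)

# Assumption: the final codon is a stop codon and encodes no residue.
protein_length = codons - 1

print(gene_length, "bp")     # compare with the Gene Length field
print(protein_length, "aa")  # compare with the Protein Length field
```

Under these assumptions the computed values match the Gene Length (903 bp) and Protein Length (300 aa) fields.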


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.528332
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCCACCA CCCGACCCGT CCGTCCTCGC CCCGAGCGTC AGGAGGTCGT CGCCCGCCGG 
CTCGCGTTGC TGCACGCCGA GCTGGCGGGT GCCCGCCCGC CGGCCCCGCC GCCCGACCCG
CCCCCTGCGC CACTGGCCGA ACCACTGGCC GAACCCCTGG CCGAACCCCT GGCCGAACTC
CTGGCCGAGC CGCCCGCGGT GCCCGTCCCC GGACGGCACG CCTCCCGGCG ACCCGGACGT
TCGGTCGCCG TGCTGCCTGC GACCCTGCGC GGTCGGGTGG CCCTGGGGCC GGCGCAGCTG
GCGGTCGTCG CGCTGGTCGT CGCCCTCGGG CTCGGCGTGA CGAGCTGGTG GGTGGTCCGC
GGGGACGCCG ATCGGCTCGA GGCGCCCGCG CTCGAGCCGA CCGGCGCGGC CCTGGTCAGC
GAGGGGCCGC TCTCTGACGC CTCCCCGGTC GCCGCTGAGG CCACGGCTTC CCCTGCGACG
GTCACCGTGG ACGTGACCGG GAAGGTGCGC CGGCCCGGGA TCGTCGTGCT CGACACCGGC
GCCCGGGTCG TGGACGCCCT CGAGGCGGCC GGAGGCGCCC GTCGGGGCGT CGACCTCTCC
GGGCTGAACC TCGCCCGGGT CCTCGTCGAC GGCGAGCAGG TCGTGGTGGG GGAGCCGGCG
CCCACGCCGC TCGGGGCGGC CGCGGTGCCG ACCCCCGGGG CGCCAGGCGG GCCACTGGTC
GACCTCAACA CCGCCACCCA GGCCGAGCTC GAGGCGCTGC CGGAGGTCGG CCCCGTCACG
GCACAGGCGA TCCTCGCGTG GCGCGACGAG CACGGTGGCT TCACCTCGGT CGACGAGCTC
CTGGAGGTCG ACGGCATCGG CGACGCGACG CTCGGGCAGC TCGCCCCGTT CGTGACGGTC
TGA
 
Protein sequence
MPTTRPVRPR PERQEVVARR LALLHAELAG ARPPAPPPDP PPAPLAEPLA EPLAEPLAEL 
LAEPPAVPVP GRHASRRPGR SVAVLPATLR GRVALGPAQL AVVALVVALG LGVTSWWVVR
GDADRLEAPA LEPTGAALVS EGPLSDASPV AAEATASPAT VTVDVTGKVR RPGIVVLDTG
ARVVDALEAA GGARRGVDLS GLNLARVLVD GEQVVVGEPA PTPLGAAAVP TPGAPGGPLV
DLNTATQAEL EALPEVGPVT AQAILAWRDE HGGFTSVDEL LEVDGIGDAT LGQLAPFVTV
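
As a rough illustration of how the two sequences relate, the sketch below strips the 10-base block spacing, translates codon by codon, and computes a GC fraction. It assumes that translation table 11 uses the standard codon-to-amino-acid assignments for elongation (it differs from table 1 in the start codons it permits). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and not part of any database tool. Only the first line of the gene sequence is used as input here.

```python
# Standard codon assignments, built in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display; upper-case the rest."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate complete codons from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C (input must be non-empty)."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence, copied from the record above.
first_line = "ATGCCCACCA CCCGACCCGT CCGTCCTCGC CCCGAGCGTC AGGAGGTCGT CGCCCGCCGG"
print(translate(first_line))
print(round(gc_fraction(first_line), 3))
```

Applied to that first line, `translate` reproduces the first 20 residues of the protein sequence listed above (MPTTRPVRPR PERQEVVARR). Passing the full gene sequence to `gc_fraction` gives a value to compare against the GC content field.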