Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_1886 |
Symbol | |
ID | 4596387 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 2011314 |
End bp | 2012216 |
Gene Length | 903 bp |
Protein Length | 300 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | 639776484 |
Product | helix-hairpin-helix repeat-containing competence protein ComEA |
Protein accession | YP_923083 |
Protein GI | 119716118 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1555] DNA uptake protein and related DNA-binding proteins |
TIGRFAM ID | [TIGR00426] competence protein ComEA helix-hairpin-helix repeat region [TIGR01259] comEA protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.528332 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCACCA CCCGACCCGT CCGTCCTCGC CCCGAGCGTC AGGAGGTCGT CGCCCGCCGG CTCGCGTTGC TGCACGCCGA GCTGGCGGGT GCCCGCCCGC CGGCCCCGCC GCCCGACCCG CCCCCTGCGC CACTGGCCGA ACCACTGGCC GAACCCCTGG CCGAACCCCT GGCCGAACTC CTGGCCGAGC CGCCCGCGGT GCCCGTCCCC GGACGGCACG CCTCCCGGCG ACCCGGACGT TCGGTCGCCG TGCTGCCTGC GACCCTGCGC GGTCGGGTGG CCCTGGGGCC GGCGCAGCTG GCGGTCGTCG CGCTGGTCGT CGCCCTCGGG CTCGGCGTGA CGAGCTGGTG GGTGGTCCGC GGGGACGCCG ATCGGCTCGA GGCGCCCGCG CTCGAGCCGA CCGGCGCGGC CCTGGTCAGC GAGGGGCCGC TCTCTGACGC CTCCCCGGTC GCCGCTGAGG CCACGGCTTC CCCTGCGACG GTCACCGTGG ACGTGACCGG GAAGGTGCGC CGGCCCGGGA TCGTCGTGCT CGACACCGGC GCCCGGGTCG TGGACGCCCT CGAGGCGGCC GGAGGCGCCC GTCGGGGCGT CGACCTCTCC GGGCTGAACC TCGCCCGGGT CCTCGTCGAC GGCGAGCAGG TCGTGGTGGG GGAGCCGGCG CCCACGCCGC TCGGGGCGGC CGCGGTGCCG ACCCCCGGGG CGCCAGGCGG GCCACTGGTC GACCTCAACA CCGCCACCCA GGCCGAGCTC GAGGCGCTGC CGGAGGTCGG CCCCGTCACG GCACAGGCGA TCCTCGCGTG GCGCGACGAG CACGGTGGCT TCACCTCGGT CGACGAGCTC CTGGAGGTCG ACGGCATCGG CGACGCGACG CTCGGGCAGC TCGCCCCGTT CGTGACGGTC TGA
|
Protein sequence | MPTTRPVRPR PERQEVVARR LALLHAELAG ARPPAPPPDP PPAPLAEPLA EPLAEPLAEL LAEPPAVPVP GRHASRRPGR SVAVLPATLR GRVALGPAQL AVVALVVALG LGVTSWWVVR GDADRLEAPA LEPTGAALVS EGPLSDASPV AAEATASPAT VTVDVTGKVR RPGIVVLDTG ARVVDALEAA GGARRGVDLS GLNLARVLVD GEQVVVGEPA PTPLGAAAVP TPGAPGGPLV DLNTATQAEL EALPEVGPVT AQAILAWRDE HGGFTSVDEL LEVDGIGDAT LGQLAPFVTV
|
| |