Gene Namu_1518 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_1518 
Symbol 
ID8447116 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp1674391 
End bp1675524 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content74% 
IMG OID645040648 
Productcompetence protein ComEA helix-hairpin-helix repeat protein 
Protein accessionYP_003200905 
Protein GI258651749 
COG category[L] Replication, recombination and repair 
COG ID[COG1555] DNA uptake protein and related DNA-binding proteins 
TIGRFAM ID[TIGR00426] competence protein ComEA helix-hairpin-helix repeat region
[TIGR01259] comEA protein 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value0.663544 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.893381 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGGCGG CGGCCTTTCC CGAGCCGACG CCGGCCGTTC CGCCCGCGGC CAAGTCTGAG 
TCGGCGCCGG CCGCGCCCGA GCCGAAGCCG CCCGAGCCGG CGGCGGTGCG GCCGGTCCCG
GATGCCGGGG CGATCGGCGG GTGGGTGCCG GAAGTCGCCG GCGCGGATGT CGAGGCTCCG
TCCGATCCGG GCGGCGGTCA GGGCGAATTC GACCCCTTGG CCCAGTTCGA CGACTTCGAC
GACTTCGACG ATTTTGACGA GTCGGACGAG TTTGGCGAGG TCCACGAGTC CGACGCAGTC
GACGAGGAAG AACTCGACCC GCCGGTGGAC GAGCGGGCCG GCGAGATCCC GGCCGAGCCT
GAGCCGACCG CCGAGCCGAT GCGGCCGCGG GCGCATCGGA TCAATGGGCG GTGGGGGCGG
TTCGCCGAGT TGTGGGTGCC TGAGCCGTTG CGCAATTCGC GGGTCGACCC GGGTCGGCGG
GGGATGATCG TGCTCGTCCT GGTGGCGGCG GTCGCCGCGG TGGTGGCCGC GGTCGGGGTG
TGGCGGGATC GTCCGGAGCC GCGTCCGGTG GAGACGTCGA TGGTCGCCGC GGCCGGGCAG
CTGACCGTAT CGTCCGGCGC CGACGCATCC GGGGCCACCG GCAGTGCTTC GGCGAGCCCG
ACGCCGGCAC CCAGCGAGAT CCTGGTGTCG GTGACCGGTC TGGTGGCCAA CCCGGGGGTG
GTGCGGTTGC CGCCGGATGC GCGGGTGGCC GATGCGATCG CCGCGGCCGG GGGGACCGGA
CCGGGGGCCA ACCTGACCGG GATGAACCTG GCCGCACGGC TGACGGACGG AGACTCGGTG
GTGGTCACCG ACACCGGCGT CGCGGCCGAG TCGGCATCGG GAGGGGCCGC TGCGGCAGCT
GCGGGGTCGG GCGGCGGATC CGGGGGCCCG GCGGCCGGGG GACTGGTGAA CCTGAACACG
GCGGACGAGG CCGCCCTGGA CACGTTGCCC GGCGTCGGGC CGGTGATGGC GCAGAACATC
ATCGCCTGGC GCAGCGAGCA CGGGAAGTTC AGCAGCGTCG AGCAACTGCA GGAGATCAGC
GGCATCGGGC CGTCCCGCTA CGCGCAGATC TCGGCCCTGG TCACGGTGGG TTGA
 
Protein sequence
MPAAAFPEPT PAVPPAAKSE SAPAAPEPKP PEPAAVRPVP DAGAIGGWVP EVAGADVEAP 
SDPGGGQGEF DPLAQFDDFD DFDDFDESDE FGEVHESDAV DEEELDPPVD ERAGEIPAEP
EPTAEPMRPR AHRINGRWGR FAELWVPEPL RNSRVDPGRR GMIVLVLVAA VAAVVAAVGV
WRDRPEPRPV ETSMVAAAGQ LTVSSGADAS GATGSASASP TPAPSEILVS VTGLVANPGV
VRLPPDARVA DAIAAAGGTG PGANLTGMNL AARLTDGDSV VVTDTGVAAE SASGGAAAAA
AGSGGGSGGP AAGGLVNLNT ADEAALDTLP GVGPVMAQNI IAWRSEHGKF SSVEQLQEIS
GIGPSRYAQI SALVTVG