Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_1249 |
Symbol | |
ID | 4710485 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 1355204 |
End bp | 1357336 |
Gene Length | 2133 bp |
Protein Length | 710 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639855722 |
Product | ComEC/Rec2-related protein |
Protein accession | YP_001002826 |
Protein GI | 121998039 |
COG category | [R] General function prediction only |
COG ID | [COG0658] Predicted membrane metal-binding protein |
TIGRFAM ID | [TIGR00360] ComEC/Rec2-related protein [TIGR00361] DNA internalization-related competence protein ComEC/Rec2 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.435974 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGAGCAG GATCGCTTGC CTTCGCGGCC GGAATCGTAC TGGTCGTGAG CGTGTTACGG GAGCCCCCCG GGGCCTTCTC GGTGGCAGCA GTGGCCGCCT CCGGGGTCGT GTGGGGTCTG CTCGCCCCGC GCGGCTGGCG CTGGCCGGCT GCCGGGCTCC TAGGCTGCGC CTGGGCGCTG CTCAGTGCCT GCTGGCTTTT GGCCCATGAA CTTGCCCCTA GCCATGACGG CACCGACGCC TGCCTGGAAG GGCAGGTCGT TTCCGTCCCT GAGATTGAGA GTCACCGTGC TCGCTTCGAG TTCCGGCCTG ACGGCGTGCT CGATGGCGAG CTGCTGGAGC CATTGCCGCG ACGCATCCGC GTGGATTGGT ATGGGGCGAC GGAGGAGCCA GCGCCGGGGG AGCGCTGGCA GCTGTGCCTT CGCCTGCGCG CCCCCGATGG ATTCCTCAAT CCGGAGGGCT TCGATTACCA GCGGTGGCTG TTCCAACGCC GGATCGGCGC GACGGCCTAT GTGCGGGAAG CGGAGCAGGC CGAACGCCTG TCCGGCGGCT GGCGAATCGA TAGCGTGCGG ACTCGGATCG CCGACGCCAT GGCCGAGCGG CTCGGCGATT CGCGGTACCT GGGCTTGGTC CAAGGGCTGG GCGTGGCCGT GCGCGATAGG ATCGGCGATG ACCAGTGGGA GGTGCTTTCG GTCACCGGGA CGGCCCACCT GCTAGCGATC TCGGGGCTGC ATATCGGTCT GGTTGCCGGC CTCGCCGGCA CGCTGGCCGG CGGACTCTGG CGTGTGGTTC CGTCGCTGGT TCGCCGGGTT CCCGCCCTGA TTGCCGGCAC CCTGGCCGGG GCGCTGGCTG CAGCCGGTTA TGCCGCCCTG GCCGGCTTCA CCCTACCCAC TCAGCGCGCG CTGCTCATGG TGCTGGTCTT CGCCGGCGCG CTCCTGCTGC GCCGCCCGTT AAGCCCCTGG CACAGCCTCG CCGTGGCGGC TGCCGCGGTG CTCGCCCTCG ACCCCTGGGC GCCGCTGGGG GCGGGGTTCT GGCTCTCCTT CGGGGCGGTG GCGATCATCC TGGCAGCTAC AACGGGTCGC CCGATGCAGC GCGGTCCCCT ACCATGGCTG CGGATCCAGG TGCTGGTGGG GGTCGGCATG CTACCAGCGA CTGCGGTCTG GTTCGGGCAC CTGCCGCTTC TCTCGCCGCT GGCCAATCTC GTAGCCGTGC CCTGGGTCAG CGTATCGGTT GTCCCCCTGA TCCTGACCGG AGTCGCGCTC CAGCCCGTGG TGCCAGAGTG GTCCGGCGGG TTGTGGCAGG GTGCGGATGC CGCCCTCGGG GTGCTGATGC GTGTGCTCAG CGCGCTGGCC GAGTTCCTCG GTGCCGTTGA GGTGCCCACC CCATCGGTCG GAGCGGTGGC GCTGGCCGCT GTCGGTGTGG CTCTGGTGCT GGCTCCTCGG GCCGTGCCCG GGCGGTGGCT GGCTGGGCCG TTATTGGCGT TGTTGCTCCT GGGGCCCGGC AGTAGCGGAG GCGGTCCCTC CGGCCGCGTG GTGGTCTTCG ATACCGGCGG AGGTCTGACC GCTCTGGCCT GCGATGGCCG TCAAGCGGTG GTCTACGGCG GGGGTCCCGG AGGGGGGCTC GACGCGGCGA GCGTTGCTGT CGAGCCGTTT CTCGAAGCCC GGGGGCTGAC CCTCAAGGCC TGGCTGGTGC CGCGCGAGCA GGCCCCCTGG GATGGCGCCG TCGCCGAGGC GCGGCAGCGT TGGGACGATG CCGAGTGGCG GGGGGCTGGA CAGGATACGA CGGCCGAGGC GACGGAGTGG TCCCTGGGGC GGCTGACGCT GCAGGAGACG CCCCTCGGCG ACGAAGAGTG GGGCCTGGAT GTCGAGGGAG AGGCGGGTAC CGTATCGCTG CGACCGGACC TGGAGAACGA CACGAAGGGG TTGGTTGTCG GCGCCGCCGG ATGTGACGAG GACCGCCAGG CGTGCAGTGC GGCGGTTGAG TTGAAGGACG GTCGTTTGTT GCGTTCGGAC GAGCAGGGCG CGCTGACTTT GATCGAAGAC GGCGATGCGT GGCGGGTAGA CAGTGAGCTG CAGCGCAGGG GGCGGGTTTA CCATCAGGCC AGTCCGGCTG CGTTGGCGCC CGGGATGCTT TAA
|
Protein sequence | MRAGSLAFAA GIVLVVSVLR EPPGAFSVAA VAASGVVWGL LAPRGWRWPA AGLLGCAWAL LSACWLLAHE LAPSHDGTDA CLEGQVVSVP EIESHRARFE FRPDGVLDGE LLEPLPRRIR VDWYGATEEP APGERWQLCL RLRAPDGFLN PEGFDYQRWL FQRRIGATAY VREAEQAERL SGGWRIDSVR TRIADAMAER LGDSRYLGLV QGLGVAVRDR IGDDQWEVLS VTGTAHLLAI SGLHIGLVAG LAGTLAGGLW RVVPSLVRRV PALIAGTLAG ALAAAGYAAL AGFTLPTQRA LLMVLVFAGA LLLRRPLSPW HSLAVAAAAV LALDPWAPLG AGFWLSFGAV AIILAATTGR PMQRGPLPWL RIQVLVGVGM LPATAVWFGH LPLLSPLANL VAVPWVSVSV VPLILTGVAL QPVVPEWSGG LWQGADAALG VLMRVLSALA EFLGAVEVPT PSVGAVALAA VGVALVLAPR AVPGRWLAGP LLALLLLGPG SSGGGPSGRV VVFDTGGGLT ALACDGRQAV VYGGGPGGGL DAASVAVEPF LEARGLTLKA WLVPREQAPW DGAVAEARQR WDDAEWRGAG QDTTAEATEW SLGRLTLQET PLGDEEWGLD VEGEAGTVSL RPDLENDTKG LVVGAAGCDE DRQACSAAVE LKDGRLLRSD EQGALTLIED GDAWRVDSEL QRRGRVYHQA SPAALAPGML
|
| |