Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3646 |
Symbol | |
ID | 5735507 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4585983 |
End bp | 4587464 |
Gene Length | 1482 bp |
Protein Length | 493 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641280795 |
Product | ComEC/Rec2-related protein |
Protein accession | YP_001546410 |
Protein GI | 159900163 |
COG category | [R] General function prediction only |
COG ID | [COG0658] Predicted membrane metal-binding protein |
TIGRFAM ID | [TIGR00360] ComEC/Rec2-related protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGACTTT GTGGTTTTAC TGCTGGCTGG CTGCTTGGGC TGTGGTTAAA TGATAGGCTA CAAATAGCAT GGTATCTATA TTTTATGGCA TCAATAGCGA TTATTTTACT CATTATCTAT ATCCGAAAAT CATGGCGGGT TATGCTGATA GCAATATATG CTGGTACTAT GCTAGGGGTA GTACGAATGG CGGTCAGTCA AAATTCCTCA AATCTTGACG ATATTCGTCA ACAGATTGGC ATGACAACTC GTTTAGAAGG AGTAATTGTC GGCCAGCCGC AATGGACTCC ACAGCAACAA CGGGTGGTTT TAGCCGTCCA TGCTTATCAA GATAATCAGC AACGAGTTGC CACAACTGGC AAGATTATGT TAACTCTTCC AGCCGAGCCA CCACGCAGTA ATGGCGAACG TTTGTTAGTC AGCGGCACAA TCATCACGCC AACCGCTAGC CCAAATTTCG ATTATGCCGA CTATCTACGT CGCCGCTCGA TTTATGCCAT GCTTGAGCCT GCAACCGTTG AACAAGCCTT GCCTGCAAAA AATTCAGTCT ATCAACGCTT AATCGCGCTC AAACAGCGCT CGCAAACAAT CATCAACCAA ACGTTGCCGC AACCGCAAGC AGCAGTTTTG GTCGGAATGT TGCTGGGAGT CAAAAGCAGC GTGCCTCAAA CTGTGTGGGA TACCTTCAAT CGCACTGGGC TTTCGCATAT CTTAATCATC TCAGGCTGGA ATATTACAAT TGTGGTGGCG GCGTTATTGG GTTTGGGCAA GGCCTTAAAG CTTAGTCAAC GCCACGCCAC CATGGTGGCA ATTGGCGCAA TTGTGGTGTA TGTAGCCTTT GTGGGAGCTA GCGGGGCTGT GATTCGCGCT GCCTTGATGG GCGCAATCGT GGCACTAGCC CAGCCGCTTG GTCGCAAATC CGATGCTTGG GCGGCACTCG CGGCAGCAAC TTGGCTCATG ACCCTGATCG ATCCGCACAC CTTATGGGAT TTAGGCTTTC AACTATCAGC TTTGGCCACG GCGAGCTTGT TTGCTTGGGG CAAGCCAATT GAAGCTCAAT TGCGGCAGTG GCTGCGTTGG CGCTGGCTCG AATGGATGAT CGAGCCATTG ACCGCAACAT TGGCCGCCCA AATTTGGACA CTGCCGATCA TTCTGTATCA TTTTGGTAAT CTCTCGTTGA TTGCACCCGT CGCCAATGTA CTGATTGTGC CAGTTGTGCC GTTGATTATG GCCAGCGGCG CAATGCTAGC ATGCTTGGGG TTGTTTGGCC GTTGGTTAGC ATTGCTGGCC TTGCCAATCA CATGGGCGGC ATTAACTTGG GTCGTTGAGG CCGCCGAATG GCTGGCCGAT TTATCTTGGG CAGCGGTCGA AATTCCTAGG TTTGGCATGA GCTGGCTGGT GCTGGCGTAT GGCTTGAGCG TTGGGGCGAA GGCGTGGATG GTTAACCACG AAGAACGCGA AGGACGCGAA GTAAGAATTT AA
|
Protein sequence | MRLCGFTAGW LLGLWLNDRL QIAWYLYFMA SIAIILLIIY IRKSWRVMLI AIYAGTMLGV VRMAVSQNSS NLDDIRQQIG MTTRLEGVIV GQPQWTPQQQ RVVLAVHAYQ DNQQRVATTG KIMLTLPAEP PRSNGERLLV SGTIITPTAS PNFDYADYLR RRSIYAMLEP ATVEQALPAK NSVYQRLIAL KQRSQTIINQ TLPQPQAAVL VGMLLGVKSS VPQTVWDTFN RTGLSHILII SGWNITIVVA ALLGLGKALK LSQRHATMVA IGAIVVYVAF VGASGAVIRA ALMGAIVALA QPLGRKSDAW AALAAATWLM TLIDPHTLWD LGFQLSALAT ASLFAWGKPI EAQLRQWLRW RWLEWMIEPL TATLAAQIWT LPIILYHFGN LSLIAPVANV LIVPVVPLIM ASGAMLACLG LFGRWLALLA LPITWAALTW VVEAAEWLAD LSWAAVEIPR FGMSWLVLAY GLSVGAKAWM VNHEEREGRE VRI
|
| |