Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hoch_1902 |
Symbol | |
ID | 8544284 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | - |
Start bp | 2611667 |
End bp | 2614309 |
Gene Length | 2643 bp |
Protein Length | 880 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 646386607 |
Product | DNA internalization-related competence protein ComEC/Rec2 |
Protein accession | YP_003266342 |
Protein GI | 262195133 |
COG category | [R] General function prediction only |
COG ID | [COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily) |
TIGRFAM ID | [TIGR00360] ComEC/Rec2-related protein [TIGR00361] DNA internalization-related competence protein ComEC/Rec2 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGCGTG TGCGGCTCCC CTACCTGGCC CTGGCCTTCG CTCTCGGTGC GGCCTCGCTG TGGGCCGCGA GCGCGTTCAT CCCCTGGGGC CACGATGCGC TGCTGTCGCT GCTGTGGACG GTGTTGGCGG CGCTCGCCTG GCTCGCGCTG GTCGTCGGTC TGCCGCCACG CGCGCGCGTA TTGCTGCTGT TCGCGCTCGC CGGAGCCGGA CGCATGGCCG GCGAGCGTCT CGCCGTGGAA CGCCCGCCCA CCCTGCCCGC GGACGCGCGC GCCGGCGACC GCCAGGCCGA GACGCTGCGC GGCGTCATCG TCGGGCCCAT CACCGCGCGG CACCGCTACC GCGCCTTCGT GCTGGCGCTC GAGGGCGGCG TGCGGGTGTG GGTGAGCGCG CCAGCGGCCG AGCCAGCGCT ACTGCCCGGC GACCGCGTGC GGGTGCGCGG ACGGCTGCGC GTGGCCCGCG GCTATCGCGT GCCGGGTGCC CGCGACCTGC GCCGCGACAT GAGCGCGCGC GGCGCCCATT ACGCGCTTTC AGCTCACGAT GCGACGCAGA TCGCCGTCCT GGGCGCGCGC GGCACGCCCT GGCGGGTGGC CGCCGAGGCG CAGCGCTGGG CGGTCGCCAC CATCGCGCGC GGCCGCTCCG GGCGCGACGC AGACGCAGAC GCAGACGCAG ACGCAGACGC AGACGCAGAC GCAGTTCGTC ACGCGACGGA GCGCGCGGCG GCGGTGGCTG CGGCCCAGGC CATGCTGACG GGCTGGCGCG CGGGCCTTAC CCCTGCAGCC AGCGAGCGCT TTCGCGCCGC CGGGGTAGCG CATGTGCTGG CCGTGAGCGG TCTGCATCTG GCCGTGCTGG CCTGGACCGT GTTCGCGCTG GTTCGCCGCT TGTGGTCGGC CCTGCCCGCG CTCGCCGGAC GCCTCGAAGC CACCCGCGTG GCCGCGCTGC TGGCGGCCCT GAGCGGCGTT GCCTTCACCG GCCTCACCGG CGCCCAGGTC GCGACCACGC GCGCCTTGCT GGTGGTTCTG GTGATCCTGT GCGGGATGGC GGTGTATCGC CGCGCGCGCG TCATCGACGC CCTCGGCGCC AGCGCGCTGC TGCTGCTGCT CGAGCAGCCG CTGCTGGTGT TCGACCCGGC GTTCCAGCTC TCGTTCGCGG CCACGGCGAC GCTAGCGCTG GCGCTCGGCC GGCGCAGCGA TTCCGACCCC GCGGACGAAC CGGGACCAGA GGTCGAAATC CGGCGCAACC GGCACCTGTC CCGCCTGTGG CGTTGGCTCG GAGGTCTGTG GAGCGCCTCC GCGTGGGCCG CGGCAGCGAC TGCGCCCATC GCGGCCCTGG CCTTTGGCGC CGTGGCCACC GGCGGGCTGC TCGCCAATCT CGTGGCCGTA CCGCTGGTCG AACTCGCCGT GGTCCCGGTG GGCATGCTCG GGCTACTGCT CGCGGCCGTG AACCAACCCC TCGGCCAGCT CGTCCTCGAT CTCGCCGTGG GCGCCAGCGG CTGGGTCGTC CGCGTGGCCG AGCTAGCCGC CGCTCACGCA CCCGTCGTGC ACACGCCGCC GCCGAGCGCG CTCGAGCTGC TCGCCTGCGC CGCGCTGTGG GCCGGCGCCA TCGCCTGGCG GCGCAGGCGT TGGCCGCGGC GCACGGCCAT CGCCGTGCTC GCCGCGGGCG CGCTGCTGCT GGCGCTGGCG CTGGCGCTGG CGGCGTGGGG GCCGCTGGCG CGCAGCGGCC TGCGGGTGAC CTTTCTCGAC GTCGGCCAGG GCGACGCCGC CGTGCTCGAA TTGCCGGGCG GCGCGGTTTG GCTGGTGGAT GCCGGCGGCC TGCCCTTCGT CATCCCGCGC GAGCACGCGC GCGCGAGTCG CAACCGCCGC GGGGCTCACG GGGACCGGGA AGACGGGGAG CGCGAGCACG CGGCCGCGTT GCCCGGCCGT CGAGCGGTGC TGCCGTTTCT CGCCGAGCGG CGCATCGAGC GGCTCGAGCT GGTCGTGCTC AGCCACCCGC ATCCCGACCA CTACGAGGGC CTGCGCGCGC TCGCCCGAGC GGTCGCGATC GATGCCGTAT GGGTAGCCCG GCCCGATGCC GAGCAGCCGC ACGCGGGCGG CTACGGCGGA CTGCTCGACG AGCTGCGCGC GCGCGGTACG CGCATCGAGC ATCCGATTCT CGACCGACCG TACGAACACC GCGGCGTCGA ACTCACCGCG CTGGCGCCCT ACTACCTCGA TGCCCGCGCA GCGGTCGACG GCGTCATGGG CGTCAACGAC AACTCCCTGG TGGTGCGCGT CGGCTTCGCT GGACGCGCGC TGCTCTTCGC CGGCGATCTC GAATGGGAGG GCGAGCGAGA GATCGTCAAC CGGCGGGGTT CGGCGCTGCG CGCCGACATC GTCAAGGTGC CACACCACGG CAGCGACACC TCATCGACGC AGAGCTTCAT CGACGCCACC GCGCCGAGCT GGGCCATCAT CTCGTGCGGC GCGGCCAACC GCTTCGGCTT TCCCGCGGCC TCGGTCGTGT ACCGCTGGTG GCGCAGCGGC GCGCGCGTCC TGCGCACCGA CCGCGCCGGC GCCATCACGG TGAACATAGA CAGCGACGGC GTCATGCGGG TCGAGACATT CGATCCCGTC ACCGTGCTTC TGCCGCCGGG GGAGCGCCCT TGA
|
Protein sequence | MVRVRLPYLA LAFALGAASL WAASAFIPWG HDALLSLLWT VLAALAWLAL VVGLPPRARV LLLFALAGAG RMAGERLAVE RPPTLPADAR AGDRQAETLR GVIVGPITAR HRYRAFVLAL EGGVRVWVSA PAAEPALLPG DRVRVRGRLR VARGYRVPGA RDLRRDMSAR GAHYALSAHD ATQIAVLGAR GTPWRVAAEA QRWAVATIAR GRSGRDADAD ADADADADAD AVRHATERAA AVAAAQAMLT GWRAGLTPAA SERFRAAGVA HVLAVSGLHL AVLAWTVFAL VRRLWSALPA LAGRLEATRV AALLAALSGV AFTGLTGAQV ATTRALLVVL VILCGMAVYR RARVIDALGA SALLLLLEQP LLVFDPAFQL SFAATATLAL ALGRRSDSDP ADEPGPEVEI RRNRHLSRLW RWLGGLWSAS AWAAAATAPI AALAFGAVAT GGLLANLVAV PLVELAVVPV GMLGLLLAAV NQPLGQLVLD LAVGASGWVV RVAELAAAHA PVVHTPPPSA LELLACAALW AGAIAWRRRR WPRRTAIAVL AAGALLLALA LALAAWGPLA RSGLRVTFLD VGQGDAAVLE LPGGAVWLVD AGGLPFVIPR EHARASRNRR GAHGDREDGE REHAAALPGR RAVLPFLAER RIERLELVVL SHPHPDHYEG LRALARAVAI DAVWVARPDA EQPHAGGYGG LLDELRARGT RIEHPILDRP YEHRGVELTA LAPYYLDARA AVDGVMGVND NSLVVRVGFA GRALLFAGDL EWEGEREIVN RRGSALRADI VKVPHHGSDT SSTQSFIDAT APSWAIISCG AANRFGFPAA SVVYRWWRSG ARVLRTDRAG AITVNIDSDG VMRVETFDPV TVLLPPGERP
|
| |