Gene Hoch_1902 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_1902 
Symbol 
ID8544284 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp2611667 
End bp2614309 
Gene Length2643 bp 
Protein Length880 aa 
Translation table11 
GC content75% 
IMG OID646386607 
ProductDNA internalization-related competence protein ComEC/Rec2 
Protein accessionYP_003266342 
Protein GI262195133 
COG category[R] General function prediction only 
COG ID[COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily) 
TIGRFAM ID[TIGR00360] ComEC/Rec2-related protein
[TIGR00361] DNA internalization-related competence protein ComEC/Rec2 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGCGTG TGCGGCTCCC CTACCTGGCC CTGGCCTTCG CTCTCGGTGC GGCCTCGCTG 
TGGGCCGCGA GCGCGTTCAT CCCCTGGGGC CACGATGCGC TGCTGTCGCT GCTGTGGACG
GTGTTGGCGG CGCTCGCCTG GCTCGCGCTG GTCGTCGGTC TGCCGCCACG CGCGCGCGTA
TTGCTGCTGT TCGCGCTCGC CGGAGCCGGA CGCATGGCCG GCGAGCGTCT CGCCGTGGAA
CGCCCGCCCA CCCTGCCCGC GGACGCGCGC GCCGGCGACC GCCAGGCCGA GACGCTGCGC
GGCGTCATCG TCGGGCCCAT CACCGCGCGG CACCGCTACC GCGCCTTCGT GCTGGCGCTC
GAGGGCGGCG TGCGGGTGTG GGTGAGCGCG CCAGCGGCCG AGCCAGCGCT ACTGCCCGGC
GACCGCGTGC GGGTGCGCGG ACGGCTGCGC GTGGCCCGCG GCTATCGCGT GCCGGGTGCC
CGCGACCTGC GCCGCGACAT GAGCGCGCGC GGCGCCCATT ACGCGCTTTC AGCTCACGAT
GCGACGCAGA TCGCCGTCCT GGGCGCGCGC GGCACGCCCT GGCGGGTGGC CGCCGAGGCG
CAGCGCTGGG CGGTCGCCAC CATCGCGCGC GGCCGCTCCG GGCGCGACGC AGACGCAGAC
GCAGACGCAG ACGCAGACGC AGACGCAGAC GCAGTTCGTC ACGCGACGGA GCGCGCGGCG
GCGGTGGCTG CGGCCCAGGC CATGCTGACG GGCTGGCGCG CGGGCCTTAC CCCTGCAGCC
AGCGAGCGCT TTCGCGCCGC CGGGGTAGCG CATGTGCTGG CCGTGAGCGG TCTGCATCTG
GCCGTGCTGG CCTGGACCGT GTTCGCGCTG GTTCGCCGCT TGTGGTCGGC CCTGCCCGCG
CTCGCCGGAC GCCTCGAAGC CACCCGCGTG GCCGCGCTGC TGGCGGCCCT GAGCGGCGTT
GCCTTCACCG GCCTCACCGG CGCCCAGGTC GCGACCACGC GCGCCTTGCT GGTGGTTCTG
GTGATCCTGT GCGGGATGGC GGTGTATCGC CGCGCGCGCG TCATCGACGC CCTCGGCGCC
AGCGCGCTGC TGCTGCTGCT CGAGCAGCCG CTGCTGGTGT TCGACCCGGC GTTCCAGCTC
TCGTTCGCGG CCACGGCGAC GCTAGCGCTG GCGCTCGGCC GGCGCAGCGA TTCCGACCCC
GCGGACGAAC CGGGACCAGA GGTCGAAATC CGGCGCAACC GGCACCTGTC CCGCCTGTGG
CGTTGGCTCG GAGGTCTGTG GAGCGCCTCC GCGTGGGCCG CGGCAGCGAC TGCGCCCATC
GCGGCCCTGG CCTTTGGCGC CGTGGCCACC GGCGGGCTGC TCGCCAATCT CGTGGCCGTA
CCGCTGGTCG AACTCGCCGT GGTCCCGGTG GGCATGCTCG GGCTACTGCT CGCGGCCGTG
AACCAACCCC TCGGCCAGCT CGTCCTCGAT CTCGCCGTGG GCGCCAGCGG CTGGGTCGTC
CGCGTGGCCG AGCTAGCCGC CGCTCACGCA CCCGTCGTGC ACACGCCGCC GCCGAGCGCG
CTCGAGCTGC TCGCCTGCGC CGCGCTGTGG GCCGGCGCCA TCGCCTGGCG GCGCAGGCGT
TGGCCGCGGC GCACGGCCAT CGCCGTGCTC GCCGCGGGCG CGCTGCTGCT GGCGCTGGCG
CTGGCGCTGG CGGCGTGGGG GCCGCTGGCG CGCAGCGGCC TGCGGGTGAC CTTTCTCGAC
GTCGGCCAGG GCGACGCCGC CGTGCTCGAA TTGCCGGGCG GCGCGGTTTG GCTGGTGGAT
GCCGGCGGCC TGCCCTTCGT CATCCCGCGC GAGCACGCGC GCGCGAGTCG CAACCGCCGC
GGGGCTCACG GGGACCGGGA AGACGGGGAG CGCGAGCACG CGGCCGCGTT GCCCGGCCGT
CGAGCGGTGC TGCCGTTTCT CGCCGAGCGG CGCATCGAGC GGCTCGAGCT GGTCGTGCTC
AGCCACCCGC ATCCCGACCA CTACGAGGGC CTGCGCGCGC TCGCCCGAGC GGTCGCGATC
GATGCCGTAT GGGTAGCCCG GCCCGATGCC GAGCAGCCGC ACGCGGGCGG CTACGGCGGA
CTGCTCGACG AGCTGCGCGC GCGCGGTACG CGCATCGAGC ATCCGATTCT CGACCGACCG
TACGAACACC GCGGCGTCGA ACTCACCGCG CTGGCGCCCT ACTACCTCGA TGCCCGCGCA
GCGGTCGACG GCGTCATGGG CGTCAACGAC AACTCCCTGG TGGTGCGCGT CGGCTTCGCT
GGACGCGCGC TGCTCTTCGC CGGCGATCTC GAATGGGAGG GCGAGCGAGA GATCGTCAAC
CGGCGGGGTT CGGCGCTGCG CGCCGACATC GTCAAGGTGC CACACCACGG CAGCGACACC
TCATCGACGC AGAGCTTCAT CGACGCCACC GCGCCGAGCT GGGCCATCAT CTCGTGCGGC
GCGGCCAACC GCTTCGGCTT TCCCGCGGCC TCGGTCGTGT ACCGCTGGTG GCGCAGCGGC
GCGCGCGTCC TGCGCACCGA CCGCGCCGGC GCCATCACGG TGAACATAGA CAGCGACGGC
GTCATGCGGG TCGAGACATT CGATCCCGTC ACCGTGCTTC TGCCGCCGGG GGAGCGCCCT
TGA
 
Protein sequence
MVRVRLPYLA LAFALGAASL WAASAFIPWG HDALLSLLWT VLAALAWLAL VVGLPPRARV 
LLLFALAGAG RMAGERLAVE RPPTLPADAR AGDRQAETLR GVIVGPITAR HRYRAFVLAL
EGGVRVWVSA PAAEPALLPG DRVRVRGRLR VARGYRVPGA RDLRRDMSAR GAHYALSAHD
ATQIAVLGAR GTPWRVAAEA QRWAVATIAR GRSGRDADAD ADADADADAD AVRHATERAA
AVAAAQAMLT GWRAGLTPAA SERFRAAGVA HVLAVSGLHL AVLAWTVFAL VRRLWSALPA
LAGRLEATRV AALLAALSGV AFTGLTGAQV ATTRALLVVL VILCGMAVYR RARVIDALGA
SALLLLLEQP LLVFDPAFQL SFAATATLAL ALGRRSDSDP ADEPGPEVEI RRNRHLSRLW
RWLGGLWSAS AWAAAATAPI AALAFGAVAT GGLLANLVAV PLVELAVVPV GMLGLLLAAV
NQPLGQLVLD LAVGASGWVV RVAELAAAHA PVVHTPPPSA LELLACAALW AGAIAWRRRR
WPRRTAIAVL AAGALLLALA LALAAWGPLA RSGLRVTFLD VGQGDAAVLE LPGGAVWLVD
AGGLPFVIPR EHARASRNRR GAHGDREDGE REHAAALPGR RAVLPFLAER RIERLELVVL
SHPHPDHYEG LRALARAVAI DAVWVARPDA EQPHAGGYGG LLDELRARGT RIEHPILDRP
YEHRGVELTA LAPYYLDARA AVDGVMGVND NSLVVRVGFA GRALLFAGDL EWEGEREIVN
RRGSALRADI VKVPHHGSDT SSTQSFIDAT APSWAIISCG AANRFGFPAA SVVYRWWRSG
ARVLRTDRAG AITVNIDSDG VMRVETFDPV TVLLPPGERP