Gene Hoch_5122 details


Gene Information

Locus tag: Hoch_5122
Symbol:
ID: 8547533
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 7059937
End bp: 7062468
Gene Length: 2532 bp
Protein Length: 843 aa
Translation table: 11
GC content: 73%
IMG OID: 646389798
Product: type IV pilus assembly PilZ
Protein accession: YP_003269503
Protein GI: 262198294
COG category:
COG ID:
TIGRFAM ID: [TIGR02266] Myxococcus xanthus paralogous domain TIGR02266
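
The Start bp, End bp, Gene Length and Protein Length fields are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 7059937
end_bp = 7062468
gene_length_bp = 2532
protein_length_aa = 843


def span_length(start, end):
    """Length of an inclusive, 1-based coordinate interval (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a CDS whose last codon is assumed to be a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 2532
print(expected_protein_length(gene_length_bp))  # 843
```

Under these assumptions both computed values agree with the lengths reported in the record.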


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0427193
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGTCCACCA ACGACAATCG CAAGTCCCAG CGCAATCCGG TCACGCTCCG CATCAAGTTC 
AAGAGCGCCA ACCTCGAGCA GTTCATCGAA CGCTACGCCG TGGATGTCAG CCAGGGCGGC
ATCTTCATTC GCACCAAAGA CCCGCTGCCG GTCGGTACGA CGATGCGCTT CGAGTTTCAG
CTCCGCGACG AGTCCCCGCT CATCACCGGT GAAGGCACCG TGGTGTGGAT CCGCGAGCAC
AAAAAGCAGA GCTCCGTGGC GCCGGGCATG GGCGTGCGCT TCGACCGCCT CACCGAGGCC
AGCCAGCGGG TGCTCGAGCA GATCTTGGCC AACAAGAACG GGCGCGCGGG GAGCGCCGCC
GAGCAGCTCG GCGAGGTGCC GACGCGCGTG GTGCCGATCG CCGACATCGC CGAGCAGGCG
CGCGCCAGCA GCGGCGATCG CGCCGGCGTA TTCGGCGCCG AGGGCGTGCC CGACGAGGCC
TTCGAGCAGG CCACCAAGGT CCACAGCCTC GAGGAGCTGG CGTCGATGAG CGCGCGCGCC
GATGAGGACA GCATCGACGA CGCGCCGCTC TCGGCCGACT TCGAAGAGGA CGATCAGACC
AAGGTCTACC CCGCGCTCAC CCAGCCATCG GGCAATCCCA GCGATGTCGC GCTCGGACAC
TCGGGTGTGG GCGCGCCGCT GCCGCCGACC CCGAGCGGCA GCTCGCAGGA TGTCGAGCTC
GGCCGCGCCG GCTACGACGA GCCGCTCGCG CCCGACGAGG CCTTCGACGC CCGCGACGTC
GAGCTGGGCC TGGCCGGACC CACCGAACCC CTGACCCAGG AACGACTGGT CGCCAGCGAC
GAGGTCGAAC TCGGACGCAT CGTTAGCCCC CGCGGCAACC GCCGCGCTCC CGCCGCCCGG
CCGGCGAGCG GCGAACGCCC GGCCATCGAC CTCGGCCCCG GCGGCTCGGA AGATCCCTTC
TCCGATGTGC CGCTGCCCGA GGCCGGCGAC TTCGACCTGG GCCTGGCCGG TCCCACCGAT
CCTCTGCCCG TCGAGCGCGG CAACCGCGCA TCGGAGCTCG AGCTGGGTGC CGCCGGCGGC
AACGATGACG AACAGGAGCG GCCCACCCAG ACCGGTGTGG CCCGGCGCAG CGGACACGTC
GCTCAGGCCG GGGCCGACGT CGACGCCGCC GAGTACAGCG ACGAGCGCGC GGACACCCGC
GAGCGCGCGG CCGCGACGCC GCCGGCCGAC GAGGCCAAGG CAGCCACCGC CGCCGCCGCG
GTCGCGGAGG CCGCAGCGGC CGCAAGCGAC GAGGCCAAGG CCGCCCCGGC CGTGTCCGAT
GCCCCGGTCG CGACCGCGCC GGCCGCCGCG GACGAGCCCC CGGCGACGAG CGACGACGAG
CCCGAGGAGC GCGACTCCCG CGTGCGCGTG GTCGCCCAGG ACAGCACCGA CGCCGTGGCT
CAGCGCAAAT CGTCGCGCGC GCCGCTGATC GCCATCTTGA TCATCGTCCT GCTCGCCGCC
GCAGCGGGCG CCTTCTTCTT CCTGAGCCGC GACAACGCGC CCGCGGGCGC CGCGCCCGAC
GCCGGCGCCG CGATCGCGGA CGACGACGTG GCCGGCGACC AGCTCATGGG TGACGAGGCC
GGCGACGAAA CCGGCGACGA GGCCGCGACC GGCAATGACG AGATCGACGA CGACGAGATC
GTGTTCGAGG AAGACGAGGC GGTCGCCGAC GACAGCGAAG AAGGCGCCGA AGACGCAGCC
GAGGACGGCG ACACCGCCGA GGGCATGGCC ACCGGCGGCA TCACCACCGA CGAGTACACC
CAGCCCGGAG GCCCGACCAG CAAAGTGAGC GTGCGCTCGC AGCCGAGCGG TGCCACCGTC
GAGCTCATCG GCGGCGCGGA GAAAGGCACG ACGCCGTTCA GCATCGAGCT GGCCGCCAAC
GCCAAGCACC GCCTGCGCGT GTCGCATCCC GGCTACCTGA GCCAGGAGAT CGAAGTCGAC
CCTTCGTCCA AAAACGAGGT GTCGGTGACC CTGGCCTCGG CGCCCTACGT GCTGCACATC
AACAGCAACC CGCCGGGCGC CATCGCCTAT ATCGACGGCC GCCGGGTCCC CGGGGTCACG
CCCAACAAGC TGGAGCTGCC GTCCGGCTGG GAGACGCGTC CGCAACACAA GATCAGCCTG
CGCCTGGCCG GCTACGACAA ACTCGATGTC TTCGCCAGCG CCGAGTTCCA CCCCGAGGAC
GGCGCCATGG TGCAATCGGT GAGCGGCGAG CTGGTCAAGA GCGCGCCCGC GCCGACGCCG
CCGCGACCCC GGCCGACGCC CAAACCGGCA CCGACGCCGG CTCCGACGCC GGCTCCGGCT
CCGGCTCCGG CTCCGACGCC GGCTCCGGGC GAGCCCAAGG ACACCGAGCC GGCTCCGGCT
CCGGCTCCGG CTCCGGCTCC GGCGCCAACT CCAACTCCGG CTCCCGAAGA GAGCGCCGAG
CCGCCGGCGA CCGAGACCGG CGACAGCGCG GCTGCCGGCG CCGGCGCCGG CGAAACCGCG
CAGACCAACT GA
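
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any calculation such as GC content. The sketch below shows one way to do that. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text):
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "TTGTCCACCA ACGACAATCG CAAGTCCCAG CGCAATCCGG TCACGCTCCG CATCAAGTTC"
seq = clean_sequence(first_line)

# Prints the number of bases and the GC percentage of this line only.
print(len(seq), round(100 * gc_fraction(seq)))
```

Running the same functions over all lines of the gene sequence would give a value that can be compared with the GC content field of the record.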
 
Protein sequence
MSTNDNRKSQ RNPVTLRIKF KSANLEQFIE RYAVDVSQGG IFIRTKDPLP VGTTMRFEFQ 
LRDESPLITG EGTVVWIREH KKQSSVAPGM GVRFDRLTEA SQRVLEQILA NKNGRAGSAA
EQLGEVPTRV VPIADIAEQA RASSGDRAGV FGAEGVPDEA FEQATKVHSL EELASMSARA
DEDSIDDAPL SADFEEDDQT KVYPALTQPS GNPSDVALGH SGVGAPLPPT PSGSSQDVEL
GRAGYDEPLA PDEAFDARDV ELGLAGPTEP LTQERLVASD EVELGRIVSP RGNRRAPAAR
PASGERPAID LGPGGSEDPF SDVPLPEAGD FDLGLAGPTD PLPVERGNRA SELELGAAGG
NDDEQERPTQ TGVARRSGHV AQAGADVDAA EYSDERADTR ERAAATPPAD EAKAATAAAA
VAEAAAAASD EAKAAPAVSD APVATAPAAA DEPPATSDDE PEERDSRVRV VAQDSTDAVA
QRKSSRAPLI AILIIVLLAA AAGAFFFLSR DNAPAGAAPD AGAAIADDDV AGDQLMGDEA
GDETGDEAAT GNDEIDDDEI VFEEDEAVAD DSEEGAEDAA EDGDTAEGMA TGGITTDEYT
QPGGPTSKVS VRSQPSGATV ELIGGAEKGT TPFSIELAAN AKHRLRVSHP GYLSQEIEVD
PSSKNEVSVT LASAPYVLHI NSNPPGAIAY IDGRRVPGVT PNKLELPSGW ETRPQHKISL
RLAGYDKLDV FASAEFHPED GAMVQSVSGE LVKSAPAPTP PRPRPTPKPA PTPAPTPAPA
PAPAPTPAPG EPKDTEPAPA PAPAPAPAPT PTPAPEESAE PPATETGDSA AAGAGAGETA
QTN
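
The gene sequence begins with TTG while the protein sequence begins with M. Under translation table 11, TTG is one of the permitted initiation codons, and an initiation codon is translated as methionine. The sketch below illustrates this with a hand-built codon table. The names `CODON_TABLE`, `START_CODONS` and `translate_cds` are hypothetical, and the code is an illustration, not the method used to produce the record.

```python
from itertools import product

# Amino acids for the 64 codons in TCAG order (the same assignments are
# used by NCBI translation tables 1 and 11; they differ in start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(seq):
    """Translate a CDS, reading a valid first codon as Met and stopping at '*'."""
    seq = "".join(seq.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for index in range(0, len(seq), 3):
        codon = seq[index:index + 3]
        aa = CODON_TABLE[codon]
        if index == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from the record.
print(translate_cds("TTGTCCACCA ACGACAATCG CAAGTCCCAG"))  # MSTNDNRKSQ
```

The output matches the first ten residues of the protein sequence. The last twelve bases of the gene (CAGACCAACT GA) translate the same way to QTN followed by the TGA stop codon.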