Gene Hoch_3961 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3961 
Symbol 
ID8546357 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp5460496 
End bp5463672 
Gene Length3177 bp 
Protein Length1058 aa 
Translation table11 
GC content70% 
IMG OID646388633 
ProductTetratricopeptide TPR_2 repeat protein 
Protein accessionYP_003268353 
Protein GI262197144 
COG category[S] Function unknown 
COG ID[COG1729] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.357683 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.258592 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCTGC TGCACCGACG CCATCGCCGC CGCGCCGCGG CCGCCGACAC GCGCCAGCGC 
GGCCGCTTCT CGGGCCTGCG GGCTGGGCTC GCCGGCCTGC TGCTGCTCGG CAGCACCAGC
GCCGCCTTTG CCCAGGCCGA CATCGAGTAC GACCGCGGCC CGGCGGCCGA GCTGTACATC
CGCAAGCGCC CGCCGCCGCC GGCGAGCCCG ACGCTCACGG CCGAGCTCGA GAGCATGCTC
ACGGAGAAGG AGGCCGCGGC CGACGAGAAG CGCCGCGAGG CCATCGAGCT GTTGCGCGCC
TTCATCGACA CCAAGCCCCA GGGCGAGGCC CGCGCCGAGG CGCTGTTCAA GCTGGCCGAG
CTGCTTTGGG AAGACGCGCG CGTGGGCTTT ATCGCCCGCA TGGACCAATA CGAGCGCGCG
CTCGAGGCCT GCCGCCAGGA CGACGAGGGC TGCAAGGAGC GCCCGAGCGA GCCGCGCATC
GACCTCGACG AGCCCGCCGC GCTCTACCGC CAGCTCCTGG CCGAGTTTCC GCAGTTCCGG
CGCGCTGACC TGGTGCTCTA CCTGGTCGGC TTCGCCGCCC GCGAGCAGCA GCAGTACCAG
GAGTCGCTGG AGTATTTCGG CCAGGTGGTC GAACGCTACC CGGACTCGCC GCTGTACGGC
GACGCCTGGA TGATGATCGG CGAGCACTAC TTCAGCACCG GCCAGTGGCC CGAGGCCCGC
GCGGCCTACG CCAACGTGCT GGCGCGCCCG GACTCGCCGA CCTACGACCT GGCGCTGTTC
AAGACCGCCT GGGCCGACTG GAAGCTCGGC GACCCCGATC TGGCCGCGCG CCGCTTCAAG
CAGGTGCTCG ACCTGGCGGT GGAGGCCGAG ACCTCGGGCA GCGCGGTGCA GCGCCGCCGC
CGCGCCCAGC TCCGCGACGA GGCGCTCGAG TACCTGGTGG TGGTGTTCAC CGAGGACCGC
TCGATCTCGG CTCAGGAGGT CTACGACTTC CTGGCCTCGA TCGGCGGCAC GCGCTACTCG
CGCGACGTGC TGGTGCGCGT GGCCGACGCG TATTTCGGAC AGAGCGAATA CGAGCGCGCG
GCCCAGACCT ATCGCTTCCT CATCGATATG AAGCCCACGG GTATCGAGGC CGCCGAGTAC
CAGCGCGCGG TGGTCGAGGC CTACGTGGCC GCGCTGCAGC CCGAGCAGGT CGAGGCCGAG
ATGCGGCTGC TGGTCGAAAA CTACGGGCCG GCGTCGAAGT GGGCCGAGCA GAACGCCAAG
TTCCCGACCC GCAAAGCGCG CTCCGAGCGG CTCACCGAGG CCATGGTGCG CAACACGGCC
AAGAACTACC ACGCCGAGGC CCAGGCCGCC GAGAAGCGCG ACAAGAAGCC GGATCTGGCG
CTCTACACCC AGGCCGCCGA CCTCTACCAG ACCTATCTCA CGGCCTACAC CGAGCACGAG
AACGCCGCCG AGGTGCGCTT TCTGCGCGCC GAGATCCTGT ACTTCAAGCT GGGCAAGCTC
GAGGAGGCCG GCGACGAGTA CCTGGCCGTG GCCCAGCAGA CCCCGGTCGG CAAGTACCAC
AAGGACGCGC TGCTCAAGGC CATGGACGCC TTCGAGAAGG CGCGCCCCGA GAACGCGGGC
AGCGCCGGCC AGCGCGAGCT GTCCGCGGCC GACCGCAAGT TCGCGGCCTC GGTGGACCTC
TACGCCACGC TGTTCCCGGC CGATCCCGAG CTGGTCGGCG TGATCTTCCG CAACGGCGAG
ATGTTCTACG ACTACGGCGA CTACGACGAG GCCATCAAGC GCTACGGCCT CATCGTCACC
AAGTACCCGG ACGACCAGAA CGCGGGCCCC GCCGGTGACC GCATCCTCGA GTCGCTGGCC
AAGGCCGAGG ACTACGAGAA CATCGAGGAG TGGGCGCGCA AGCTCAAGAC CGCCAAGGCC
TTCCAGAGCA AGGAGCAGCA GTCCCGCCTC GACCGGCTGA TCGTCGAGTC GATCGGCAAG
AGCGGCGAGC GCTACGCCGA GGCCGGCGAG TTCGAGAAGG CGGCGAGCTT CTATCTGCGC
ATCCCCCAGG AGTTCCCGCA GCACACCATG GCGGCGCAGG CGCAGATGAA CGCCGGCGTG
ATGTACGAGA AGGCCAAGCG GCCGCAGCGC GCCGGCCAGG CCTATCTGGC GCTGGCCGCG
TCCTATCCCG ACAGTAAAGA GGCGCCCAAG GCGGCCTTTG CGGCCGGCCA GCTCTACGAG
TCGGTGGCGT ATTTCGACCG CGCGGCCGAA GCCTACGAGG TCGTCGCCGA AACATTCCCG
CGCTCGGAGC AGAGCGCGGA CGCTTTGTTC AACGCCGGCC TGCTGCGCCA GTCGCTCGAT
CAGAACGAGC GCGCCATCGA GCACTACCAG ACCTACGCCA AGCGCTACCG CGGCAAGGCC
GACGCCGCCG AGGTCGCCTT CCGCATCGGC GTGGTGTACG AAAACGCCGA GCGCTACGAC
GACGCCGCCG ACGCCTATCG CCGCTACCTC AAGGGTCACG CGCGCAGCGG CCGGCACGTG
GTCGAGGCGC ACACGCGCGT CGGCCGCAGC GAGCTGGCGG CCGGCCGGCT CAAGCGCGCG
GGCAACGAAT TCGACGCCGC GCTCAAGGTG TTCCGCCGGC TCAAGGGCAA GCAACGCGAG
ACCGAGAAGG CGTGGGCGGC CGAGGCCCGC TACCATCAGG GCGAGCTGAT CTACCGTCGC
TTCGAGGCCA TCTCGCTCGA CGTCAAGCCG CGCCGGCTGC GGCGCACGCT CGACAGCAAG
ACCGCGCTGC TGGCCAAGGC CCAGGACGTG TACCTCGACG TGGTCGACTT CGGCGACGCG
CAGTGGGCGA CCGCGGCCCT GTTCCGCATG GGGCGCATCT ACGAGGGCTT TGCCGAGTCG
CTGCGCGACG CGCCGGTGCC CCAGGGGCTG AGCGAGGACG AGGCCGAGAT GTACCGCCAG
GAGCTCGAGA TGTACGTCAT CGAGGTCGAG GAGCAGGCCA TCGACCTGTA CGCGACCGGC
TATCAGAAGG CGCTCGAGCT GGGCGTGTAC AACACCTACA CCAGCCAGAT CCGCACCGCG
CTCGGACGCC TGGACTCGAT CGGCTACCCG CCCGCGCTCG AGGCCCGCGC GCGGGTGCGC
CTGGGCGACC GGGTGCAGCC GCCGAGCGCG GTCGAGGAGG TGGTGCGCGA TGAGTAG
 
Protein sequence
MPLLHRRHRR RAAAADTRQR GRFSGLRAGL AGLLLLGSTS AAFAQADIEY DRGPAAELYI 
RKRPPPPASP TLTAELESML TEKEAAADEK RREAIELLRA FIDTKPQGEA RAEALFKLAE
LLWEDARVGF IARMDQYERA LEACRQDDEG CKERPSEPRI DLDEPAALYR QLLAEFPQFR
RADLVLYLVG FAAREQQQYQ ESLEYFGQVV ERYPDSPLYG DAWMMIGEHY FSTGQWPEAR
AAYANVLARP DSPTYDLALF KTAWADWKLG DPDLAARRFK QVLDLAVEAE TSGSAVQRRR
RAQLRDEALE YLVVVFTEDR SISAQEVYDF LASIGGTRYS RDVLVRVADA YFGQSEYERA
AQTYRFLIDM KPTGIEAAEY QRAVVEAYVA ALQPEQVEAE MRLLVENYGP ASKWAEQNAK
FPTRKARSER LTEAMVRNTA KNYHAEAQAA EKRDKKPDLA LYTQAADLYQ TYLTAYTEHE
NAAEVRFLRA EILYFKLGKL EEAGDEYLAV AQQTPVGKYH KDALLKAMDA FEKARPENAG
SAGQRELSAA DRKFAASVDL YATLFPADPE LVGVIFRNGE MFYDYGDYDE AIKRYGLIVT
KYPDDQNAGP AGDRILESLA KAEDYENIEE WARKLKTAKA FQSKEQQSRL DRLIVESIGK
SGERYAEAGE FEKAASFYLR IPQEFPQHTM AAQAQMNAGV MYEKAKRPQR AGQAYLALAA
SYPDSKEAPK AAFAAGQLYE SVAYFDRAAE AYEVVAETFP RSEQSADALF NAGLLRQSLD
QNERAIEHYQ TYAKRYRGKA DAAEVAFRIG VVYENAERYD DAADAYRRYL KGHARSGRHV
VEAHTRVGRS ELAAGRLKRA GNEFDAALKV FRRLKGKQRE TEKAWAAEAR YHQGELIYRR
FEAISLDVKP RRLRRTLDSK TALLAKAQDV YLDVVDFGDA QWATAALFRM GRIYEGFAES
LRDAPVPQGL SEDEAEMYRQ ELEMYVIEVE EQAIDLYATG YQKALELGVY NTYTSQIRTA
LGRLDSIGYP PALEARARVR LGDRVQPPSA VEEVVRDE