Gene Hoch_3191 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3191 
Symbol 
ID8545579 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp4397191 
End bp4399902 
Gene Length2712 bp 
Protein Length903 aa 
Translation table11 
GC content57% 
IMG OID646387858 
Productpentapeptide repeat protein 
Protein accessionYP_003267586 
Protein GI262196377 
COG category[S] Function unknown 
COG ID[COG1357] Uncharacterized low-complexity proteins 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0671183 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCCTTCA AGCAGAAGGA CCTCGCGAAG TTCAAGAAGG AACTCAACAG CGGGTCTGGT 
CTTGCCGGCC TGTGGGCAAA GACGAACGAT CCGGACGATG TCGCCGTGCT CCACCTAGCG
ATGGTCACGT GCGCGTTTGG CTCAGCCCTA CACCGCTACT GGCGAACGCC GGCGATGGTG
CCAGAGAAGA GCCTCGAGGC CCTCGCGAAA CACGCAGGCA TTCGACTCGC CGATGACGGC
TCAGCAGACA AAGATGCCGT GGCGCGCTTG GCAAGGCTCT GCGGCAATCC GCTTAACACA
CCGTATTACT GCGCCCTGTG GGAGGCGCTC AACGACCCGG CCACCGATCT CGACGCACCG
ACCGAACCGA GGGAGTTCGA GAGCCACTTC CGCCGCGCGT ACTCGTTGGC GACCGCGACC
GACCAGGGAC AAATCGTCCA ACGCTGGCTG CTGAGCCTGG CCGATGAGAG CGTCGAGGTG
ATGCGGCGGA TTCTGGCCGA AGACCTCGCC GCCTGGGGCG AAAGGCACGT ATTCGGCAAC
GTCCGCAAAC AATCCAAGGG CGACCCCATG CCCTTCCTGG GATTGGATGC ATCGTACGTC
GAGCCGTACG GAGAGTACGA AGACGAGTTT AAACCCATCC TCAAACTCAT CGGCGAGCTG
CTCGAGAAGC AGAAAATCGT GGTGGTCTCG GCGGATTTCG GACACGGCAA GTCGCTCACC
GCCCGAAGAT TGGCCCGCGA CACAGCGCGC GCATGGCTGG AGAGCGACAC TCCCAGCCCA
CAGAACCGCT ACCCCGTGTT CATCAAGTGC GCCCGCGATA TCCGCGACGC CTCTTACAAG
CACGACGAGG TAGCCCGGCG CGCACTGTGG GAAGCGGCCA CGGAGGCTCT CGGTGAGGAG
TCTTCGAGTG AGGAGCCGCA GTTTCAACCA CCAGACAATC AGCATGCCGC ATTGTTCATC
CTCGACGGCC TGGACGAAGT CGCATTCTCG CCCAACCAAC TCGAAGACCT ATTCCGCAGC
CTGCGCGAAA AACTAGGCAA GCAGCAACGG GCTATCATCT TTACGCGCCC GAGCACTTTC
GACGACCGCC ACGGCCGCCC CGCAGAGAAC ATCCCTCTCA TATCTCTACT CCCATTCGAC
GAGTTGCAAA TCGAAGAATG GCTGACGCGT TGGAACAACA ACCCACAGTC CGAATCCGTC
AACATCGAGG AGTTGCGCGA GCATCAACCA ATTGCCGAGC TAGCCCACAC GCCCATCCTC
TTGCTAATGA TTGCGATGAC CTGGCAGGAA AGCCTAGCCA AAGGAGAAAT GCGCCGCGGA
GTTCTATACG AGCAGTTCTT CCGCCAAATC GCACGCGGTA AATACGAGAG CGACAGCGAC
AAACACCCGG TCATCCGTAA AGCGTCCGAA CTCATCGCCG ACCGCCTCAC CGAACTCAAA
TATCTCGATG CAAAGCAGTT CAGCGGCGAA GAAAGAAGTA TAGAGGCCAT GTTGTGGTTG
ATGTCGCGCA TAGCCTGGGA GGCGCACGCC TATGCGTACG AACCAGAGTT TGCCGAAGAT
GACCTCTCTG ACGGCGATAT CGAGCAATTG CTCAAAGAAG AGTTAAATAT TCGACGTGGC
AAGTCGCCTC TTCTACAAAT GGGGTTACTG CTTGCGTTAC AGTTCGATCC ATCAGGATCA
CAAACCACTG TGCTATTTGA GCATCAGTCT TTTCGAGAAT TCCTTGTTGC TCGATACTGG
CAGTCGCAAC TTTTATTCCT CACCGATCCA GAGAGAGACA GCGTAGAGAT CGAGCGAGCA
GAAAAGATAC TCGCCCAAAC GAGTCTTCTA CAAGACGACG ATCGCGCGTT CGATGCGCTC
ATCGAAGGGT TGCAACATCT TGAAGAGCCC AAACGCACGC AGATCAAAGA CTGGGCGGCT
CAATGTCTAA AAACAAGGTC CCTGACCGTC TCGCATATTC CTGACAAGGA GGACCGAGCA
CCGATCCTTC GAGAAAGTGC TTTGGCGATT GGCAGCACAA TCTTCGAAGA TAGCGGATTA
TCCATAGGCA TCCACGAGAT CCGATATTCA GCATTCTGGC ACGACTTTCA CAATCGCATC
CTACGCATCA TTGCTCCTAA CGTAAAAAGT CCCAAAAGCA ACCTCGCCAG AGTAGACCTC
GCCAGAGCCT ATCTCGCAGG CGCCGATCTC GCAGGCGCCG ATCTCGCAGG CGCCGATCTA
TCCCTTGCCC ACCTTGAGCG GGCCAGCCTT GAGCGGGCCA ATTTCCGCTC TGCCAAACTC
CTATATTCCA ATCTCCGATA TGCCGACCTC CGGCATGCCG GCTTCGAACA AGCCAACCTC
GTACAAGCCA ACCTCATACA AGCCAACTTC GGATACGCCC GGTTCCTAGG CGCAGATCTC
CGCGGCGCCC AGCTCCTAGG CGCCAACCTA CAAGACGCAA AACTTCAAAA TGCCAACTTA
CAAGGCGCCA ACCTACAAGG CGCCAACCTA CAAGGCGCAA AACTTCAAAA TGCCAACCTA
CAAGGCGCCG ACCTACAAGG CGCCGACCTC CGAGCCGCTA ACTTGTCCGC AGCGAACTTC
CTGGGAGCGC AGTATTCGAC GGAGACCAAA TGGCCAGACG GTGTCGATCC GGAAGCGCTT
GGGTGTATTT TCGTCGATTC GTCCGAGGCT GAATCGGATG CTCTCGATGA GTACGGCGAC
GAGGAAGCGT GA
 
Protein sequence
MAFKQKDLAK FKKELNSGSG LAGLWAKTND PDDVAVLHLA MVTCAFGSAL HRYWRTPAMV 
PEKSLEALAK HAGIRLADDG SADKDAVARL ARLCGNPLNT PYYCALWEAL NDPATDLDAP
TEPREFESHF RRAYSLATAT DQGQIVQRWL LSLADESVEV MRRILAEDLA AWGERHVFGN
VRKQSKGDPM PFLGLDASYV EPYGEYEDEF KPILKLIGEL LEKQKIVVVS ADFGHGKSLT
ARRLARDTAR AWLESDTPSP QNRYPVFIKC ARDIRDASYK HDEVARRALW EAATEALGEE
SSSEEPQFQP PDNQHAALFI LDGLDEVAFS PNQLEDLFRS LREKLGKQQR AIIFTRPSTF
DDRHGRPAEN IPLISLLPFD ELQIEEWLTR WNNNPQSESV NIEELREHQP IAELAHTPIL
LLMIAMTWQE SLAKGEMRRG VLYEQFFRQI ARGKYESDSD KHPVIRKASE LIADRLTELK
YLDAKQFSGE ERSIEAMLWL MSRIAWEAHA YAYEPEFAED DLSDGDIEQL LKEELNIRRG
KSPLLQMGLL LALQFDPSGS QTTVLFEHQS FREFLVARYW QSQLLFLTDP ERDSVEIERA
EKILAQTSLL QDDDRAFDAL IEGLQHLEEP KRTQIKDWAA QCLKTRSLTV SHIPDKEDRA
PILRESALAI GSTIFEDSGL SIGIHEIRYS AFWHDFHNRI LRIIAPNVKS PKSNLARVDL
ARAYLAGADL AGADLAGADL SLAHLERASL ERANFRSAKL LYSNLRYADL RHAGFEQANL
VQANLIQANF GYARFLGADL RGAQLLGANL QDAKLQNANL QGANLQGANL QGAKLQNANL
QGADLQGADL RAANLSAANF LGAQYSTETK WPDGVDPEAL GCIFVDSSEA ESDALDEYGD
EEA