Gene Hoch_3508 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3508 
Symbol 
ID8545897 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp4834396 
End bp4837098 
Gene Length2703 bp 
Protein Length900 aa 
Translation table11 
GC content66% 
IMG OID646388176 
Productpentapeptide repeat protein 
Protein accessionYP_003267903 
Protein GI262196694 
COG category[S] Function unknown 
COG ID[COG1357] Uncharacterized low-complexity proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.432343 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGACCA CGCCCGACTT GCTGATTCAA AAGCCCGTCT CGGTCCTGCG CAAACCGCTC 
AAGCTCAAAG ACGGACGCGG CCTGGTCCGC TCGCTGGTCA GCGTCGCGCT CGCGGCGGGA
ACGCTCCAGG CCGGCAAACT CGCCAGCAGC GTCGCCGACA TGGGCGCCTC GCTCAGCCTC
GATACGCCAC CGGGCGAACG CGCGGGCGCC TTGGTGTTGC GGGCGCTCGG TGTGGCGTTG
GCCGACGTCG TCGCCGAACA CCGGGCCGAG CTGAGCGAGC AGGCGCTCGC AGGCGCGCAA
CTCGACCACC TCGACGCCAT GATCGACGGC CTCGAGTTGT CGATCTCACG CGACTTCCTG
GAGCGACCTG AGTCGCTGCC CTTGCCGGAG CGTGTGGCTC CGCTGCTCGC CGAGTGGTTC
GAGACGCAGG GCCTATCGAT GCGTCACGAA GCGCTCACGC AGCGCCTGCG CAGGTATTTC
GTGTACTCGC TCATCGCCGA GTGGCGCGCG CGCCCAAACG ACTACGAGCC GCTGCGCAAA
GCGCTCGATA CGCCGCTGAG CGGCGCAGGG AGGCGTGTTC AGGATTGGCG GCGCTACGGC
GCCTGGCTGG CCCGCCAGGT CGAGGCGCCC ATGTTCGGCG AGCACTTCGG CCTGGCCCGC
GTGTACGTGC CGCTGCGCGG CTACTTCAGC CCCGCGCAAG AGCGCGACCC GCGTGACGTG
TTCGAGCAAC GCCGCGAGGA GGACAAACCC AAGGTGGTCG CCATGCACCA GCACGTGAAC
GCGTGGCTGA GCGCGCGCGA CCCGCGCGAC GCGCTGCGCC TGCTCAGCGG CGGCCCGGGC
AGCGGCAAAT CCTCGTTTGC ACGCATGCTC GCGGCCGAGT TGGCGGCGAC GCGTCGGGTG
CTGCTGGTGC CGCTGTTCGA ACTCGACCTA AAAGACGATC TCGCCAGCGC CGTGCACGCG
TATCTGCGCC GCAAGTCCCT GTTTGATGAC GAGGTGTTGG CGCCCGACGC GATCGAGGAA
CCGCTCGTGC TGTTGTTCGA CGGCCTGGAC GAACTCAGCG TACGCGGGGC GCTCGGACGC
GAGGCGGCGC GGGAATTCGT CCGCCAGGTC GAACGACTGC TGCGCGACCG CAACCACGAC
GATTGCCGTG TGCAGGTCCT CATCACCGGC CGCGACCTCG CCATCCAGGG CGCTGACGAC
GAACTGCACG CTCCTCACCG CATCTTGCAT TTGCTGCCGT ACTATCCCAA TTCAAGCGAG
AGGGAACGCT TCGATGATCC CGAGGGGCTG CTCGCGCACG ATCAGCGCGA CGCGTGGTGG
GCTAAGTACA GCGCTCTGCT CGGACGCGAC GAGCAAGCGG CTTTCCCCGT CGAACTCGCC
CGTCCGGCCC TGGTCCACCT GTCGGCCGAG CCGCTGCTCC TCTACCTACT CGCCTTCAGC
TACTGCGCGG GCAAACTCGA CCTCTCCGAG GCATCGCTCG ATGTCAGCAA GGTCTACGAC
GGGCTGCTGA GAGGTGTGTA CCAACGCGGC TACGAGCAAC GACCACACAA GGCCGCCATG
AAGTCCTTCG AGGATTTCAC GGCTGTGCTC GAAGAGATCG GGCTCGCCGC CTGGCACGGC
GAGGGCCGCA CCGCGACGCT CTCGACCATC TGCCATTACT GCGACCAGAA CCCCCGCCTG
AAACGTCTCT TCGCCCAGTT CGAAGACGAT GCGCGGGCCG GCGTCGGCAG CCTCTTGCTC
GCGTTCTATT TCCGCCAGAG GGACGCCGCG CTCGACCCTA CCTTTGAATT CACACACCTG
AGCTTCGGCG AGTATCTGGT CGCGCGTCGC TTGGTGCGCG CGCTTGAGCG ATTGTGCAGC
AAGCTCGAGC AAGGCGATGA AGGCAGCGAA GACGGCTGGC GCGAACAAGA CGCCCTACTT
TCATGGGCCG AATTCTGCGG CCCAACCCTG ATGGAACCGA ACCTGGCCGA ACTGTTCAGC
GCCGCGATTC ACGCTTGTCC GGTTGAGACC GCACGGGCGT GGCAGCACGT GCTGTGCCGC
TGCATCGGCT TCGTCATGCG CAACGGCATG CCCATGGAGA AGCTGAATCC TCGCCCGAGC
TTTCGCGAGG AAGTGCAGCA CGCCAACCAC GCCGAGATCG CGTTGTTCGT GGCGCTCAAC
GCATGCGCGA CCAAGTCCAG GAAACTAAGC AAGAGCGACT GGCCCACCCC GGCGAGTTTC
AGGGCGTGGC TTGCACAAAA AACCGAATCG AGCACTGGAC ACCCGCCGTT TGTGCTCTCG
CGAGCGTTGT CGTTTCTCGA ACTTCCTGAG GCCAACCTCC AACGCGCCAA CCTCCAACGC
GCCAACCTCC AACGCGCCAA CCTCCGAGAC GCTAACCTCC GAGACGCTAA CCTCCGAGAC
GCTAACCTCC AACACGCTAA CCTCCGGGGC GCCGACCTCC GAGGCGCCAA CCTTCGAAGC
GCCAACCTCC GAGGCGCCAA CCTCCGAGGC TCCAATCTCC AACACATCAA CCTCCAACAC
GCTAGCCTGA TTAGCGCCGA CCTCCGAGGC GCCGACCTCC GAGGCGCCAA CGTCCGAGGC
GCCAATCTCC GGATAACCAA CCTTCGCGGT GCCGATCTCA CCGGCAGCCA CTACAGTAAA
ACCTCTACGC AGTGGCCCGA TGGCTTCGCC CCCGTCGCTG CTGGCTGCAC CCTCATCGAC
TGA
 
Protein sequence
MATTPDLLIQ KPVSVLRKPL KLKDGRGLVR SLVSVALAAG TLQAGKLASS VADMGASLSL 
DTPPGERAGA LVLRALGVAL ADVVAEHRAE LSEQALAGAQ LDHLDAMIDG LELSISRDFL
ERPESLPLPE RVAPLLAEWF ETQGLSMRHE ALTQRLRRYF VYSLIAEWRA RPNDYEPLRK
ALDTPLSGAG RRVQDWRRYG AWLARQVEAP MFGEHFGLAR VYVPLRGYFS PAQERDPRDV
FEQRREEDKP KVVAMHQHVN AWLSARDPRD ALRLLSGGPG SGKSSFARML AAELAATRRV
LLVPLFELDL KDDLASAVHA YLRRKSLFDD EVLAPDAIEE PLVLLFDGLD ELSVRGALGR
EAAREFVRQV ERLLRDRNHD DCRVQVLITG RDLAIQGADD ELHAPHRILH LLPYYPNSSE
RERFDDPEGL LAHDQRDAWW AKYSALLGRD EQAAFPVELA RPALVHLSAE PLLLYLLAFS
YCAGKLDLSE ASLDVSKVYD GLLRGVYQRG YEQRPHKAAM KSFEDFTAVL EEIGLAAWHG
EGRTATLSTI CHYCDQNPRL KRLFAQFEDD ARAGVGSLLL AFYFRQRDAA LDPTFEFTHL
SFGEYLVARR LVRALERLCS KLEQGDEGSE DGWREQDALL SWAEFCGPTL MEPNLAELFS
AAIHACPVET ARAWQHVLCR CIGFVMRNGM PMEKLNPRPS FREEVQHANH AEIALFVALN
ACATKSRKLS KSDWPTPASF RAWLAQKTES STGHPPFVLS RALSFLELPE ANLQRANLQR
ANLQRANLRD ANLRDANLRD ANLQHANLRG ADLRGANLRS ANLRGANLRG SNLQHINLQH
ASLISADLRG ADLRGANVRG ANLRITNLRG ADLTGSHYSK TSTQWPDGFA PVAAGCTLID