Gene Information |
Locus tag | Hoch_3508 |
Symbol | |
ID | 8545897 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | - |
Start bp | 4834396 |
End bp | 4837098 |
Gene Length | 2703 bp |
Protein Length | 900 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 646388176 |
Product | pentapeptide repeat protein |
Protein accession | YP_003267903 |
Protein GI | 262196694 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
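
The coordinate and length fields above are consistent with one another. The short sketch below re-derives them, assuming that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon that is not counted in Protein Length. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4834396, 4837098)
print(length)                  # 2703, matches "Gene Length"
print(protein_length(length))  # 900, matches "Protein Length"
```

The strand field does not enter this calculation: for a feature on the minus strand the same two coordinates bound the gene, only the reading direction changes.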
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.432343 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGACCA CGCCCGACTT GCTGATTCAA AAGCCCGTCT CGGTCCTGCG CAAACCGCTC AAGCTCAAAG ACGGACGCGG CCTGGTCCGC TCGCTGGTCA GCGTCGCGCT CGCGGCGGGA ACGCTCCAGG CCGGCAAACT CGCCAGCAGC GTCGCCGACA TGGGCGCCTC GCTCAGCCTC GATACGCCAC CGGGCGAACG CGCGGGCGCC TTGGTGTTGC GGGCGCTCGG TGTGGCGTTG GCCGACGTCG TCGCCGAACA CCGGGCCGAG CTGAGCGAGC AGGCGCTCGC AGGCGCGCAA CTCGACCACC TCGACGCCAT GATCGACGGC CTCGAGTTGT CGATCTCACG CGACTTCCTG GAGCGACCTG AGTCGCTGCC CTTGCCGGAG CGTGTGGCTC CGCTGCTCGC CGAGTGGTTC GAGACGCAGG GCCTATCGAT GCGTCACGAA GCGCTCACGC AGCGCCTGCG CAGGTATTTC GTGTACTCGC TCATCGCCGA GTGGCGCGCG CGCCCAAACG ACTACGAGCC GCTGCGCAAA GCGCTCGATA CGCCGCTGAG CGGCGCAGGG AGGCGTGTTC AGGATTGGCG GCGCTACGGC GCCTGGCTGG CCCGCCAGGT CGAGGCGCCC ATGTTCGGCG AGCACTTCGG CCTGGCCCGC GTGTACGTGC CGCTGCGCGG CTACTTCAGC CCCGCGCAAG AGCGCGACCC GCGTGACGTG TTCGAGCAAC GCCGCGAGGA GGACAAACCC AAGGTGGTCG CCATGCACCA GCACGTGAAC GCGTGGCTGA GCGCGCGCGA CCCGCGCGAC GCGCTGCGCC TGCTCAGCGG CGGCCCGGGC AGCGGCAAAT CCTCGTTTGC ACGCATGCTC GCGGCCGAGT TGGCGGCGAC GCGTCGGGTG CTGCTGGTGC CGCTGTTCGA ACTCGACCTA AAAGACGATC TCGCCAGCGC CGTGCACGCG TATCTGCGCC GCAAGTCCCT GTTTGATGAC GAGGTGTTGG CGCCCGACGC GATCGAGGAA CCGCTCGTGC TGTTGTTCGA CGGCCTGGAC GAACTCAGCG TACGCGGGGC GCTCGGACGC GAGGCGGCGC GGGAATTCGT CCGCCAGGTC GAACGACTGC TGCGCGACCG CAACCACGAC GATTGCCGTG TGCAGGTCCT CATCACCGGC CGCGACCTCG CCATCCAGGG CGCTGACGAC GAACTGCACG CTCCTCACCG CATCTTGCAT TTGCTGCCGT ACTATCCCAA TTCAAGCGAG AGGGAACGCT TCGATGATCC CGAGGGGCTG CTCGCGCACG ATCAGCGCGA CGCGTGGTGG GCTAAGTACA GCGCTCTGCT CGGACGCGAC GAGCAAGCGG CTTTCCCCGT CGAACTCGCC CGTCCGGCCC TGGTCCACCT GTCGGCCGAG CCGCTGCTCC TCTACCTACT CGCCTTCAGC TACTGCGCGG GCAAACTCGA CCTCTCCGAG GCATCGCTCG ATGTCAGCAA GGTCTACGAC GGGCTGCTGA GAGGTGTGTA CCAACGCGGC TACGAGCAAC GACCACACAA GGCCGCCATG AAGTCCTTCG AGGATTTCAC GGCTGTGCTC GAAGAGATCG GGCTCGCCGC CTGGCACGGC GAGGGCCGCA CCGCGACGCT CTCGACCATC TGCCATTACT GCGACCAGAA CCCCCGCCTG AAACGTCTCT TCGCCCAGTT CGAAGACGAT GCGCGGGCCG GCGTCGGCAG CCTCTTGCTC GCGTTCTATT TCCGCCAGAG GGACGCCGCG CTCGACCCTA CCTTTGAATT CACACACCTG 
AGCTTCGGCG AGTATCTGGT CGCGCGTCGC TTGGTGCGCG CGCTTGAGCG ATTGTGCAGC AAGCTCGAGC AAGGCGATGA AGGCAGCGAA GACGGCTGGC GCGAACAAGA CGCCCTACTT TCATGGGCCG AATTCTGCGG CCCAACCCTG ATGGAACCGA ACCTGGCCGA ACTGTTCAGC GCCGCGATTC ACGCTTGTCC GGTTGAGACC GCACGGGCGT GGCAGCACGT GCTGTGCCGC TGCATCGGCT TCGTCATGCG CAACGGCATG CCCATGGAGA AGCTGAATCC TCGCCCGAGC TTTCGCGAGG AAGTGCAGCA CGCCAACCAC GCCGAGATCG CGTTGTTCGT GGCGCTCAAC GCATGCGCGA CCAAGTCCAG GAAACTAAGC AAGAGCGACT GGCCCACCCC GGCGAGTTTC AGGGCGTGGC TTGCACAAAA AACCGAATCG AGCACTGGAC ACCCGCCGTT TGTGCTCTCG CGAGCGTTGT CGTTTCTCGA ACTTCCTGAG GCCAACCTCC AACGCGCCAA CCTCCAACGC GCCAACCTCC AACGCGCCAA CCTCCGAGAC GCTAACCTCC GAGACGCTAA CCTCCGAGAC GCTAACCTCC AACACGCTAA CCTCCGGGGC GCCGACCTCC GAGGCGCCAA CCTTCGAAGC GCCAACCTCC GAGGCGCCAA CCTCCGAGGC TCCAATCTCC AACACATCAA CCTCCAACAC GCTAGCCTGA TTAGCGCCGA CCTCCGAGGC GCCGACCTCC GAGGCGCCAA CGTCCGAGGC GCCAATCTCC GGATAACCAA CCTTCGCGGT GCCGATCTCA CCGGCAGCCA CTACAGTAAA ACCTCTACGC AGTGGCCCGA TGGCTTCGCC CCCGTCGCTG CTGGCTGCAC CCTCATCGAC TGA
|
Protein sequence | MATTPDLLIQ KPVSVLRKPL KLKDGRGLVR SLVSVALAAG TLQAGKLASS VADMGASLSL DTPPGERAGA LVLRALGVAL ADVVAEHRAE LSEQALAGAQ LDHLDAMIDG LELSISRDFL ERPESLPLPE RVAPLLAEWF ETQGLSMRHE ALTQRLRRYF VYSLIAEWRA RPNDYEPLRK ALDTPLSGAG RRVQDWRRYG AWLARQVEAP MFGEHFGLAR VYVPLRGYFS PAQERDPRDV FEQRREEDKP KVVAMHQHVN AWLSARDPRD ALRLLSGGPG SGKSSFARML AAELAATRRV LLVPLFELDL KDDLASAVHA YLRRKSLFDD EVLAPDAIEE PLVLLFDGLD ELSVRGALGR EAAREFVRQV ERLLRDRNHD DCRVQVLITG RDLAIQGADD ELHAPHRILH LLPYYPNSSE RERFDDPEGL LAHDQRDAWW AKYSALLGRD EQAAFPVELA RPALVHLSAE PLLLYLLAFS YCAGKLDLSE ASLDVSKVYD GLLRGVYQRG YEQRPHKAAM KSFEDFTAVL EEIGLAAWHG EGRTATLSTI CHYCDQNPRL KRLFAQFEDD ARAGVGSLLL AFYFRQRDAA LDPTFEFTHL SFGEYLVARR LVRALERLCS KLEQGDEGSE DGWREQDALL SWAEFCGPTL MEPNLAELFS AAIHACPVET ARAWQHVLCR CIGFVMRNGM PMEKLNPRPS FREEVQHANH AEIALFVALN ACATKSRKLS KSDWPTPASF RAWLAQKTES STGHPPFVLS RALSFLELPE ANLQRANLQR ANLQRANLRD ANLRDANLRD ANLQHANLRG ADLRGANLRS ANLRGANLRG SNLQHINLQH ASLISADLRG ADLRGANVRG ANLRITNLRG ADLTGSHYSK TSTQWPDGFA PVAAGCTLID
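
The relationship between the two sequence fields can be checked on a small scale. The sketch below translates the first 30 bases of the gene sequence and compares the result with the first 10 residues of the protein sequence. It assumes that the gene sequence is listed in coding orientation (it begins with ATG even though the gene lies on the minus strand) and that translation table 11 assigns amino acids to codons in the same way as the standard code. The helper names are illustrative only.

```python
from itertools import product

# Codon table built in TCAG order. The amino acid assignments are those of
# the standard code, which translation table 11 is assumed to share.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string; '*' marks a stop codon."""
    dna = clean(dna)
    return "".join(
        CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - len(dna) % 3, 3)
    )


# First three 10-base blocks of the gene sequence, copied from the record.
prefix = "ATGGCGACCA CGCCCGACTT GCTGATTCAA"
print(translate(prefix))  # MATTPDLLIQ
```

Applied to the full gene sequence, the same function would be expected to end in `*`, since the final codon in the record is TGA.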