Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Achl_3564 |
Symbol | |
ID | 7295045 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter chlorophenolicus A6 |
Kingdom | Bacteria |
Replicon accession | NC_011886 |
Strand | - |
Start bp | 3961087 |
End bp | 3962958 |
Gene Length | 1872 bp |
Protein Length | 623 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643591970 |
Product | heparinase II/III family protein |
Protein accession | YP_002489609 |
Protein GI | 220914300 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 76 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATATCC CCGCAGCGGG CAGCGCGGCG TGGCCTGCCG AGGGGCTGCA TGTGCCCGGG CCTGTTGCCC GGCAAGCGCA GGAGGAACGG GGAACGCCGT GGCCCCAGCC CCTGGTATCC CACTACGCCC GGTACTTCCG GGACGGGAAC CGCACGGCCT ACGAGGGGCT GGTGGCTGCA CGCCAGCAGC GGCTAACCCG CGCCGTCGTC ATGGCACTGG CCAGTGGCCC CGGCAGCCCT GGTGCGGACG GGGTCGACGC GGAAGCGTGG CTGGACGAGG TCATCGACGG CGCATTCCTC CTCTGCGAGC AGAGCTCGTG GAGCTGGGCC GCGCACGACG ACGTCTTCCG CCGCACGGGC TGCGTTGTTC CGGACCTGGC CACTCCGTAC CTGGACCTGG GCGCCGGCGA GGTGGCCGCG CAGCTGGCAT GGCTGGACCA TGTCCTGGGC CTGCAGCTCG ATGACAGGGC GCCGGGGCTG CGGCGGCGCA TCCGCGAGGA GGTCGCCGGA CGCGTCATCC GGCCTTTCCT CGACCGGCTG GACTGGCACT GGCTGGGCCT GGACGGGGAC GTCCACAACT GGAATCCCTG GATCCACTCC AACCTGATCG CCGCGGCACT GTTCCTTGTG GATGACCCGG ACACCCGGGC ACACACCGTG GCCCGCTGCA TTGAGGGCCT GGACCGCTTC CTGGCCTCCA TCCCGGCCGA CGGCGCCATC GACGAAGGCT TCGCCTACTG GTGGAACGGT GCCGGGCGGG CGCTGGAGGG GCTGGCCCTG CTGGAACAGG CCACAGGCGG CGTGCTCGAC GGCGGCCTCC CGGTGGTCCG CGAACTGGTG GCTTTCCCGC ACCGCATGCA CATCGGCGGC GCCTGGTTCC TCAACGTGGC CGATGGCCCG GCACGCGCTG CGGCCGCCCT TCCCTGGGAC ATGCTGCACC GCTGGGCGGC CAGGCTGGGT GATCCCGAGG CGGCCGCCCA CGCCGCCGCC ATGGCCACCG GCGCACCGGA CCCCGCCGCG GGTCTCGGCC GGGTCCTGCA CGCGCTCCTC GAACACCCGG AACCCGCTTC CAACGCCCTT CCGCTGGTGG CAGCCACCTA CCTGCCATCG GTCCAGATCA TGGTGGCCCG GGAAACCGCG GGAACGGTGC AGGGGCTGTT CCTGGCCGCC AAGGGAGGCC ACAACGGCGA GCACCACAAC CACCGCGACG TCGGCTCGGT AGTGGTTGCC GTGGACGGGG TTCCCCTGCT GGTTGACGCC GGGCAGCCCA CCTACACGGC CCAGACGTTC GGCCCGGACC GCTACGGCAT CCGCGCGATG CAGAGCAAGT GGCATAGCGT CCCTGCCCCG TTCGGCCTTG AACAGGGAAC CGGCAGGGAC TTCGCGGCCG GCGTCCTCAA TGCCCCCACC CCTGAGCGGC CGCAGTTGGA GCTCGCGCTG GGAGCGGCCT ACGGGTTCGA ACCCCTGGCT TGGATCCGGA CGGCGGAACT GCGGCGGGCA CCCGGCCGCA TCACCATCGC CGACCGCTGG GATCTTCCCG CGCCGGGCGC AGGCGGAACG TCCGACATCG ACATCACCTT CCTGACGGCC GGAACCCTGG TCCCCGGACC GGACGGGACG GCGACCGTCC GTCCGGACGG CATCCCGGCG GTTGGTGCGG CAAACGCGAC CCCGCGGGGA GCTGCCCTGC GCTGGGACCC TGCAGCTGTC GTCGTCCTCG TGGACGAATG GGAGCTGGAC GATCCGCTCC TCGCCGACGC GTGGGGACCA CGGCTGACCC GGCTGCGCTT CCGAACAACG GCACCTGCTG CACCATCCCC TGCTTTCCGC GCCGCCGGCG CATTCACCCT CACTGTGGAG GCAACACCAT GA
|
Protein sequence | MDIPAAGSAA WPAEGLHVPG PVARQAQEER GTPWPQPLVS HYARYFRDGN RTAYEGLVAA RQQRLTRAVV MALASGPGSP GADGVDAEAW LDEVIDGAFL LCEQSSWSWA AHDDVFRRTG CVVPDLATPY LDLGAGEVAA QLAWLDHVLG LQLDDRAPGL RRRIREEVAG RVIRPFLDRL DWHWLGLDGD VHNWNPWIHS NLIAAALFLV DDPDTRAHTV ARCIEGLDRF LASIPADGAI DEGFAYWWNG AGRALEGLAL LEQATGGVLD GGLPVVRELV AFPHRMHIGG AWFLNVADGP ARAAAALPWD MLHRWAARLG DPEAAAHAAA MATGAPDPAA GLGRVLHALL EHPEPASNAL PLVAATYLPS VQIMVARETA GTVQGLFLAA KGGHNGEHHN HRDVGSVVVA VDGVPLLVDA GQPTYTAQTF GPDRYGIRAM QSKWHSVPAP FGLEQGTGRD FAAGVLNAPT PERPQLELAL GAAYGFEPLA WIRTAELRRA PGRITIADRW DLPAPGAGGT SDIDITFLTA GTLVPGPDGT ATVRPDGIPA VGAANATPRG AALRWDPAAV VVLVDEWELD DPLLADAWGP RLTRLRFRTT APAAPSPAFR AAGAFTLTVE ATP
|
| |