Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_1300 |
Symbol | |
ID | 3904349 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 1554173 |
End bp | 1556215 |
Gene Length | 2043 bp |
Protein Length | 680 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 637878633 |
Product | heparinase II/III-like |
Protein accession | YP_480406 |
Protein GI | 86740006 |
COG category | [S] Function unknown |
COG ID | [COG5360] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCGCGT CGGTTCCGCT CGGTCGCTAC CTGCGCACCA CCGCGGGACT GCGACCGGTC CAGCTCGGTG CCCGTGTCCG GCTGCGCGGG CAACGGGCCG TCCTGAGTCG CCACCCGGGC CTGGGCGAGA TGCTGCTGCG CGGCAGGCCG CCAGCGGAAT CCTGGCCGAT CGGGTTCCTG CCGTTCGACG GCAGATGCCC GCCGGCTCGG CCGATGTTCG ACGAGATCGT CGCGGGCCGC CTAACCCTGC TCGGTCACTC CCGCGATCTG TGTCCCCGGG ACTCCAACAC TCCCGCTACT GGAACCCCCG TACCCCCCAC CCCCACGACC GGCACCCCCG CGGGGGCCGG CGCCGCCGCG GCCCCCGCCC GATGGGACTG GAGGCAGGCC GACGCTCCGC TGCTGTGGCG CTACCATCTG CACTACTGGG ACTGGGCATG GGCATTCACC ACCGAGGCTG TCCAGGGGCC CGCGATGTTC GCCCGGCTGT ATCTGGCCTG GCGGACCTCG GTGGCCCTGG GCGATCCGGT TGCCTGGTCG CCCTACGTCG TGTCACTCCG GGCCTGGACG CTGTGTGCCT TGTGGCCCCG GCTCGCGCGG GGAACGCCCG CCGAGATGGC GGTCCTCGCC GACCTCGGCG TCTGCCGGTC CTTCCTGCGC ACTCATCTGG AGACCGACGT CGGCGGGAAC CACCTGCTGA AGAACTACAA GGCGCTGATC GGGCTGGCGG TCGCCGATGA CGACGCCCGC GGCCGCCGAC GATGGGTCGA CGCGCTGCTG CGCGAGCTCG ATCGGCAGGT CCTCAGCGAC GGTGGACACT ACGAGCGCTC CCCCACCTAT CACTGTCAGG TGCTGGCCGA CCTCGACGAC GTCGCCGGGC TGCTCACCGC GGCGGGCCAC GTGGTGTCCG GCGCCCTGCT CGACGCGGCC GGCAGGATGC GGTCCTGGCT GACGGCCGTA CTCGGACCGG ACGGCGTGGT TCCCACGCTC AACGACGGGT TCGCGGTGCC CGCCGAGGCC CTGCGGCTGC TTCTCCCGGC TCCGGTAAGA CCGACGCCGA TCGCAGTCCA GGGTCCGGGT CCCGTTCCGA TCCCGGCACC GCGCCTGGCC GAGATGCCCA CCTCGACGTC GCAGTCGACC GTGGGGACCG TGCCCCGTCC GCGGACGCCC GCATCTGACC GGGCCGACGC CCTTCTGCTG GTCGACAGCG GTCTCGCCGT GCTGACGGCT GGACCGTGGC ATCTGCTCGC CGACGTGGGC CTGCCCTGTC CCGAGGACCT TCCGGCCCAC GCCCACGCCG ACACCCTCGC GTTTCTGCTC TGGCACGACG GCCGGCCGCT CCTGGTGGAT ACCGGGACCT CGACCTACGC CCCCGGACCG GATCGGGACG CCGAACGCGG TACCGCCGCC CACTCGACCG TCATCGTCGA TCACGCCGAC TCGACCGAGG TCTGGGGCGC GTTCCGGGCG GGCCGCCGGG CCCGTCCCAC TCTTGTCACG ATGTGCCACC ACGAAAACGT GGCGACGTTG GCCGCCGGCC ATGACGGCTA CCGCCACCTT CCCGGGCGAC CCGTGCACTG GCGGACGTGG CGACTCGACC CGAGTGGGTT GTCCGTCGAC GATCGGATCA CCGGGGAGGG CCGCCACCAC GTCGAGGTGC TGTTCCACTT CGCCCCCGGG GTGTCCGTCA CCGCCGCGGC GGCCGATACC AGGCGAACCA GTACCACCTC GATCGGGACC ACCTCGATCG GGACCAGCCG GAACGCGGCC ACCCGGAGCG GGGCCACGGA TGCGCTCACG GTGACCACCT CCCAGGGGCG GCTGGTCCTG CGGGCCGGCG GACCGGGGCG GTGGGTGGTG CGCTCCACCC GCCGGGCGGT CGGATGGAGC CGTACGGTGC CGGCGTACAC CGCCGCGTAC GTGATCGATG CCGAGCTGCC TGTCGCGGTA CGCACCACGG TCGTCCACGA AGTCCGCTCC GGCGGCCCGC GCTCCGGCAA CCTGCGCTCC GGCGACACGG AATCGTCGCC CGCCCGCCTC TGA
|
Protein sequence | MTASVPLGRY LRTTAGLRPV QLGARVRLRG QRAVLSRHPG LGEMLLRGRP PAESWPIGFL PFDGRCPPAR PMFDEIVAGR LTLLGHSRDL CPRDSNTPAT GTPVPPTPTT GTPAGAGAAA APARWDWRQA DAPLLWRYHL HYWDWAWAFT TEAVQGPAMF ARLYLAWRTS VALGDPVAWS PYVVSLRAWT LCALWPRLAR GTPAEMAVLA DLGVCRSFLR THLETDVGGN HLLKNYKALI GLAVADDDAR GRRRWVDALL RELDRQVLSD GGHYERSPTY HCQVLADLDD VAGLLTAAGH VVSGALLDAA GRMRSWLTAV LGPDGVVPTL NDGFAVPAEA LRLLLPAPVR PTPIAVQGPG PVPIPAPRLA EMPTSTSQST VGTVPRPRTP ASDRADALLL VDSGLAVLTA GPWHLLADVG LPCPEDLPAH AHADTLAFLL WHDGRPLLVD TGTSTYAPGP DRDAERGTAA HSTVIVDHAD STEVWGAFRA GRRARPTLVT MCHHENVATL AAGHDGYRHL PGRPVHWRTW RLDPSGLSVD DRITGEGRHH VEVLFHFAPG VSVTAAAADT RRTSTTSIGT TSIGTSRNAA TRSGATDALT VTTSQGRLVL RAGGPGRWVV RSTRRAVGWS RTVPAYTAAY VIDAELPVAV RTTVVHEVRS GGPRSGNLRS GDTESSPARL
|
| |