Gene Information |
Locus tag | Francci3_1146 |
Symbol | |
ID | 3903574 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 1360375 |
End bp | 1362795 |
Gene Length | 2421 bp |
Protein Length | 806 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637878478 |
Product | hypothetical protein |
Protein accession | YP_480254 |
Protein GI | 86739854 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
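The coordinates and lengths in the gene information above agree with each other under two assumptions that the record itself does not state: the start and end positions are 1-based and inclusive, and the final codon is a stop codon that is not counted in the protein length. A minimal Python sketch of that check follows; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp, End bp.
length = gene_length_bp(1360375, 1362795)
print(length)                     # 2421, matching "Gene Length"
print(protein_length_aa(length))  # 806, matching "Protein Length"
```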
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0503297 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAGCC GGCACGAGAT GGGGCAGACG GGGCCTGTCT CCGTCGAACC CGGGACAGAG CCGCAGGCGC CCTTCGACGT CATCGATGCG CAGGTTGTCG TGAGTCCCGG CTTGGAGAGC GGCCTTGCGC ACGGGACACC CGGATCGGTG GTCGTCGCCG CGCCCAGGGG TGACGCCGTG CCCAGGGGTA ACACTGGGCT CGGGGGTAAC ACTGGGCTCG GGGGTGACGC CGAGCCCCGG AACGACACGC CGCGGTCCCG GCCGGACGGA GAACAACCGC CGGCCGGGCC GCCGGCCTCC GACGCCGCTC GCGGTCGGCG GACTCCCGTC CAGCGGCTGC GCGTTCGTCT CGCGAGCATC TCGGGGCCCG CTCGGTACGT GATGGTGCTC GCCGTTCTGA GCGCGGCGGT CGCCGTCGTC CTCCGGCACA CCGTGTTCCC GTTCCTGTCG GTCAACAACG ACGAGGGCAT CTACCTCCTG CATGCCAGGA CCCTCGCCGA GGGCCGCCTG TTCCCGCCCG CCCCGGACCC GGCACGGTCC TACCTGCCGT GGCTGGGAAC CGTTGTCGGC GATCACTACG TGTTGAAGTA CACGCCCTTC GTACCCGCGA TCTTCGCAAT CGGCATCGTG CTGACGGGAA GCATCGATCC GGCGCTTGCA ATGATCGTCA TGGCGGCGGT CGTCGTCACC TACCTGCTGG GCGTCGAGCT GTTCGGTAGC CGGCGGACCG CGGCGTTGGC GGCGACCCTG CTCGCCCTCT CCCCGCTGGT GATCCTGCAG AGCGCGATGG TGCTGAGCTA CTTGCCGACC CTGGTGCTGC TGCAGGTGAC CGTTCTGGGG GTGTTGCGGG GACGGCGCCG CCGACGGGCG GCACCGGTGG TCGTCGCGGG CCTCGCCCTC GGCGTGGCGG CGGCGGTGCG GCCGTATGAC GTCGTCCTGC TCCTGGCTCC GCTCGCGATC TGGCTGCTCC TGACCTCCAC GGGCGGGCGC TGCTGGCTGG GCCGGTGGCT GCTTTGCGGG CTCGCCGCGC CCGTGGCGCT TGTCCTGCTC AGCAACGCCG CTGCCACCGG TGACCCGTTG CGGCTGCCGT TCACGCTCCT TGAACCCGAC GACAAGCTCG GCTTCGGGGT GCGCAAGCTG TATCCCGGCG ACGGCAGGCA CGACTTCGGC TTGCTCGAAG GGCTGCGGTC GGTCGGCGAC CACCTGTGGC TGTTCGGCGG CTGGGCCTGC GGGGGAGTGG TGCTGGCCGG CTGCGCGATC GTCGTGGCGG CTCGGCGGCA GCTGTCCGCA CCCGCCGCGC TCCTGGGTGC GGGTGGGCTG CTCTTCCTGG GCGGATACGT CGGCTTCTGG GGCGCGTGGA ACGCGGCCGA GTTGTGGGGC GGCATCCGCT ACGTCGGCCC TTTCTACCTG GTCCCGGTGC TCATCCCGCT CGTCCACCTC GGCGCGGAGG GCCTGGTCAG GGCCGGCGAA GCCGTCTTCG CCCGCGGGCG AAGACTCGGG CTCATCGCCG TCGCGGGGAC CGGCGCCGCC ATCCTGGCGC TCACGGCCGT CGTCCTGGTC GGGGCGGTGC GCGCCAACCT CACGCTCACC GGGCACGACC GTGACCTCAG CGCGATGCTG GACCGGCTGC CGGGCGATCC GCTCGTGTTC GTCGCGGCGA ACCCGCCGTT TCTCGGGCAT CCCACTCCGG TGACCGCGAA CGGGCCGAAG CTCGAAGATC CTGTGCTGTT CGCCGTGTCG CGGGGGGTCG ACGATCTCAT CGTGGCCGCT GACCACGCGG ACCGCCCGGC CTATCTGCTC 
CGCCTGGCAT CGGCCTACGA CCGCTCACCC GCCTCGCCCT CCACCGCGCG GGTCGAACGG CTGGCGGTCA GGAGCGGCAG GCGGGTGGCG GTGACCCTGA GAGTGGATGC CGCGCCCGCG GGCGCCCGTT CCGCCCGCGT CGTGCTCACC GAGGGTGTCC GCCGCCTGTC GATTCCGGTC CATCCGCGGA TCCCCACCTC GATGATCCTC ACCGTCGACG CGGACGGCCT CGATACCACC GATGTCATCA GTATCACCAG TATTGCCAAC ATCGCTGGCG GCATGGACAG CACCGGAATC GTGACTGACC GCAACGACTC CGGTCGGTCC CGGGTGCATT CCGGATCCCT CGGCCGTCCG TCGGCAAGCA CCGTGCGGGA CGGTCGGGGC ACCTCCATCA CCGTGGCCTA CTACGCGACG AGCCCTTCCG GACGGGAACA TTTCCTCGAT CAGCAGGTGT TGCCCGTGCG GCTGGCCGCA CGGGACGGTT CCCGGCCGGA CGGTTCCCGG CCGGCTGCCG GCCCGCCGGT CACCGTGCTC GGATCCCTCG GGGAGGTCGA TGAGGTCGGC GAGGGACGTC GGCCGCCGCT GCGCATCATC ATCGATGATC GTCGAGGCTG A
|
Protein sequence | MSSRHEMGQT GPVSVEPGTE PQAPFDVIDA QVVVSPGLES GLAHGTPGSV VVAAPRGDAV PRGNTGLGGN TGLGGDAEPR NDTPRSRPDG EQPPAGPPAS DAARGRRTPV QRLRVRLASI SGPARYVMVL AVLSAAVAVV LRHTVFPFLS VNNDEGIYLL HARTLAEGRL FPPAPDPARS YLPWLGTVVG DHYVLKYTPF VPAIFAIGIV LTGSIDPALA MIVMAAVVVT YLLGVELFGS RRTAALAATL LALSPLVILQ SAMVLSYLPT LVLLQVTVLG VLRGRRRRRA APVVVAGLAL GVAAAVRPYD VVLLLAPLAI WLLLTSTGGR CWLGRWLLCG LAAPVALVLL SNAAATGDPL RLPFTLLEPD DKLGFGVRKL YPGDGRHDFG LLEGLRSVGD HLWLFGGWAC GGVVLAGCAI VVAARRQLSA PAALLGAGGL LFLGGYVGFW GAWNAAELWG GIRYVGPFYL VPVLIPLVHL GAEGLVRAGE AVFARGRRLG LIAVAGTGAA ILALTAVVLV GAVRANLTLT GHDRDLSAML DRLPGDPLVF VAANPPFLGH PTPVTANGPK LEDPVLFAVS RGVDDLIVAA DHADRPAYLL RLASAYDRSP ASPSTARVER LAVRSGRRVA VTLRVDAAPA GARSARVVLT EGVRRLSIPV HPRIPTSMIL TVDADGLDTT DVISITSIAN IAGGMDSTGI VTDRNDSGRS RVHSGSLGRP SASTVRDGRG TSITVAYYAT SPSGREHFLD QQVLPVRLAA RDGSRPDGSR PAAGPPVTVL GSLGEVDEVG EGRRPPLRII IDDRRG
|
| |
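The sequences above are printed in space-separated blocks of 10 residues. As a rough sketch, the blocks can be joined and the GC fraction computed so that it can be compared with the "GC content" field. The helper names here are hypothetical, and the example string is only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks and upper-case the residues."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two 10-base blocks of the gene sequence, used only as a short example.
example = "ATGAGCAGCC GGCACGAGAT"
print(len(clean_sequence(example)))    # 20
print(f"{gc_fraction(example):.0%}")   # 60%
```

Pasting the full "Gene sequence" field into `gc_fraction` should give a value that can be checked against the reported GC content.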