Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_4526 |
Symbol | |
ID | 3907503 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 5400766 |
End bp | 5403963 |
Gene Length | 3198 bp |
Protein Length | 1065 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637881859 |
Product | glycosyl transferase family protein |
Protein accession | YP_483601 |
Protein GI | 86743201 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0744] Membrane carboxypeptidase (penicillin-binding protein) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCTCTC CCTACGATCA GGGCTCTAGT CGGGGTGGTC CTGCACCTGA CCGAGCATGG CCTAACCCGG ACGGCCAGTC CCGGGTACCC GCGGACCGGG TACCCCCGGA CCGGGTACCC CCGGACCGCG CCCGGCGGCC GGAGCAGGCC AGGCCCGGCC AGGCCCAGCC AGGCCAGGCC CGGGCAGGCC AGGCCCGGGC AGGCCAGGCC CGGGCAGGCC AGGCCCAGGG ACGCCGGCTG GTGCCGCCGA CGGGCCGGCC CACCGGCGGG AGAGGGGAAC GCACCGCTCT CGTGCCACCC GCGGCTCCCT CCGCCGCGGC GAGGACGAAC ACGCCGCCGG ACGGTGGGAC CGGTCGGGGT GCCGGTGCCA CCGTCGGGGG GTGGGTGCCG GACCGTGGCA TCGGGAAGTC CGACGCGCCC GGCCGTGAAC CGTCCAGCGC TCGTCCCGGC ATGCCCGTCG GCGAGGCGAC CGGTCGTCGC CGAGGCGGCG GCGCGCCCTC CGGTCGGGCG ACCCCCACCC GGCGGGTTCC CGGCCGTGGG TTCTCCGATC GGACCGCCTC TGACCGCGGC GTCCCCGATC GTGGGCCGGA CCAACCGGCA GACCGGACAG CCGGCGAGGT CAACACCCTG TACGACACCG TCGACCAGGA AGCCCTACGA CGGGTCGATG TCCGGGAACG CACCGTGGAC GCCACCGTTT ACCACAAGGT GATGCCGGAG CGGGACTCCC GGAATGGCCG CCGCAACGGT GTGTCGGACC GCGACCGCAT GACCGAGGTG ACGCCCGCTC AGCGGGTGGC CGGTCGGCGC GACCTCACCC GCGCCACCGA ACTGATGGCC GTCCGGAGCC GGGCACGGGA GGGAGACGGA TCGGCCGCTA CCCGCCGGAT GGCGGGTGGG CGACCGGGCG CGGATCGGAT CGAGCCCCGC CGGACCGGTC CAGGCCGGCC CGGTCCAGGC CGGGCCGGCC CGGGCCGATC CTCCCGGGCG GAGATGGGTG TCCCGGGCGG GGCGGGAGGA CCCGGGCGCC GCAACGGGAA GGCCCCCCGG CCGGGCCAGG GGCGACACGG CCCGATGTGG TGGCGACGGC GCCCGCGCTG GCTGCGCCGT CTGGTCTTCG CCAGCTTCTT CTCCGGTCTT CTCCTGTTCG CCGCCGGCAT CGGCGTCATC TACGCGGCGA CCCGGGTCCC GCTGCCTGCC GAGGTGAAGA CGGACCAGAC GTCGATCATC TTCTACGGAC CGCCGGCCGG CTCGCACCAG GACAACGGTG AGGAGCTTGC CCGGATCGGC ACCGTGAACC GCACCGACGT GCCGCTCGCC GAAGTGTCCG TGGACATGCA GCACGCGGTG CTCGCGGCCG AGGACAAGAA CTTCTATCAC GAACCCGGTA TCTCTCCCAA GGGCATTGCC CGGGCGCTCT ACGTGAACGT CACCGGCGGG GAGCTGCAGG GTGGTTCGAC CATCACCCAG CAGTACGCCA AGAACGCGTA CCTGTCGCAG GAGCGGACCT TCACCCGCAA GATGCGCGAG ATCGTGCTCG CGGTGAAGCT GGGCCAGAAG TACTCCAAGG GACAGATCCT GGAGTTCTAC CTGAATACGA TCTACTTCGG CCGGGGCGCC TACGGTATCG AAGCGGCGGC GAAAGCCTAC TTCAACACCT CGGCCGCCAA ACTCACCCCC GCCCAGGGGG CGGTCATCGC GGGCCTGATC CGATCACCCA ACTACCTCGA CCCGGCCAAG AACCCGGGTC CGGCGGCGAA CCGCTGGCAC GACGTGGTCG CGACGATGGT CGCCGAGGGC TGGGCTCCGC CGGGGCTCGC CCAGCAGAGC CCCCCACCCG TGGCCGCGAA GGCGCAGGAC GCCGCCGCCT CGTCCGACCA GATCGCGTAC ATCCGCGATC AGGTCAAACG TGAGCTTACG ACCGTCGGGA AGATCACCGA GGATCAGATC AACCGCGGTG GACTGCGCAT CACGACCACG ATCGACAAGG GGCGGCAGTT CAAGGCCTTC CAGGCGGTGA GCGACGTGCT GGGCCCGGCG TACGCCGCCG TGCCGGATCT GCGTACCGGG CTCACCGCGA TAGAACCCGG GACCGGCAGG ATCCTCGCCT GGTACGGCGG GTCGCTCTAC GGCAAGGACG CACAGGGCCA CGAGCAGTAC GTCGACAACG TGTCAGGGGC GCAGGTGCAG TCCGCCTCCA CCTTCAAGAC GATCACTCTG GTCGCCGCGT TGCGTCAGAA CATCAACCTC AAGTCGACCT TCGCCGCTCC CGCCAAGATC ACACTGCCGG GGAATTACGT GGTCAGCAAC GACGAGGGTG AGGCGGGCGA TCTCGGATAC AAGAACCTCA TCGAGGCGAC GGCGGGCTCC ATCAACACGG TGTACGTACC GCTGGGGCAG AACATCGGCG TCTCGAACAT CATCAAGACC GCGCGCGACC TTGGCATCCC GGCCAGTACC CAGCTGCGCA ACGAAGCGGG TATCACGCTG GGCCAGGACG ACGTGCACGC GGTGGATATG ACCACGGTCT ACGCGACCCT CGCCGGCGCC GGAGTGCGGG CCACACCGCA CATCGTGGAC AAGGTGGTCG ACGGCAACGG GCAGGTGATC TATTCGGGTA CCCCGGATGT GAAGCAGGTC ATCCCGGCCA CGGTGGCCCG CGACGCCACG TACGCCCTGC AGAGCGTGTT GACGGATTCG AGTGGCACCG GAAAGCGGGC CCGGCTGGAC GGCGGGCGGG AGGCCGCCGG CAAGACCGGG ACGTCGACCA ACTTCCGGTC GGCCTGGTTC TGCGGGTACA CCAGGGAACT CGCGTCCTGC GTGAACATGT TCCGGGGCAA GGGCACCGAG CAGGATGTGC TGAAGGGCAT TCCCGGCGCC GAGAAGGGCG TCTACGGGGG TACCTACCCG GCCAAGGTGT GGAAGGCGTT CATGGACGCC GCGCTGACGG GGGTGCCGCC GTCGAAGTTC GATCCGCCGG CCTTCGGTGG CCTCGTGCAG GATAACGAGC CCGAGCCGAC TCCCACACCC ACCCCGGCGC CGAGCGCCTC ATCGAGCCAG CCCGGCGATA CCGGCGTCAA CCTGGGCGAT CTCCTGAACC CCAGTGGCAA CGGCAACGGC GGTGGTCAGC AGCAGGGTGC CGGCCAGGCG GGCCGGCCGG CTCGACAGAC GGGGATCTTC TCCGACCCGT TCAACTGA
|
Protein sequence | MSSPYDQGSS RGGPAPDRAW PNPDGQSRVP ADRVPPDRVP PDRARRPEQA RPGQAQPGQA RAGQARAGQA RAGQAQGRRL VPPTGRPTGG RGERTALVPP AAPSAAARTN TPPDGGTGRG AGATVGGWVP DRGIGKSDAP GREPSSARPG MPVGEATGRR RGGGAPSGRA TPTRRVPGRG FSDRTASDRG VPDRGPDQPA DRTAGEVNTL YDTVDQEALR RVDVRERTVD ATVYHKVMPE RDSRNGRRNG VSDRDRMTEV TPAQRVAGRR DLTRATELMA VRSRAREGDG SAATRRMAGG RPGADRIEPR RTGPGRPGPG RAGPGRSSRA EMGVPGGAGG PGRRNGKAPR PGQGRHGPMW WRRRPRWLRR LVFASFFSGL LLFAAGIGVI YAATRVPLPA EVKTDQTSII FYGPPAGSHQ DNGEELARIG TVNRTDVPLA EVSVDMQHAV LAAEDKNFYH EPGISPKGIA RALYVNVTGG ELQGGSTITQ QYAKNAYLSQ ERTFTRKMRE IVLAVKLGQK YSKGQILEFY LNTIYFGRGA YGIEAAAKAY FNTSAAKLTP AQGAVIAGLI RSPNYLDPAK NPGPAANRWH DVVATMVAEG WAPPGLAQQS PPPVAAKAQD AAASSDQIAY IRDQVKRELT TVGKITEDQI NRGGLRITTT IDKGRQFKAF QAVSDVLGPA YAAVPDLRTG LTAIEPGTGR ILAWYGGSLY GKDAQGHEQY VDNVSGAQVQ SASTFKTITL VAALRQNINL KSTFAAPAKI TLPGNYVVSN DEGEAGDLGY KNLIEATAGS INTVYVPLGQ NIGVSNIIKT ARDLGIPAST QLRNEAGITL GQDDVHAVDM TTVYATLAGA GVRATPHIVD KVVDGNGQVI YSGTPDVKQV IPATVARDAT YALQSVLTDS SGTGKRARLD GGREAAGKTG TSTNFRSAWF CGYTRELASC VNMFRGKGTE QDVLKGIPGA EKGVYGGTYP AKVWKAFMDA ALTGVPPSKF DPPAFGGLVQ DNEPEPTPTP TPAPSASSSQ PGDTGVNLGD LLNPSGNGNG GGQQQGAGQA GRPARQTGIF SDPFN
|
| |