Gene Information |
Locus tag | Francci3_1586 |
Symbol | |
ID | 3903721 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 1902072 |
End bp | 1903406 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637878923 |
Product | glycosyl transferase family protein |
Protein accession | YP_480691 |
Protein GI | 86740291 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
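The coordinate and length fields above are consistent with each other, and a short check makes the arithmetic explicit. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon; both assumptions match the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: codons minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1902072, 1903406)
print(length)                     # 1335
print(protein_length_aa(length))  # 444
```

Strand does not affect the length calculation; it only determines which strand of the replicon the coding sequence is read from.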
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.807446 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.639173 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCTGC ACGGGAATCG GGAGTCCCCG GCCGGCCCAG CCGCGGCCGA CGAGGGTATC TGGTGCTGTG AGCTCGAGGT GTCGGCCGGG GGCGTCGTGC GCTCGGTCGT GCCGCCACGC GACCGTCAGC GACGTGCGCG GGTCCTCGCC CTGCTGCACG GTGAGCCGCT CGGCTACGCG GCGGCCGCCA GGACGGCCAC CGGCATCGAC GTCGATGAGC TGCTCGGCAG GGTCTGGTCG GAGTTCGGCG GGCGCATCAA CGATCACCTG GTCGGCGAGG GTCTCCGCCC GCTCGAGCGC CTCGACGCCC GCACCAGGCC TGCCGCGGCC ACCGAGGCCT GCCCGAACAA GGTTCAATCC ACCGATCTCG ATTCCACGGA TCTCGTCTCC GTCGTCGTCG CCACCAGAGA CCGCAGTGAG ATCCTGGCCA ACTGCCTGCG TCGACTCACG GAGGTCACCT ATCCGGCCGT CGAGTTCCTC ATCGTGGACA ATGCCCCGTC GAGCGACGCC ACGAAGCGGG TCGTCGACTC GTTTTCGGCC ACCGACGAAC GATTCCGCTA TGTCCCCGAA CCACGCCCGG GTCTCTCCCG CGCCCGCAAC CGCGGGCTCG CGCTGGCGCG GGGGGTGTAC GTCGCCTATA CCGACGACGA CGTCTCGGTT GACCCCGGCT GGATCGACGG GCTGGTTCGG GGTTTCCGGC GCCGGCCGGA CGTTGCGTGC GTCACGGGAC TCGTCTGCAC GGCGAGCATC GTGAGCGCGG CCGAAGTCTA CTTCGACGCC CGGGCGTCGT ACTGGTCCAC CCGCTGCGAG CCGGTGCTTT TCGACCTCGC CGACAACGAG CGGCACGGTC CGCTGTACCC CTATATAGGT TTCGTCGGCA CCGGCGCGAA TGTCGGGTTC GACGTCGCGT TCCTGCGGGA CCTGGGTGGC TTCGACGAGG CGTTGGGAGC GGGCACCAGG AGCCGGGGCG GCGAGGACCT CGATCTCTTC GTGCGCATGT TGCGGGCGGG CCGAGCGATC GCGTACGAAC CAGCCGCCTT CGTCTGGCAT CACCACCGCG CCGACGACGC GGCGCTGCTC GCTCAGATGT TCGGCTACGG CTCCGGTTTC ACGGCCTTCC TGGCCAAGCT GCTTCTCCAG CGGTCGACCC GGGGTGAGGT GGTACGTCGC ATCCCACGAG GCCTGCGTGG GATGGCCCGC ATCGGCCATG CGACGAGCCA GCGGCTCGAC GGGCGGGTCC AGGCCCCGAA GGGTGCCCTG CTGCGGGAGT TCGCCGGCTA CGCCGCCGGA CCGTTGCTCT ACGCCCGGGA GCGGCAAACG GCCGCCTGGC GGTGA
|
Protein sequence | MNLHGNRESP AGPAAADEGI WCCELEVSAG GVVRSVVPPR DRQRRARVLA LLHGEPLGYA AAARTATGID VDELLGRVWS EFGGRINDHL VGEGLRPLER LDARTRPAAA TEACPNKVQS TDLDSTDLVS VVVATRDRSE ILANCLRRLT EVTYPAVEFL IVDNAPSSDA TKRVVDSFSA TDERFRYVPE PRPGLSRARN RGLALARGVY VAYTDDDVSV DPGWIDGLVR GFRRRPDVAC VTGLVCTASI VSAAEVYFDA RASYWSTRCE PVLFDLADNE RHGPLYPYIG FVGTGANVGF DVAFLRDLGG FDEALGAGTR SRGGEDLDLF VRMLRAGRAI AYEPAAFVWH HHRADDAALL AQMFGYGSGF TAFLAKLLLQ RSTRGEVVRR IPRGLRGMAR IGHATSQRLD GRVQAPKGAL LREFAGYAAG PLLYARERQT AAWR
|
| |
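The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any analysis. The following is a rough sketch of how the gene sequence could be cleaned, its GC fraction computed, and its translation checked against the listed protein. It assumes the gene sequence is given in coding orientation (it begins with ATG) and relies on translation table 11 assigning the same amino acids to codons as the standard code. The helper names are hypothetical, and only the first 30 bases of the record are used in the example.

```python
from itertools import product

# Codon table built in TCAG order; table 11 shares these amino acid
# assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and normalise case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
prefix = "ATGAACCTGC ACGGGAATCG GGAGTCCCCG"
print(translate(prefix))  # MNLHGNRESP
print(gc_fraction(prefix))
```

Running `translate` and `gc_fraction` on the full gene sequence should allow comparison with the protein sequence and the GC content field listed in the record.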