Gene Information |
Locus tag | Francci3_3078 |
Symbol | |
ID | 3904280 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 3647638 |
End bp | 3649353 |
Gene Length | 1716 bp |
Protein Length | 571 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 637880399 |
Product | hypothetical protein |
Protein accession | YP_482164 |
Protein GI | 86741764 |
COG category | [R] General function prediction only |
COG ID | [COG3889] Predicted solute binding protein |
TIGRFAM ID | |
|
|
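The coordinate and length fields above are mutually consistent, and a minimal Python sketch can check this. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(3647638, 3649353)
print(length_bp)                  # 1716
print(protein_length(length_bp))  # 571
```

Both values match the Gene Length and Protein Length rows of the record.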
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAGGCAG CGCGTGGTCG TGGCCGGTGG CCCGCCGGCC CGCGGCGGCC GACGTCGTGG TGGCGGGAGT CGTCGTGGTG GCGGGAGTCG TCGTGGCGGG GCATGGCCGG CGGTGGACGG CGGGCCGAGC GGGTCGCCGA CCTGCTGGAC GGACAGCGCC CCCCGTATGA CGAGACCGAC GCGCGGCTGA TCGACACCGT GGCGGCCCTC CGGGACCTGC CGTCACCGCG GCTGCATCCG GCACGGCACG CCGCCCTGCG CGGACAGCTG TTCGCCGCCG TCACCGGCTC CCCGGCATGC GATCCCGTCC GTTCCCCCGC CGCCGTTTCC CCCGGCACCT CCGTGAAGGG CCACGAGACC GAGGCCCGTC CCACAAGAAC CCGTCCCGCA CGCACGCCTC CCGAGGGCGT CGATCCTGAC CTGCCGGGAA ACGATCCGGT CGACGCGCGG TGGGTACGCC GGACCGGTGC CCGCGGTGTC TCTGCCTGCG GTGTCTCCGG CCGGGGCAGG ATGACCCGGG CCGCCCGGCC GCTGCTGGCC GGAGCACTCA CCGCCGCCGT CACCACGGCG GCCCTCGCGG TCAGCTCGGG GGACTCGCTA CCCGGCGATA CCCTGTACGG CGTCAAACGA CAGGTCGAAG ACCTCCAGGT GTCGCTGGTC CGCGATCCGG TCGAGCGGGC GAAGACCCGG CTGGGCATGG CCGGCTTGCG GATGAGCGAA CTGCGCACGA TCACGGTGAA CGACGGCGGG GTGATCGCCC CGGAAACCGG TGCCGGCGCT CCGGAGACGA GCCCGCGGGT GCCGGTGGTG AACCCCACGG CGACCGCCGC ACCACCGACC ATCGCGGTGT CACCGACCAT CGCGGTGTCA CCCGCTGCGG TGTCACCCAC TGCGGGGACG TGGCCTCCCG GTCCCCGCAC ACCCGCCGTC TCCGGCGACG CCGGCGAATC CGATGGCCCC GACGCGCCGA GCGACCCGCC CGGCCCCGGC AACGGGGACC GCCTCGACCC CGAGCTGGTC AACGCGCTGC TGCGGGACTG GATCGCCGAG GTGCGCGCCG GCACGCAGGT ACTGCTGGCC CGGGTCGCCG CCGGGGACAC GGACGCCTGG ACCACGGTGA ACGCCTTCAC CACCGAGCAG TCCCGCGGGC TGAAGAACCT GCTGAGATCG CTTCCCGTGG GCTCGGTCGG ACCGGCGCAT GCGGCTCTGG ATCTCATCGA CGACGTCAGG CGCAGGCTCG GCCCGCGGGC ACCAGCGCCG GTCCGGGCCC CCTCACCCGT CCGCCAGATC TCTCCGATCG TGCCGACCGG TGACGCCCTC ACCCCCCCGC CCTACGTGGC GCCGCTGCTC TCCGCACCTC GGCCCACGGC GACCGGGGCC ACCGCCGCGC CCGCCCTCAG CACTCCCACC CCCGGCACTC CCACCCCTGC CGGCACCGGC ATCGGCTCGC CGGGGCCCCC GAGCCCGGCG CCGGGTGGTG CCACGAGCGG AACGACCATC CCGACCCCGA CCCCCGCCAC GACCCCGACC CCCGCCACGA CCCCGACCCC CGCCACGACC CCGACCCCCG CCACGACCCC GACAGACGAC ACGACCCCGA CAGACGACAC GACCCCGACC CCCGCCACGA CCGCATCCGA TGTCACGCCG ACCACGGTGG GCGCGCCCTC GCCGCAGAGC CCCCCGAACG GGGCGCCGAC GCCGGGATCC CGCTAG
|
Protein sequence | MKAARGRGRW PAGPRRPTSW WRESSWWRES SWRGMAGGGR RAERVADLLD GQRPPYDETD ARLIDTVAAL RDLPSPRLHP ARHAALRGQL FAAVTGSPAC DPVRSPAAVS PGTSVKGHET EARPTRTRPA RTPPEGVDPD LPGNDPVDAR WVRRTGARGV SACGVSGRGR MTRAARPLLA GALTAAVTTA ALAVSSGDSL PGDTLYGVKR QVEDLQVSLV RDPVERAKTR LGMAGLRMSE LRTITVNDGG VIAPETGAGA PETSPRVPVV NPTATAAPPT IAVSPTIAVS PAAVSPTAGT WPPGPRTPAV SGDAGESDGP DAPSDPPGPG NGDRLDPELV NALLRDWIAE VRAGTQVLLA RVAAGDTDAW TTVNAFTTEQ SRGLKNLLRS LPVGSVGPAH AALDLIDDVR RRLGPRAPAP VRAPSPVRQI SPIVPTGDAL TPPPYVAPLL SAPRPTATGA TAAPALSTPT PGTPTPAGTG IGSPGPPSPA PGGATSGTTI PTPTPATTPT PATTPTPATT PTPATTPTDD TTPTDDTTPT PATTASDVTP TTVGAPSPQS PPNGAPTPGS R
|
| |
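The protein sequence can be reproduced from the gene sequence using translation table 11. The sketch below is a minimal illustration, not the database's own pipeline. `translate_table11` and `gc_fraction` are hypothetical helper names. The codon assignments are the standard ones that table 11 shares, and the first codon is assumed to be read as methionine when it is one of the start codons listed for table 11, which is why the leading GTG corresponds to M. Only the first 30 bases of the gene are used here.

```python
from itertools import product

BASES = "TCAG"
# Standard codon assignments in TCAG order; table 11 uses the same amino acids.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons listed for table 11; any of them at position 0 is read as M.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate_table11(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is translated as methionine (assumption)
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


prefix = "GTGAAGGCAG CGCGTGGTCG TGGCCGGTGG"  # first 30 bases of the gene
print(translate_table11(prefix))  # MKAARGRGRW
print(round(gc_fraction(prefix), 2))
```

Passing the full gene sequence to the same two functions should give the complete protein and a GC fraction comparable to the GC content row, provided the sequence is copied without line breaks inside a codon.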