Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_2999 |
Symbol | |
ID | 3905496 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 3552610 |
End bp | 3554121 |
Gene Length | 1512 bp |
Protein Length | 503 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 637880319 |
Product | hypothetical protein |
Protein accession | YP_482085 |
Protein GI | 86741685 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.280207 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.727721 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGAGG CGCTTCTGTT GGTACTCAGC CTGGTGCTCG TGGCCGCCTG CGGCGTTTTC GTCGCGGCCG AGTTCGCTTT CGTCACGGTG GACCGGCCCT CGGTGGAGCG GGCCGCCGAA CGCGGTGATC GGGGCGCCCG CGGGGTGCTC ACCGCGCTGC GCGGTCTGTC CACCCAGCTG TCCGGCGCCC AGCTCGGGAT CACCGTCACC AACCTGATGA TCGGTTTTCT GGCGGAGCCC GCCATCGCAC GGCTGCTCGA GGGGCCGATC ACCTCGCTCG GCGCCTCGGA CGGGCTCGCC CGGGCGGCGT CGGTGGCGAT CGCTCTGGTC CTCGCGACGG GCCTGACGAT GGTGTACGGC GAGCTGCTCC CGAAGAACCT GGCGATCGCC CATCCCCTCG GCACCGCGTT GGCGGTGCAG GCTCCCCAGC GTGCCTTCAC GAAGGCCACC GGCCTGCTGA TCCGCTCGCT GAACATCACG GCGAACGCGG TGCTGCACCG CCTCGGCATC TCGCCACGCG AGGAGCTGGC CTCGGCCCGT TCGGCTCAGG AGCTGTTCTC CCTCGTCGGG CGGTCCGCCG AGCACGGCAC CCTCTCCCAC GAGACGGCGA CGCTGGTGCA GCGCTCCCTG CTGTTCGGTG ACCGGACCGC CGAGGACGTC ATGACACCCC GGATGCGCAT GCGCACCATC CACGCCGACG AACCGGTCAG CGAGGTCATC ACCCTCACCC GGCGCACCGG GCACTCCCGC TTCCCGGTAC TCGGCACGGA CAGCGACGAC GTCGTGGGCC TCATCCACGT GAAGAACGCG GTCGCCGTCC CCGAAGACGC CCGGGACCAT ACCCCGGTAC GCGACGTGAT GGTCCCGCCG GTGACCGTGC CCTCGACGAT CCTGCTCGAC CCCCTGCTGG AGACACTGCG TGCCGGCGGC ATGCAGATAG CGATCGTGGT TGACGAGTTC GGCGGCACCG ACGGGCTGGT CACCGCCGAG GACCTCATCG AGGAGATCGT CGGCGACGTC GTCGACGAAC ACGACCGGGT CAGCCCGCGC GCCCTGCGCC GGCGGGACGG CAGCTGGCTG CTCTCCGGCC TGCTGCGCCC GGAGGAGGCC CGCAACGTCA CGGGGATCGA CATCCCCGCG GACGACACCT ACCAGACCCT GGGCGGTCTG ATGGCCCGGG CGCTGGGACG CATCCCCCGG GCGGGCGACA CGGCGGCCGT GGAGGGCGTG CGGTACACCG TCGAGCGGAT GGACGGCCGG CGGGTCGACC GGATCCGTCT GGGCCCGATC GCACCGACGG ACGCCACGCC GGAGACGGAT GCGGATCCAC CGGTCGCGGA CCCCGCGGAT CGCCGCCCCG GCGCAGACCC GGCAGACCCC GCAGACCCCG CAGACCCCGC AGACCCCGCA GACCCCGCAG ACCCGGCAGA CCCGGCAGAC CCGGCAGACC CGGCCGCCGA GGGGGCCGAG GAGGCCGAGA CCGCCTCGGC CGGAAGGGCG GGGTGGGGAT GA
|
Protein sequence | MIEALLLVLS LVLVAACGVF VAAEFAFVTV DRPSVERAAE RGDRGARGVL TALRGLSTQL SGAQLGITVT NLMIGFLAEP AIARLLEGPI TSLGASDGLA RAASVAIALV LATGLTMVYG ELLPKNLAIA HPLGTALAVQ APQRAFTKAT GLLIRSLNIT ANAVLHRLGI SPREELASAR SAQELFSLVG RSAEHGTLSH ETATLVQRSL LFGDRTAEDV MTPRMRMRTI HADEPVSEVI TLTRRTGHSR FPVLGTDSDD VVGLIHVKNA VAVPEDARDH TPVRDVMVPP VTVPSTILLD PLLETLRAGG MQIAIVVDEF GGTDGLVTAE DLIEEIVGDV VDEHDRVSPR ALRRRDGSWL LSGLLRPEEA RNVTGIDIPA DDTYQTLGGL MARALGRIPR AGDTAAVEGV RYTVERMDGR RVDRIRLGPI APTDATPETD ADPPVADPAD RRPGADPADP ADPADPADPA DPADPADPAD PADPAAEGAE EAETASAGRA GWG
|
| |