Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_4233 |
Symbol | |
ID | 3907199 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 5050565 |
End bp | 5051902 |
Gene Length | 1338 bp |
Protein Length | 445 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637881559 |
Product | hypothetical protein |
Protein accession | YP_483308 |
Protein GI | 86742908 |
COG category | [S] Function unknown |
COG ID | [COG1944] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR03604] bacteriocin biosynthesis docking scaffold, SagD family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.855014 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACCTCG CCGCGATCCT CGCCCCCCGA ACCGGGATCA TCCGCCGGAT CGAACGCAAC GCCATTCCCG CCACCCTGCC GCCCGAATTC ACCATGTACA CCGCCGTTCT CTCCGACACC ACCAGGTTCT CCGCCTGGGC CAGTGACTTC GCCGGAGCCG GCTACGCCCT GCTCGACGAC GACGCCGCCC TCGGGCCCGC CGTCGGCGAG GCCGTCGAAC GCTACTGCGG GAACCTCGTC CCCGCCGGAC TGCGCCGCGC CACCCACAAA GAGCTCACGG CGGACCGCGC CACCGTGCTT GACCCGCGCT CGGTGGTGCT GTACTCACCC GCCCAGTACG CCGGGCCCTA CTTCCCGTTC ACCGAGTACC GCGAGGATCT CGAGCTGGAA TGGACGTCCG GCACCGACCT GCTTGCTGGC ACCCCCGTCT GGGTACCCGC GCAGCTCGTC TGGGTGTCCT ACGCCCACCA GGCCCAGGCC CGCGGGTTCC CCTACCTCAG CCCGGTCCTC AGCGCCGGCC TGGCCGCCGG TATGGACCAG CGCTCGGCGC AATGGTCGGC GATCTGCCAG ATGATCGAAC GTGACACGCT GACCATGGCC TGGCACGGCC GCCGGCCGCT GCGGGCCATC ACCCCGCCCC CCTGGATCGC GCAGCTTGCC ATGGGCGCGT ACGGAAGCAT GACGACCCGG TTCGTCGAAT TCCCGAACGA GTTCGGCCTC GTCGTCGTCG GCGCCCTGGT GCACGACACG GCGACCGGCT ACCTGACCAT GGGAGCAGCG TGCCGGACCA CCACCACGCC GGCGCTGCGC AAGGCCCTTG CCGAAGCCTT CCAGCTGCAG ATGTTCGTCG CCGACCTCGA CGACTCCGAC GGCCCCTACA TGCGCGCCGC ACGCAACCCG CACAGCCCCC TCAAGCCGTG GCGAGCTGAC CGACGGTACC TCGACGACTG CCGAGACGAC CTGGCGGACG TCGTCGAGTA CTGCACCCAC CTCCAGCTCT TCCTCGACTC TCGCCTGCAG GACCGGCTGG AGGCCGAACT CGCCGAGGCC CTCACCGGCA CGATCGGCTG GGAGACCCTC GACCGGGACG CCCGGCACGC CGGAGTCGAC GACCCGACGG TGCTCGCGCG CACGCTCGCC GACGCCGGCC ACCCGGTGAC ATCGGTCGAC GTCACCACCG AAGACGTGCG CCCCACCGGC ATGCGGGTCG TGCACACCTT GGCCCCCGGC CTGTACTCGA ACACCTCTGT CGGCCTTCCG TTCCTCGGCG GCGCCCGGCT GGCCAGGCAA CTCGCCGCGG CGGGCACCAC CCGGCGCGAC CTCCCGCTGC CGCATTAG
|
Protein sequence | MDLAAILAPR TGIIRRIERN AIPATLPPEF TMYTAVLSDT TRFSAWASDF AGAGYALLDD DAALGPAVGE AVERYCGNLV PAGLRRATHK ELTADRATVL DPRSVVLYSP AQYAGPYFPF TEYREDLELE WTSGTDLLAG TPVWVPAQLV WVSYAHQAQA RGFPYLSPVL SAGLAAGMDQ RSAQWSAICQ MIERDTLTMA WHGRRPLRAI TPPPWIAQLA MGAYGSMTTR FVEFPNEFGL VVVGALVHDT ATGYLTMGAA CRTTTTPALR KALAEAFQLQ MFVADLDDSD GPYMRAARNP HSPLKPWRAD RRYLDDCRDD LADVVEYCTH LQLFLDSRLQ DRLEAELAEA LTGTIGWETL DRDARHAGVD DPTVLARTLA DAGHPVTSVD VTTEDVRPTG MRVVHTLAPG LYSNTSVGLP FLGGARLARQ LAAAGTTRRD LPLPH
|
| |