Gene Information |
Locus tag | Francci3_2745 |
Symbol | |
ID | 3906456 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 3233966 |
End bp | 3235897 |
Gene Length | 1932 bp |
Protein Length | 643 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637880068 |
Product | hypothetical protein |
Protein accession | YP_481834 |
Protein GI | 86741434 |
COG category | [S] Function unknown |
COG ID | [COG2898] Uncharacterized conserved protein |
TIGRFAM ID | |
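
The gene length and protein length listed above are consistent with the start and end coordinates. The sketch below shows the arithmetic under two assumptions that the record does not state explicitly: the coordinates are 1-based and inclusive, and the CDS span includes the terminal stop codon. The function names are hypothetical and are used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose span includes the stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(3233966, 3235897)
print(length, protein_length_aa(length))
```

With the values from this record, the check gives 1932 bp and 643 aa, matching the Gene Length and Protein Length fields.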
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.925989 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGTGC TGCCCCCCTG GAAACAGCCG TGGGCGCCTC GGATCGCCGC GCTGCTGGCT CTGGTTGTCG GCCTGGTGGA CGTCATCTCC GCGGTCACCC CGGAGTGGCG CTCCCGCCTG GAGGACCTGC GGGTGCTCCT GCCCCCCGCG GCCTCCCAGC AGGCCGCCGC GTTCACCGTC GTCGTCGGCA TGCTGCTCGT GCTGCTGACG CCGGGGCTGC GCCGCCGCAA GCGGCGGGCT TGGCGGGCCG TGGTGGTCCT GCTCGGCGTG AGCATCGTGC TCCACGTTGC CAAGGGTCTG GACTACGAGG AGGCGGCCGG ATCGGCCGCG CTGCTCATCG CCCTGCTGCT GGTCCACGGG GAGTTCCGGG CCAAGGGCGA CCCGTCGACG CGTTGGCGGG CGCTCGGCGT CGGCCTGCTG CTCACTGTCG TCTCGATCGG GATCGGGTTC CTGCTGCTTT ATGTGCGCCA GGACCGGATC GTCGGCCCGC ATTCGCTGTC CGCCCAGTTC GAACAGATCG TCGAGGGACT CGTCGGGATC CCCGGTCCGC TACGGTTCAC CTCCGACCGG TTCTCCGGCC TCGCCGCCCG CGTGCTGCTG ACGATGGGCC TGTTGACCAT AGTGACCACC GGCTACCTCG CGCTGCGACC GCCCGAGCCC CGGCCCCGGC TGACCGACTC CGACGAGGAA CGGATACGTG CCCTGCTGGC CCGGCACGGC AAGGCCGACT CGCTGGGATA CTTCGCGCTG CGGTCGGACA AGTCGGTGAT CTGGTCACCG ACCGGCAAGG CCTGCGTGGC CTACCGGGTG GTCTCCGGCG TGATGCTGGC CAGCGGAGAC CCGCTCGGTG ACCGGGAGGC CTGGCCGGGG GCCATCAGGG AGTTCCTGCG TGAGGCGGCC GATCATGCGT GGACGCCGGC GGTCCTCGGC TGCTCGGAGG CGGGTGGCTT CGCCTGGACC CGCGCCGGGC TGTGCGCGCT GGAATTCGGT GACGAGGCCA TCGTCGACAC GGCGTCGTTC ACGCTGGAGG GCCGGGCGAT GCGTAACGTT CGCCAGGCGG TCGCCCGGGT GGAACGCGCC GGCTACACCG CGGTCGCGCG GCGGGTCGGC GATCTCGCGC CGGCCGACAT CGCCCGGCTG AAGGCACAGG CCGCTGCCTG GCGGGGTACC CAGACGGAAC GGGGCTTCTC GATGGCGCTC GGCCGCCTCG GCGGCGGCGC GGACGGCGAC TGCGTAGCCG TGATGGCGTT CTCCCATGAC GCGGGTCCCC ATGACGCGGG TCCCCACGAC GCGGACGACC CGGTGAACGG TGCTCCGGGC CACCCGGCGA ACGACACGAC AAGCGGCATC GAGGACGGCA CGTCGCATGC CGACGCGGCC GGCACCGAGC CCCGGCTGCG CGCCCTGCTG CATTTCGTGC CGTGGGGCCC CAACGGCCTG TCGCTGGACG CGATGACGCG GGACCGGACC GCCGACAACG GGCTGAACGA GTTCCTCATC GTCAGCGCCC TGCGCCAGGC CCGCGAGCTG GGCGTCGAAC GGCTGTCGCT GAACTTTGCG TTCTTCCGGT CCGCGCTCGA ACGCGGCGAG CGCCTGGGCG CCGGGCCGGT CATCCGCTGC TGGCGTTCCC TGTTAATATT CTTGTCCCGC TGGTTCCAAA TCGATAGCTT GTACCGGTTC AACGCCAAAT TTCAACCCAC CTGGCAGCCC CGTTATATCT GCTACCCCGC CAGTTCCGAG CTACCACGGA TCGCCTTGGC GATGCTCGAA GCCGAGGCCT TCCTGGTCTG GCCCTGCTGG 
CGCGACCATC TGTCCGGGCT ATCCCGGTTG TCCCGACTAT CCCGGTTGTC CCGACTGCCC CGGCCTGGGA CGGCCGGCCT CCATCACCGT GCCCGAAGAA AGGACACCTC GACCGGCCCC GGGCCGGACT GA
|
Protein sequence | MPVLPPWKQP WAPRIAALLA LVVGLVDVIS AVTPEWRSRL EDLRVLLPPA ASQQAAAFTV VVGMLLVLLT PGLRRRKRRA WRAVVVLLGV SIVLHVAKGL DYEEAAGSAA LLIALLLVHG EFRAKGDPST RWRALGVGLL LTVVSIGIGF LLLYVRQDRI VGPHSLSAQF EQIVEGLVGI PGPLRFTSDR FSGLAARVLL TMGLLTIVTT GYLALRPPEP RPRLTDSDEE RIRALLARHG KADSLGYFAL RSDKSVIWSP TGKACVAYRV VSGVMLASGD PLGDREAWPG AIREFLREAA DHAWTPAVLG CSEAGGFAWT RAGLCALEFG DEAIVDTASF TLEGRAMRNV RQAVARVERA GYTAVARRVG DLAPADIARL KAQAAAWRGT QTERGFSMAL GRLGGGADGD CVAVMAFSHD AGPHDAGPHD ADDPVNGAPG HPANDTTSGI EDGTSHADAA GTEPRLRALL HFVPWGPNGL SLDAMTRDRT ADNGLNEFLI VSALRQAREL GVERLSLNFA FFRSALERGE RLGAGPVIRC WRSLLIFLSR WFQIDSLYRF NAKFQPTWQP RYICYPASSE LPRIALAMLE AEAFLVWPCW RDHLSGLSRL SRLSRLSRLP RPGTAGLHHR ARRKDTSTGP GPD
|
| |
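
The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a block-formatted nucleotide string could be joined and sanity-checked follows. The helper names are hypothetical, the stop-codon set is the standard one used by translation table 11, and the short inputs are excerpts from the start and end of the gene sequence above rather than the full record.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def join_blocks(grouped: str) -> str:
    """Remove all whitespace between display blocks and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if seq is a whole number of codons and its last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Excerpts only: the first two blocks and the last four codons of the gene sequence.
head = join_blocks("ATGCCCGTGC TGCCCCCCTG")
tail = join_blocks("GGGCCGGACT GA")
print(head, gc_fraction(head), ends_with_stop(tail))
```

Applied to the full gene sequence, `gc_fraction` would be expected to land near the GC content field reported in the record, although the exact rounding used by the source database is not stated.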