Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_3323 |
Symbol | |
ID | 3904109 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 3937627 |
End bp | 3939327 |
Gene Length | 1701 bp |
Protein Length | 566 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 637880648 |
Product | hypothetical protein |
Protein accession | YP_482409 |
Protein GI | 86742009 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.34323 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.797481 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGGAC CGGTCTACCA TCTCGCCCCG CGCCGAGCCG GGACCGCGCT GTTCGGCGCC AGCGTCCCGC AGGTCATCCT GATTGGCCTC GGTGTCGGCG GGCTCGCGGC GGGCCCGAGG CTGCTCGGCG GCGGTTCCGG CACGACGGCC GGGGTTGGTG TGGCGGTGGC CTGCCTGCTG TTCGCGTTCG TCCGGGTCGG CGGGGAACCC CTGGTCCATC TCCTGCCGGT CGTCGCCGGC TACCTGCTGC ACACCCGCCT GCTGCACACC CGCCTGCTGC ACACCCGCCT GCTGCACACC CGCCTGCTGC ACACCCGCCT CCTGCACACC TGCGGCGGAT GCCGGCCGTG CGCCGTCCCC TCCTCCGCTC GGCCGACGGG CATCGGGGCG GGGGCGGCCG GGTGGGGATC CGCCGGCCGC CGGTCCGAAC GGGTGGATCT TCCCGCGGTG CCGCGGTCGG TCGAGGTGGT AGCGGCGGCG ACCGGTTGCC GGGTACCAAC CGCAGACGGG CAACCGGCCG GTCTGGTTCG TGACCGGCGC ACCGGGACGA TCACCGTGGT CCTCGACGTG CGCGGTGGCC CGTTCGGACT GCTCGACGGC GCCGGGAAGG ACCGCCAGAC CGCCGGATGG GCCCGGGTGC TGACCCAGTT CGCCCGGGAA ACCCCGGTGG CCCGGCTCGG CTGGACGGTC CGCTCCGGAC CGGCGACCGC CCTGGACCTC CCCGTCGAAC CCCGCCGCCA GCCGGAATCC GCGGCGGCGG CCCGGTCGCG GCAGCCGGCG CGCCCGCCGG CGGGGGAGCT GCTGGCCTAC CGGCGGCTCC TCGCCGAGGC ACAGCCCGCC CTGATCCGCC ACGACCTGCG GCTATGGCTG ACCGTCCGCC CGACACGCGG AGGCCGCCAC GCCGACGGCC GGGCCACCGC GCTGGCCGCC GCCGAAACCC TGGCCGACCG ATGCGCCAGC GCCGGCCTGC ACGTCCGCGG CCTGCTGTCC ACCGCCGAGC TCACCAAGAC CGTGCTCGAC CACGCCGACC CGCCGCCTCC CGAAGCCTCG AAGGCCCTCG AAGCCCCCAG TCGTGCGGCG GAGCCGGACA GCGCATCGAC TCCGGGCCTG GCGGCCCGCG CCCATCTCCC GGGAGCCGGC ACACCGCGAC CGCCGCAGCG CCTCCAGCCG GACAGCCTCA CGCTGCGGGC GTGGTGGGAC GCGGCCCGGA TCGGCGACAG CTGGCACCGG GTGTTCTGGA TCGCAGGCTG GCCCACCGGC GGACTGCGCC CGGGCTGGCT CGACCCCCTG CTCCATGACG TTCCCTGTGT CCGCACCCTC GCGCTCACAA TGACCCCGGT GCCGTGGCGG GTCTCCCGCC GGCGCATCAA CAGCGACACC GTCTCCGTCG ACACCGCCGT CCACCTCCGC GACCGGCATG CCTTCCGCGT CCCCGTCCAC CTCACCCAGG CCCACGACGA CATCGACCGC CGCGACGCCG AACTCACCGC CGGCTACCCC GAATACGCCT ACCTCGGCCT CCTCGACGTC ACCGCCCCCA GCCGGCACGA CCTCGACGAC GCGTCCGCCG CGATCGTCGA CCTGGCCGCC CGCTGCGGCA TCGTCGACCT GCGCCCCCTG CACGGCCGGC ATCACACCGC CTGGGCCGCC ACCCTCCCCC TCGGCCTCGC ACCCCGGCCG ACCGTCACCG GAGCACCCTG A
|
Protein sequence | MSGPVYHLAP RRAGTALFGA SVPQVILIGL GVGGLAAGPR LLGGGSGTTA GVGVAVACLL FAFVRVGGEP LVHLLPVVAG YLLHTRLLHT RLLHTRLLHT RLLHTRLLHT CGGCRPCAVP SSARPTGIGA GAAGWGSAGR RSERVDLPAV PRSVEVVAAA TGCRVPTADG QPAGLVRDRR TGTITVVLDV RGGPFGLLDG AGKDRQTAGW ARVLTQFARE TPVARLGWTV RSGPATALDL PVEPRRQPES AAAARSRQPA RPPAGELLAY RRLLAEAQPA LIRHDLRLWL TVRPTRGGRH ADGRATALAA AETLADRCAS AGLHVRGLLS TAELTKTVLD HADPPPPEAS KALEAPSRAA EPDSASTPGL AARAHLPGAG TPRPPQRLQP DSLTLRAWWD AARIGDSWHR VFWIAGWPTG GLRPGWLDPL LHDVPCVRTL ALTMTPVPWR VSRRRINSDT VSVDTAVHLR DRHAFRVPVH LTQAHDDIDR RDAELTAGYP EYAYLGLLDV TAPSRHDLDD ASAAIVDLAA RCGIVDLRPL HGRHHTAWAA TLPLGLAPRP TVTGAP
|
| |