Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_2180 |
Symbol | |
ID | 3906780 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 2553665 |
End bp | 2554933 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637879513 |
Product | hypothetical protein |
Protein accession | YP_481279 |
Protein GI | 86740879 |
COG category | [S] Function unknown |
COG ID | [COG1479] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.912338 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.467765 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGGTAT CCTGTGCGAC AGCTTGGCCA CGTGGGCTGG TCGACTCGGC GAAGGCTCCC GCCACCCGCG CTGGACGGTC CCACTTGGCG GCCTGCCCTG AGTTCTTACT GACTCGGCAC CATGTCGCAC GGCTTTCCGA GGGGGTGACC GAGCTGGCTG AGCAAGCTGG CAACGACAGC GACTCCGAGG ACCTGCGCGA CGACGACAGG GCCGAGCTTC AGGAGGTGGA CCCCGACCCT CCGCAGATCA GCTACAGCGG CACCGACTTC GACGTCGAGG GTCTGGTCCG TCGGCACGAT CGGGGCGACA TAATAGTGCC GTCCTTTGGC AATGATGATC CCGGCATCGA GACAGCCGGC TTCCAGCGCG AGTTCGTTTG GAAGCGTTCG CAGATGGACC GTTTCATCGA GTCCCTACTG CTCGGATACC CCATTCCGGG TATATTCCTT GTCCAACAAC AGGACCGCCG CTATCTCGTT CTTGATGGAC AGCAGCGGAT AAAGACCCTG AGCCTCTTCT ATAATGGCAG CATCAACGGG CGCGAGTTCG CACTTCAGAA CGTGGCCGCC AGATTCCAGG GGCTGACCTA TCAAACTTTT TCACCCGAAC AGCGTCGCAC GCTCGACAAT ACCTTCATCC AGGCGACAAT AGTCAAAACC GACGGCACCC GCGAGTCACT CGACGGCGTT TATCAGATCT TCGAGCGGCT GAACTCGGGC GGTACGCAGC TCACGCCGCA CGAGATTCGC GTGGCGCTCT ATGCAGGCGA GTTCATCAAG TTCCTCACCG CTCTGAACGA AAACCCGGCG TGGCGCGCTC TCTACGGGCC GCCATCACCA CGGCTACGCG ATCAGGAGAT CGTGCTCAGA TTCATCGCCC TCTACGTGTC ACCGGGTAGC TATAAGCGCC CCCTCAAGAA ATACCTGAAC GATTTTGTTG GCGCTCACCG CCGACTGAAC GAACTGGACG CCGAGTTGAT CGAAAAACGA TTCGACAGGG CAGCACAGCT TGTGTTGGAG GAGGCCGGAA GAAGCGCCAT TCGCGGCCGG GGGCGTCAGC TCAATGCGGC TCTCACCGAG GCGCTTTTGG TAGGATTGGC CCGTAGGCTT GATGCCGGTA GCGAACCGAC CGCAGCTGAG GTCAGCCGCG CCATCGACGC GCTCCTCAAC GAACCCGACC TGGATTACGT GACCACGCGC GCAACGGCCG ACGAGGAGAG TGTGCGGATG CGCCTGGCGC TGGCAACGAG AGCTTTCTCC CGCATCTGA
|
Protein sequence | MVVSCATAWP RGLVDSAKAP ATRAGRSHLA ACPEFLLTRH HVARLSEGVT ELAEQAGNDS DSEDLRDDDR AELQEVDPDP PQISYSGTDF DVEGLVRRHD RGDIIVPSFG NDDPGIETAG FQREFVWKRS QMDRFIESLL LGYPIPGIFL VQQQDRRYLV LDGQQRIKTL SLFYNGSING REFALQNVAA RFQGLTYQTF SPEQRRTLDN TFIQATIVKT DGTRESLDGV YQIFERLNSG GTQLTPHEIR VALYAGEFIK FLTALNENPA WRALYGPPSP RLRDQEIVLR FIALYVSPGS YKRPLKKYLN DFVGAHRRLN ELDAELIEKR FDRAAQLVLE EAGRSAIRGR GRQLNAALTE ALLVGLARRL DAGSEPTAAE VSRAIDALLN EPDLDYVTTR ATADEESVRM RLALATRAFS RI
|
| |