Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_1886 |
Symbol | |
ID | 3906835 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 2217416 |
End bp | 2218279 |
Gene Length | 864 bp |
Protein Length | 287 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637879224 |
Product | luciferase-like |
Protein accession | YP_480991 |
Protein GI | 86740591 |
COG category | [C] Energy production and conversion |
COG ID | [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases |
TIGRFAM ID | [TIGR03619] probable F420-dependent oxidoreductase, Rv2161c family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.5944 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATATCG GGATCTTCAC CGCGGTCACT GACGAACAGA TCAGGCCTGC TCAGCTCGCC CGGGTGATCG AGGAGCGCGG CTTCGAATCA CTGTTCGTCA CCGAGCACAC CCACATCCCC GCCCGACGGG AGACACCGCA CCCCGAGCTT GGCGAGATTC CCCGCGACTA CTACCGAAAT CTTGATCCTT TCGTCAGTCT GACGGCCGCG GCGACCGCGA CCACCCGGTT GCGCGTGGGC ACCGCGGTCG CGCTCGTGGT CCAGCGGGAC CCGCTCCTGT TGGCGAAGGA AGCGGCCAGC CTGGACCTGG TCAGCGAGGG CCGTTTCGAA CTCGGCGTCG GCGCCGGCTG GCTGCGTGAG GAGATGCGCA ACCACGGCAC CGACCCGGCG ACTCGAATGG CGCTGATGCG GGAGCGGCTG GCCGCGGTGA AGGCGCTTTG GACCACCGAG GAGGCGGAGT TCCACGGTCG CTTCGTCGAT TTCGATCCGG TCTTCCAGTG GCCCAAGCCT GTGCAGCGAC CGTATCCGCC GGTCTGGATC GGGGGATGGG GCCCGACGAC CTTCCGCCGG ATCATCGCCG GGCGCGACGG CTGGCTCACT CCACCCCTTC CGGTCGACCA GCTGATCCAA GGCCTGGACG AGCTGGCCGA GGAGGCGGAC CGGGCCGGGG TGGCGACGCC ACCGGTGACC GTTCCGCTCA TGGATCCCGA CGAGGCCACG CTCGAGAGGC TGCGCGCTCG TGGTGTCCGT CGCGCCCTGT TCGGGCTGCT CACCATCACC GACGCCGACG CGACGCTGCG CGCGCTGGAC CAGCTGGGAC CGCTGGCCAG GGCCGGAGGC AGCGTCAAGC AGCCAACTAC CTGA
|
Protein sequence | MDIGIFTAVT DEQIRPAQLA RVIEERGFES LFVTEHTHIP ARRETPHPEL GEIPRDYYRN LDPFVSLTAA ATATTRLRVG TAVALVVQRD PLLLAKEAAS LDLVSEGRFE LGVGAGWLRE EMRNHGTDPA TRMALMRERL AAVKALWTTE EAEFHGRFVD FDPVFQWPKP VQRPYPPVWI GGWGPTTFRR IIAGRDGWLT PPLPVDQLIQ GLDELAEEAD RAGVATPPVT VPLMDPDEAT LERLRARGVR RALFGLLTIT DADATLRALD QLGPLARAGG SVKQPTT
|
| |