Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_3787 |
Symbol | |
ID | 3906072 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 4537962 |
End bp | 4539242 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 637881114 |
Product | UBA/THIF-type NAD/FAD binding fold |
Protein accession | YP_482867 |
Protein GI | 86742467 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0499391 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.724918 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCCCGA TCCTCAAGCC GGGCCTGCGC CGGCTCTGGC ACAACAGCTC CACCCTGCAG CTCGGTATCG ACACCGCACG CGCCGTCATC CTCGCCGACC TGGCCCCGTC GGACGCCGCC CTGCTCGACG CCCTCGACGG CAGCCGCACG CTGGAGCAGC TGCGTCTCGA TGGCACTCCC GCCGGCCGGG TCCTGCTCGA CCTGCTCGCC GAGGCCGGTC TGCTGGACGA CGCCACGGCG CCTCCCGGTG CGGCACCGCT GCCCGCGGTC GAGCATGACC GTCTGTCGCC CGACCTGGCG GCGCTGTCCC TGGTGTCGGA GGATCCGGCC GGTGCCCGCC GGGTCCTCGC CACCCGCCGG GCGATGCGGG TCCTCGTGCG CGGCGCCGGT CGCGTCGGTG CTCAGGTCGC GGCCCTGCTC GCCGCGGCCG GGATCGGTCG GGTCGTCATC GACGATCCCG AGGTGACGAC CGCCGCCGAC GTCTCCCCGG GCGGCCTGCG CCTCGACGAC GTGGGGCGTC CCCGGTCCCT GGCCACCGGC GCGGCGGTGA CCCGGGCCAG CCGGCAGGGC GCCGGGCACC GTCGGCCCGA CGAGGACCGG TTCGCCGCGG ATCTGATCGT GCTGGCTCCG GTCGGTCTGC CGATGATCCA CCCTGGGGAG TGCCTCGATC TGGAAGGCCG AGGCACCGCC CATCTCGTGG CCGGAGTGCG GGAGACCACG GGGATCGTCG GTCCGCTCGT CGTGCCGACC GTGACCGCGT GCCTGCACTG CCAGCACCTC CACCGGTACT CCCGGAACCC GGTCTGGCCG GTGCTCGCCA TGCAGATGGT CCACCGCCCG GCCACCGGGC CGGACGCCTG CGAGATCACC CTGGCGGGTC TTGTCGCCGC GCTCACCGCC ATGCAGGCCC TGGCCTTTCT GGACCTCGCC GCCTCGCCTC CACCCGACCT ATCCGCCCTA TCTGCCCTAT CTGCCCTATC TGCCCCACCC GACCTGTCCG GCGTCACGGG ACCCGCCCCG GGCCAACATC CCGATAACTC GGACCATCTG CCACCGACAG CAAGCAGCCA TAATGAGACC TTCATTACAT TGACTTCCCC GCGCACGGTC CTACCGGTGA CCGCGGTCCT ACCGGTGACC GCCGACGGCA CGCTGGAGTT CACCGTTCCC GGCTGGCGGA TTCGCCGACG CACCTGGCCG GTCCACCCCA ACTGTCCCTG CCGCACCGCC CGATCGGCGG CCGCGTCGGC GGAGGCCTCT GCGCCGGTGG CACCCGCGTG A
|
Protein sequence | MRPILKPGLR RLWHNSSTLQ LGIDTARAVI LADLAPSDAA LLDALDGSRT LEQLRLDGTP AGRVLLDLLA EAGLLDDATA PPGAAPLPAV EHDRLSPDLA ALSLVSEDPA GARRVLATRR AMRVLVRGAG RVGAQVAALL AAAGIGRVVI DDPEVTTAAD VSPGGLRLDD VGRPRSLATG AAVTRASRQG AGHRRPDEDR FAADLIVLAP VGLPMIHPGE CLDLEGRGTA HLVAGVRETT GIVGPLVVPT VTACLHCQHL HRYSRNPVWP VLAMQMVHRP ATGPDACEIT LAGLVAALTA MQALAFLDLA ASPPPDLSAL SALSALSAPP DLSGVTGPAP GQHPDNSDHL PPTASSHNET FITLTSPRTV LPVTAVLPVT ADGTLEFTVP GWRIRRRTWP VHPNCPCRTA RSAAASAEAS APVAPA
|
| |