Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_4015 |
Symbol | |
ID | 3906976 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 4802257 |
End bp | 4803687 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637881344 |
Product | UBA/THIF-type NAD/FAD binding fold |
Protein accession | YP_483094 |
Protein GI | 86742694 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTCACC TCGCAGTGAA CCCACCGGCC ACCGGCTGGA GCCTGACCAT CCCGGTCAAG CTCTGGACGA CGCTGGCCGA CCACCTGTTC TCCGACGGCG ACGAGCACGG AGCCGTCATC CTCGCCGGCT ACGCGGACGG CCCCCGCGGC CCCCGCCTTT TGGCCCGCGA CGTCATACTC GCCGCCGACG GCACCGACTT CGTCGACGGC ACGACCAGCT ACCGGGCGCT CGACGCGACG TTCGTCCGCG ACCAAGCCCT CCGCGCCCGC GACGAGAAGC TCGTCTACCT CGCCGTCCAC AACCACGCCG ACATCCTCCA GCCCGGAGCC GTCGCGTTCT CGACGATCGA CATGGCCAGC CACGAACGCG GATACCCAGC CCTGCGGCAG ATCACCCGCC AGATCGTCGG TGGGCTCGTC CTCACCCCCC GGGCCGCCGC CGGCGACCTT TGGCTCCCCG ACGGCACCCG CGCTGTCCTC GCCGAGACCG TCGTTCCGGG AAACAACATC ATCTGGCTCC GTCCACGGCC GGCACCTGCA CCTGACATCG ACCCGCGGTG GGATCGACAG GCGCTGCTGT TCGGCCCAGC CGGGCAGCAG ACCTTCGCCA GAATGCGCGT CGCGGTCGTC GGCCTCGGCG GAGCAGGGAG CATCATCACC GAACTCCTCG CACGCCTCGG CGTCGGCGAA CTCGTCCTGA TCGACGGCGA CCGGGTCGAA GCCACCAACC TGCCGAGGCT AGTCGCCGCC GAACCAGACG ACGTCGGCGA ACTCAAGGTC AATATCGCCG CGCGAAACGC GCGCCGAGCC AACCCCTCCA TCCAGATCAC GGCCATCGCC GACCGCGTCG AGCATCCTGA CGCCCGCGAC GCACTGACCA CCTGCGACTG GATCTTTCTC GCCGCAGACG CTCACTCCGC CCGACACTGG GTCAACCTCA CCGTCCACCA GTACCTGATC CCCGCCACCC AGGTCGGCGT CAAGATCCCA GTAGGTCCAG CCGGCGAGAT CGGTGAGATC CACACCGTGG CCCGTCTACT GCTGCCCGCC GAAGGCTGCC TGTGGTGCAA CGGGCTGATC GACTCGACCC AGCTCGCGAT CGAGATGCAC TCCGCAGCCG ACCGACGCAA CGCGCAGTAC GTGCCGGAGG TCCCGGCAGC GAGCGTCATT GCGCTGAATG CGCTGCCCAC CGCCGAGGCT GTCAACCACT TCATGTTCGC CGCCGTCTGC CTCCACGACG ACCCCACCGA CAGCGCCTCG GTCCTGCATC ACCCGCGTGC GCGCGGCCGA GCCCTCCAGG ACGGCCGGCA AGATCCCGAC TGTCCTTGGT GCACCAAGGC CGGCAGCCTC GCGCGCGGCG CAGGTGACGT CACGGAGGGG GCCGCCCGGC TCGTCGGCGT GCCCCGCGCG CAGGCCCAAA GTGGCTCGTG A
|
Protein sequence | MAHLAVNPPA TGWSLTIPVK LWTTLADHLF SDGDEHGAVI LAGYADGPRG PRLLARDVIL AADGTDFVDG TTSYRALDAT FVRDQALRAR DEKLVYLAVH NHADILQPGA VAFSTIDMAS HERGYPALRQ ITRQIVGGLV LTPRAAAGDL WLPDGTRAVL AETVVPGNNI IWLRPRPAPA PDIDPRWDRQ ALLFGPAGQQ TFARMRVAVV GLGGAGSIIT ELLARLGVGE LVLIDGDRVE ATNLPRLVAA EPDDVGELKV NIAARNARRA NPSIQITAIA DRVEHPDARD ALTTCDWIFL AADAHSARHW VNLTVHQYLI PATQVGVKIP VGPAGEIGEI HTVARLLLPA EGCLWCNGLI DSTQLAIEMH SAADRRNAQY VPEVPAASVI ALNALPTAEA VNHFMFAAVC LHDDPTDSAS VLHHPRARGR ALQDGRQDPD CPWCTKAGSL ARGAGDVTEG AARLVGVPRA QAQSGS
|
| |