Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_2007 |
Symbol | |
ID | 3906723 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 2356343 |
End bp | 2359210 |
Gene Length | 2868 bp |
Protein Length | 955 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637879343 |
Product | FAD linked oxidase-like |
Protein accession | YP_481110 |
Protein GI | 86740710 |
COG category | [C] Energy production and conversion |
COG ID | [COG0247] Fe-S oxidoreductase [COG0277] FAD/FMN-containing dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTCTTG AGGGGCGTCC CACCGGCGCG GAGGTAGCGG GGAGCCGCGG CGGACCTGGC CAGGGGCCGG CCGTGGCAGG CAGCCCGGTC CTGCGCTACG CGATGGCGCG GGATGCGTCG CACTATCACC TGGTGCCCTC CGCGGTGGAA CGTGTTGCCG GCGTGGGCGA TGTGGCGGGG CTGTTTGCGA GATGCCGGCG GAGTGGTTCG TACCTGACGT TCCGCTCGGG CGGAACGAGC CTCAGCGGCC AGGGCGTCAC CGACGGGATC CTGGTCGACG TGCGACACGG TTTTCAGTCG GCCGAGGTGC TCGACGGTGG TCACCGTTTG CGCGCCGAGC CTGGCGTGAC CGTCCGTGCG GCCAACGCGC GGCTGCGTCC GTACGGGCGG AAGCTCGGCC CCGACCCAGC GAGTGAGGTC GCCTGCACCC TGGGTGGAGT CATTGCCAAT AATTCCAGCG GAATGGCCTG CGGCACCGGG CAGAACGCCT ACCAGACGCT TGAGGCGATG ACCGTGGTGC TACCCAGCGG ATCCGTGATT GACACCGGTG CGCCGGACGC GGATGAACGG CTGCGCGCAC TGGAACCCGA TCTGTATGCG GGCTTGCTGC GGCTCCGCGA GCGGATCTGC CGCAACCCGG CGTCCGTGGC GACGCTGCGT CGGCAGTTCT CAATGAAGAA CACGATGGGC TACAGTCTCA ACTCGTTTCT CGACTACGAG CGTCCCGTCG AGATCCTGGC CCATCTGATG GTCGGTAGCG AAGGCACGCT CGGGTTCGTC GCCTCGGCCA CGTTCCGGAC GGTGGAGCTC TTTCCCCACG CCTCGACCGG CTTGGCGGTG TTTCGTGACC TCGCCACCGC GACCGCCGCG TTGCCCGAGC TGGTCGACGT GGGGTTCGCG ACGATCGAAC TGATGGACGC TCGGTCCCTT GCCGTTGCGC AGACCCTCGG CGCGACACCG GCTGAGATCG GCTCGCTGGA GGTGCGTGAC CATGTTGCGC TCCTGGTGGA GCTGCAGGCA GCGACCACCG AGGAGCTTGC CGACAAGGTG TCCGGGGCCG GCGGCGTGTG CGATGGGCTC GAGATCGAGT CGCCCCTGGA GTTGACCTCG GACCCGTGGC GCCGAAAGGA CCTGTGGCAT GTGCGCAAGG GTCTCTATAC GGCCGTCGCG GGAGCGCGCC AGGCGGGGAC CACGGCGTTG CTGGAGGATG TGGCGGTTCC GGTGCCGCAC CTGCGGGCAG CGTGCCAGGA ACTCACGAAG CTGTTCGACG CGCACGGCTA TCAGAACAGT GTCATCTTCG GCCATGCCAA GGACGGCAAT ATCCATTTTA TGCTCACCGA GACCTTCCGG GATCCCGCGC GGTTGGAACG CTACCACGCA TTCACCGAGA AAATGGTTGA GCTGGTGCTC GAGCACAAGG GGACGCTCAA GGCGGAGCAT GGCACCGGCC GGATCATGGC GGGGTATGTC CGTCGCCAGT ACGGCGATGA ACTGTATGAC GTCATGACGG AGGTGAAGCG GCTGTTCGAC CCGCTGGGAA TCCTCAACCC GGGGGTCGTG CTGTCCGACG ATCCTCGCTC CTATCTGCGC AATCTCAAGG ATGTGCCGAC GGTCGGCTAC GGCGCGGACA TGTGCGTGGA GTGCGGGTAC TGCGAACCGG TCTGCCCGAG CCGGACGCTG ACCCTGACTC CTCGGCAGCG GATCGCCCTG CTGCGGGAGC GTGAGGCCGC GCGGCGGGAG GGAGACGAGG GGCTCGCCGA TGAGCTGTCC GCGGCCTACC GCTATGACGT GGTCGACACC TGCGCGGTCG ACGGTATGTG CCAGACCGCT TGCCCTGTGC AGATCAACAC CGGATCACTC GTGCGAGAGC TGCGCGCCGA GAGGGTGAAC AAGGCCGAGG ACGCGCTGTG GCGCTCGGCG GCCCGCCACT GGGGGGCGAC CACCACACTG GCTGGGAAGG CGCTTTCTGC CGCCGCCGCA CTGCCGCCGA CGTTGCCGAC AGCCGCGGCC TCCCTGGCCC GCAGAACGTT GGGCACCGAC AGGATGCCTC AGTATGACGC CTCGCTGCCC CGCGGTGGGT ACCGGCGCCG GGCGGTCGCG GCTGCCGCAG AGGCGTGCGC CGTGTACTTC CCGGCGTGCG TCGGGGCCAT GTTCGGCTCG TCGTCATCCA GCGGCGGGGT CATGCCCGCG ATGCTGACGC TGTGCGCGCG TGCCGGGGTA GCGGTACGGG TACCCAGGGG TATCGCGTCG ATGTGTTGCG GTATGCCGTG GAAGTCGAAG GGTCTCAGGG GTGGCCACGA GGTCATCGGG GCCAAGGTCC TGCCGGCGCT GCTCGCGGCG ACTGACGGCG GCCGCCTGCC GGTCGTGTGC GACGCGGCGT CATGTACAGA AGGTCTGGAG GAGCTTCGCG CCGAGGCAAA GCGGCTTGGC GGCGCCTACG AGGCACTTCG TTTCGTGGAC GCGCTCGAAT TCGTGCGCGC CGAAGTCGTG GGCCGCCTCT CGGTGACCCG CCGGGTGGCG TCCCTGGTAC TGCACCCTAC GTGCTCGACC GAGCGGCGGG GCACCACGAC TTTACTCAGG GAACTCGCCG AGCTGGTCAG CGACGAGGTG GTCGTGCCGC TGGACTGGAA TTGCTGCGCG TTCGCCGGTG ATCGCGGACT TCTGCATCCC GAGCTGACTG CGGCGGCGAC GCTGAACGAG GCACGTGAGG TCAACTCCCG CGCCTTCGAG GTGCATGCGT CGGCCAACCG GACCTGCGAG ATCGGAATGT CACGCGCAAC GGGGCGCGAA TACGTCCACA TTGTCGAGGC GCTGGAGTAC GCGACTCGCC CGATCCGTGA TTCTGCCCAT CCAGGCGGTG CCGGCTGA
|
Protein sequence | MSLEGRPTGA EVAGSRGGPG QGPAVAGSPV LRYAMARDAS HYHLVPSAVE RVAGVGDVAG LFARCRRSGS YLTFRSGGTS LSGQGVTDGI LVDVRHGFQS AEVLDGGHRL RAEPGVTVRA ANARLRPYGR KLGPDPASEV ACTLGGVIAN NSSGMACGTG QNAYQTLEAM TVVLPSGSVI DTGAPDADER LRALEPDLYA GLLRLRERIC RNPASVATLR RQFSMKNTMG YSLNSFLDYE RPVEILAHLM VGSEGTLGFV ASATFRTVEL FPHASTGLAV FRDLATATAA LPELVDVGFA TIELMDARSL AVAQTLGATP AEIGSLEVRD HVALLVELQA ATTEELADKV SGAGGVCDGL EIESPLELTS DPWRRKDLWH VRKGLYTAVA GARQAGTTAL LEDVAVPVPH LRAACQELTK LFDAHGYQNS VIFGHAKDGN IHFMLTETFR DPARLERYHA FTEKMVELVL EHKGTLKAEH GTGRIMAGYV RRQYGDELYD VMTEVKRLFD PLGILNPGVV LSDDPRSYLR NLKDVPTVGY GADMCVECGY CEPVCPSRTL TLTPRQRIAL LREREAARRE GDEGLADELS AAYRYDVVDT CAVDGMCQTA CPVQINTGSL VRELRAERVN KAEDALWRSA ARHWGATTTL AGKALSAAAA LPPTLPTAAA SLARRTLGTD RMPQYDASLP RGGYRRRAVA AAAEACAVYF PACVGAMFGS SSSSGGVMPA MLTLCARAGV AVRVPRGIAS MCCGMPWKSK GLRGGHEVIG AKVLPALLAA TDGGRLPVVC DAASCTEGLE ELRAEAKRLG GAYEALRFVD ALEFVRAEVV GRLSVTRRVA SLVLHPTCST ERRGTTTLLR ELAELVSDEV VVPLDWNCCA FAGDRGLLHP ELTAAATLNE AREVNSRAFE VHASANRTCE IGMSRATGRE YVHIVEALEY ATRPIRDSAH PGGAG
|
| |