Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_4132 |
Symbol | |
ID | 3907097 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 4932998 |
End bp | 4934248 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637881460 |
Product | XRE family transcriptional regulator |
Protein accession | YP_483209 |
Protein GI | 86742809 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGATCG TTGGCGGGAC GGGTTCGGAT GACGCGCTGT CCGTCGGTCG ACGGCTGAAG GTTCTGCGCA CGCGGCGGGG GATGACGCGG GAGGTTCTGG GCGGGCTGGT CGGTCGCTCG GCGTCGTGGG TGAAGGCGGT GGAGACGGGT CGGTTGGCTG CTCCGAAGTT GTCGATGTTG CTTCGGTTGG CGGAAGCACT CAGGGTGCGT GACCTTGCGG AACTGACGGG TGGTCAGTCG ATCCCGGTGG TGTTGTTCTC CGGCCCAGGG CATGATCGGC TCACTGCTGT GCGGGCTGCT GTCAACAGGT TGCCGGTCTC AGCCGCCGAT CAGCCTGCGC CCTCGGTTGC AGATCTACGG GGGCGGGTGA CCTGGGCCTG GAGGGCGAGG CATGCGGCGC CGAATCATCG GGAGGTTCTC GGGGGGCTGT TGCCCGGGCT TCTGGATGAC GCGCAACGCA CTGCCCGCGC GGAGGCGGAC GGTCCGCAGC GCCGTGCGGC GCTGGCCGTG TTGGCGGAGG TGTACGCGCT GACGCAGTTT TTCGTGTCCT ACCAGCCCGC ACAGGATCTA GTCTGGCGGG TGGCGGAACG TGGTGTCTCG ACCGCGCTGG ACTCGGACGA CCTGCATGCG GTCGGGGTGG CAGCCTGGCT GATGACACAG GCGCATCGTG AGGCCGGGGA CTGGGATGCA GCGGACGTCG TGGCCAGTCA GGCGACGGCG CTACTGCGGG ACAGCCTGTC CAGCGATGAC GCCACTGACG ACGTGGCGGC GCTGTGGGGA GCGTTGCAGT TCGAGAGCGG CTACACGGCG GCGCGGCGTG GCGAGATCGG GAACGCTTGG AGGTACTGGG ACGCGGCGGA CGCCGTCGCG CGGCGACTGC CGGACGACTA CTTTCATCCT GTCACGTCGT TTTCGCAGAC GGTCATGCAC GCGCACGCCG TGACGGTTGC GGTCGAGCTG CGACAGAGTG GTGAGGGCGT ACGACAGGCG GAACGGTGGC GCGCGGCGGT GATCCCGTCC CATCCGCGGC AGGCGCGGCA TTGGATCGAG CAAGCACGGG CTTACCAGAT CGACAGGAAG TACGACGAAG CGCTTCGTCT CCTCGATCAC GCCTACGACT CGGCGCCGGA GACGATCCGG TACAACGGCC ACGCGCGGCG GATCATTCTG GAAGAGCTGG ACGCGCGGGA TGGACGGCGT CGGGAGCAGG CAAGCGAGCT GGCCCGGAAG GTGGGTCTGT TGGGGGTATA G
|
Protein sequence | MAIVGGTGSD DALSVGRRLK VLRTRRGMTR EVLGGLVGRS ASWVKAVETG RLAAPKLSML LRLAEALRVR DLAELTGGQS IPVVLFSGPG HDRLTAVRAA VNRLPVSAAD QPAPSVADLR GRVTWAWRAR HAAPNHREVL GGLLPGLLDD AQRTARAEAD GPQRRAALAV LAEVYALTQF FVSYQPAQDL VWRVAERGVS TALDSDDLHA VGVAAWLMTQ AHREAGDWDA ADVVASQATA LLRDSLSSDD ATDDVAALWG ALQFESGYTA ARRGEIGNAW RYWDAADAVA RRLPDDYFHP VTSFSQTVMH AHAVTVAVEL RQSGEGVRQA ERWRAAVIPS HPRQARHWIE QARAYQIDRK YDEALRLLDH AYDSAPETIR YNGHARRIIL EELDARDGRR REQASELARK VGLLGV
|
| |