Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_7187 |
Symbol | |
ID | 5675488 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8775000 |
End bp | 8778596 |
Gene Length | 3597 bp |
Protein Length | 1198 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641246024 |
Product | transcriptional regulator |
Protein accession | YP_001511412 |
Protein GI | 158318904 |
COG category | [R] General function prediction only [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family [COG3899] Predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGGGCG TGCCGTACTA CCGCCTGTTC GGGTCGATCG AGGTCGTCCG GGACGGCCGG CCGGTCCAGC TCGGCGGCCC CAAGCAGCGG GCGGTGCTGG CCGCCCTGCT CCTCGATGCC GGCCGGGTGG TCTCCGTCGA CCGCCTGGCC GGCGCCGTGT GGGGCGACGA GCATCCGCCG AGCATGCTCT CCAGCCTGCA CGCGCACATC TCCAACCTGC GGCGGCTGCT GCGCGACGAC GAGCGGGCCA CCTCGCCGAT CGTCCGCCGC ACCCCGGGCT ATCTTGCGGA CGTCCCCTCG AACGACCTGG ATCTGCGGCT GTTCGAGCGG GAGTGCGACC GGGCCCAGGC CGCCGCCGAC GCCGGGGACT GGCCGGACGC GGTGGCGGCC GCCGACCGGG CGGCGGCGCT GCGCCGGGGC CCGCTGCTCG CCGAGTTCGG CGACGAGCCG TGGGTGCGCG GCGTGGCGAA CGCGGTCGAC GAGCGGTGGG CGCAGTGCGA GCGGAGCGCG GTCGTCGGCC TGCTCGGCTC CGGGCGGATC ACCGCCGCGG TGCTGCGCTC CCGCCAGCTG GTGCACGACG CCCCGCTAGC GGAACGCGCC TGCCACCTGC ACATGATCGC GCTCTACCGG GCGGGGCGCG CGGCCGAGGC CCTCGACGCC TTCCGCGACC ACGCCCGGCG GCTGGCCGAC GAGCTCGGTC TGGAGGCCAG CCCGGCGCTG CGCGACCTGC AGGGCGCCAT CCTGCGCCAG GACCCGGCGC TGGATTCCTG GCCAGCCTCC CCCCGCACCG CCGACCGAGC CACTCCCACC ACCCCGGCAG CTCCCACCGG CCCAGCCAAC GCTTCCACCG GCCCAGCCAC CACCGCCGCC GCGGCCAACC CCGCCAACCC GATCGCCGCC AACCCGATCG CCGTCGGCGC GGCCCGCCCC GCCGCCCCGA CCACCATGGC CGGCAGCGGG GCGTCCGAAG GCGAGGGAAC GCCGGGAGCC CGGTACGGGG AGCTCGTCGG CCGGGTGCGC GAGATCGCCG TGCTCGACTC GGTGCTGTCC GAGGCGATGA CCGGGCCCGT CCGCTGGGTG GTTCTCACCG GCCCCGCGGG CATCGGCAAG AGCCGGCTCG CCGAGGAGGC CGCCGCCGGC TGGCACCGGG CCGGCGGGGC GGTCTCGCGG ACTGGGTGCC CCGACGACGA CGCCGTCCCG CCCTGGTGGC CGGTGCGCCA ACTGCTACGT GACCTCGGCG CCGACCCCGA CGATCTGCTG ACCCCACCGA GCGGCGCCGA CGCTGACGCG GCCCGGTTCG TCGTCTACGG CCGGGTGCTC GACGCGCTCT CCGAGGCCGC GCGGACCCGT CCGCTGCTGG TGGTGGCCGA GGACGTCCAC TGGGCGGACA CCGCGAGCCT GCGGCTGCTC ACCCACCTCT CCGACGCCGG GGCGTGCCCC GGGCTCGCCC TCGTCGTGAC CGCCAGGGAC GTCACCGGCC GCCCCGAGCT CGACCGGCTG CTCGCCGCCG CGGCCCGCCG GCACGGATCA CGGCGGCTGG CCGTCCCACC GCTGACCGAG GGCGAGGTGT CGGAGCTGGT CAACCGGATC AGCGGCCAGG CCATCGACGA CGCCGAAGCC GCCGAGCTGG CCGACCGGAC CAGCGGGAAC CCGTTCTTCG TCTGCGAGTA CGCCCGCCTG CCCGCCGAGG ACCGCGCCGG CGGAAAGGTA CCCGTCGCCG TGCGCTCGGT GCTCGGGCAG CGGCTCGCCG TCCTCGATCC GGCGGTGCTC CAGGTGCTGC GCGCTGCCGC CGTCATCGGC GACGTCCTGG ACATCGACCT GCTCGGCAAG GTGACCCGGC TCGATCGCGA CGAGCTCGCC GACCTGCTGG ACGAGGCCTC GGACGAACAT GTGATCGTCC AGGCCGCCGG CACCGGGCGG TACATGTTCG CGCACGCGCT GCTGCGCGAC GAGGTCGTCG CCGGGATCTC CAGCCTGCGC CGCCAGCGGC TGCACCTGCG GGTGGCCGAG GCCCTCGGCC TGGTGGACGG TGGCCCAGTC AACGGAGGCT CCGGCGGAGG CTCCGCCGGG GGCGAGGCGC TCTCCCGCCG GGCCGCGCAT CTCGCCGCCG CCTGGCCGCT GGCCGAGTCC ACCGACGTGT TCGACGCCTG CCGCGCCGCC GCCCTCGACG CGGAGCGCCG CTGGCAGTCG GAAGCCGCCG CGCACTGGTG GGGGCAGGCG CTGGACGTCC TCGACCGGAG CGCCGGTGAT CTCGACATCG ACCGGGACGA GGTGCTCGTC GCGCGGGTCA GCGCGCTCGC CCGCGCCGGG CGGGGCCAGA CGGTGCTCGA CGTCGTCGAC GCCGGCCTGC TCGACGCGGT GCGCCGCGGG CGGCTCGACT CGGCCGGCCG GCTGGCCGCC ACGCTGCTAC GGACCAGCGG ATCCTGGCCC TGGGCCGTCT ACGGCGACGA CCCGGCACCG CTGCTGGCGC GCCTCGCCGG CCTGGAGACC CTTGTCGCCG CCGACCCGGC CGCGCATGTG CGGGTGCTGG CCGCGCTCGC CGTCGGCAGC GCCTACGACC CGGACGGCTC CGTCCCCGAC CTGCTCGGCC GCCGGGCGAT CGAGCTGGCC GAACGCATCG GTGACGACGA GGCCCTGGCC GACGCGCTAC TCAGCCGGGC GCTGGCGTTC TCCGGCATCG CCGAACGGGC CGCGGAGTCC GTCGAGCTAC TCAACCGGCT GGCCACCGTC CCGCACGCCA GCGCGCAGAT CGACGAGGTG ATCGCGCACG GGCTGCTCTA CCTGGCGAAG ACGGCGCTGG GCGACCCGGG GTCCGCCGAG CACGTCCGGC TCGGCGCGCT CGGCAGCGAC CTGCTGCGGC TGCCGGCCAG CCGGGTGCAG TTCCGCTGGG CGCAGGGCTC ACTGGCGCTG TGGCGGGACG ACGACCTCTC CACGGCGGCG GAGATCTACC ACCACGCCTT CGCCCTGCAC CGGGAGACCG AGCTCTACGA GAGCGGCGTG TACCACCTCG CGCTGCTCGC TCTGTGCTGG GAGCAGGGCC GGCTGGACGA TCCGGACGAG CCGGTGCCGA TCAGCCCGTT CGTCCCGTGG GCGCCGGCGC TGACCGCGGT CGCCCGCGGC GACCCTGGGG CCGACAAGCT GCTCGCCGCG GAGATCGCGC AGGTCGAGCC GGTCACCTGG ACCACCCACG CGCGGCTGAC GATGCTCGCC CACGCGGTCG CGGACCTCGG CCTGCGCTCA CAGGTCAGCA CACTGACCGC GCGGTTGACA CCCGTCGCGC ACTGCGTCGC GAACATCGGC CAGTGCGGCT TCGTCGGCAC GGTCGCGCTG GCCCTCGCCC GGCTGGCCGC GCTGGACGGC GACCTTCCGG CCGCGCGGGG GCACCTGCGC ACCGCCGTGG AGGTCGCCAC CCGCGCGCAG GGCGTCGGCG CGCTGCTGCG CTGCCGCCTG TTCGCCGCGG AGCTCGCCTC GCTCGCCGGC GACCCCGTGG ACCTCGACGA CCTGCGCGAC GTCGCCGACC GCGCCGCACG CCGCGGCATG ATCGGCGTAG CCCGCGACGC CCGCACCCTC CTCACCCGGC ACATCGACCC GACCTGA
|
Protein sequence | MRGVPYYRLF GSIEVVRDGR PVQLGGPKQR AVLAALLLDA GRVVSVDRLA GAVWGDEHPP SMLSSLHAHI SNLRRLLRDD ERATSPIVRR TPGYLADVPS NDLDLRLFER ECDRAQAAAD AGDWPDAVAA ADRAAALRRG PLLAEFGDEP WVRGVANAVD ERWAQCERSA VVGLLGSGRI TAAVLRSRQL VHDAPLAERA CHLHMIALYR AGRAAEALDA FRDHARRLAD ELGLEASPAL RDLQGAILRQ DPALDSWPAS PRTADRATPT TPAAPTGPAN ASTGPATTAA AANPANPIAA NPIAVGAARP AAPTTMAGSG ASEGEGTPGA RYGELVGRVR EIAVLDSVLS EAMTGPVRWV VLTGPAGIGK SRLAEEAAAG WHRAGGAVSR TGCPDDDAVP PWWPVRQLLR DLGADPDDLL TPPSGADADA ARFVVYGRVL DALSEAARTR PLLVVAEDVH WADTASLRLL THLSDAGACP GLALVVTARD VTGRPELDRL LAAAARRHGS RRLAVPPLTE GEVSELVNRI SGQAIDDAEA AELADRTSGN PFFVCEYARL PAEDRAGGKV PVAVRSVLGQ RLAVLDPAVL QVLRAAAVIG DVLDIDLLGK VTRLDRDELA DLLDEASDEH VIVQAAGTGR YMFAHALLRD EVVAGISSLR RQRLHLRVAE ALGLVDGGPV NGGSGGGSAG GEALSRRAAH LAAAWPLAES TDVFDACRAA ALDAERRWQS EAAAHWWGQA LDVLDRSAGD LDIDRDEVLV ARVSALARAG RGQTVLDVVD AGLLDAVRRG RLDSAGRLAA TLLRTSGSWP WAVYGDDPAP LLARLAGLET LVAADPAAHV RVLAALAVGS AYDPDGSVPD LLGRRAIELA ERIGDDEALA DALLSRALAF SGIAERAAES VELLNRLATV PHASAQIDEV IAHGLLYLAK TALGDPGSAE HVRLGALGSD LLRLPASRVQ FRWAQGSLAL WRDDDLSTAA EIYHHAFALH RETELYESGV YHLALLALCW EQGRLDDPDE PVPISPFVPW APALTAVARG DPGADKLLAA EIAQVEPVTW TTHARLTMLA HAVADLGLRS QVSTLTARLT PVAHCVANIG QCGFVGTVAL ALARLAALDG DLPAARGHLR TAVEVATRAQ GVGALLRCRL FAAELASLAG DPVDLDDLRD VADRAARRGM IGVARDARTL LTRHIDPT
|
| |