Gene Information |
Locus tag | Franean1_5465 |
Symbol | |
ID | 5673796 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6609507 |
End bp | 6611834 |
Gene Length | 2328 bp |
Protein Length | 775 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641244320 |
Product | SARP family transcriptional regulator |
Protein accession | YP_001509726 |
Protein GI | 158317218 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCGCGG GCAAGGTGCT GGAACCTCTC GGAGCTGAGC CGGTGGCGGC GCATTCTTTG TTTGTCGACG CGGTGACCGG CGGCGTCACC GTGTTTGTCG CGCCACCCGG CACGCTGCTC ACCGACGGGC TCGCGGCCGC GCTGGAGGCG GTCGGCCGGA CACCGGTCTG GCTCCGGCTG GGCGTCGAGG ACCGCGATCC GGCGACCTTC CTGGCAACGA TGATCACCGC GGTGCGGCGC AGGGCCCCCG GGTTCGGCGC CACCCTGCTC ACGCGGGTGC GGGCGTGGCC GGGCCCGGTG CACGGGTGGG ACGACATCTT CCGGCGGCTC GGCGCGGAGC TGGCCGAGCG GCTCCCGCCC GACTCGGCGC TGGTCCTGGA GCATGCAGAC CGGATCGATC CGCACGGGCC GTCGTTACGC CTCGCGTGCG GCCATGTTCT GGGCGCTCTG CCCGAGGCGA CGCCCCGCGT CCTGATCGGC CACGACGCGC TCCCGAGCAC CGAGGCGCTC GGGGACGCCC GCCGGCCGCC GGCCGAGCAC GCCCGGCTGC ACCCGTCGGA GGTGGAGCTC CCCGGCAGGC TCGGCCTCTG GCGGCGCCAC GCGGGGGTGC TCCTCTGCCG GTCCGCCGCC CTCCGGCACG CTGTCGACTT CGTGTGCGCC ACGATGCCGG AAGGTGATCT GGCGCGGGCC CTACGCGGCC GGACACTCGT CGGTGTGCTC ACCGCGCTGG CCCGAGCAGT CCTCGCGGGA GCCACCGGTG CGGGCCGTCG CATGCTCGCG CGGCTGCTGC ACGTCGAGTA CATCGTCCCC GAGGCGGACC GGCATCAACT CGCCGAGGTG GCGGGACCAT GGTTCCAGCC GTTGATCGGC GACTGGTCAC GACTGCGCAC CGTCTGGCGT GCGCCGCTGC TCGCCGCGCT GGCCGAGGAC GCGTCGCTGG ACCGCGAGAC CCTGCGCGGC ACCGCGCTCG AGCTTCTGCG GGCAGGTGCT CCGGAACGGG CGATCCCCAT GCTGCTCGAG TTCGACGATC GCGCGGTGGC GGCGCGGGCG CTGGCCAGGG TCGCCGCCGA CATGATCGCC GCCGGCCAGT GGGTGACAGT CGGCGGATGG CTGGACCAAC TGCCGCCCGA GGCTGTCGAG GCGGAACCGG ATCTGCTCGA CGCGGCCGCG CACGTCGCCT CGGCCCGGGG CCAGCAGGAC CCCGCCCGTC ATCGGTCCCA CGCGGCGGCC GCCGCGGTCT CCGAGACGGA CCGGCTGGCC CGGCGGCTGG CGGACATCCG GCGGGACCGG CGCCTGCATG CCGAGGCCGA GGCCGCGCTC GCCCGGTCTG AGGACGAGAC GACGGCGCTC CTGCACGCGC ACATCGCGAG CGTGTCCGAC GGCCCCGGCC CCAGGGCGCC GAGGGCCGAC GGCGCCACGC CGGGCTCCGT GTCGTTCGGC CTGGTGCCGG GGCCCAGGCA CCCGGGTCCG GAGGCGATCC GCCGGGTCGA CTCGCCCACG GGTCCGCCCA CGGGTCCGCC CGCGGGCCGG CTGAACGTGG GCGGACCCGT CCCCCCGGAC GCGGCGGGGC GGGTGGAGAT GGCCGTGCAC GTGCTGGGGC CACTGTCGGT GATGGTGGAC GGGCGGCCTG TGCGGGGGTG GGGGGCCCGG CCCCGGTCCC TGCTGGCTTA CCTGGTGATC CACCGGGCGG ACCTGCCCCC GCGAGAGGTC GTCACCGAGG CGCTGTGGCC CGGCGCGGAC CTGCCCGCAG CGCGGAACAA CATTCAGGTG GCCGTGTACG GCGCGCGGCG GGCCCTGCGT 
GAGGCCGTCG ACCGGCAGGT CATCGTGTTC GAGCGCGGCG TGTACCGGCT GGCCTCCGAC ATCGCGGTGG CCGTCGACCT CGACGAGTTC GACGGCCATG TCCAGGCCGG GCAGCGGCTC GCCGGCGCGG GGCACGTCGA GCACGCCATC GCCGAGCTGG AGGCGGCGAC CGCGCTGTAC CGGGGCGACT TCCTCGCCGA CGCGCGTTCC GAAGACTGGG CCGTGCTGCG CCGCGAACAG CTCAGGCTGG CCTACCTGGA GGCGCTGGAC CGGCTCAGCT CGCTGTACCT GGAGACCAGG CAGTACTCGG TCTGCGCGCT GCTGTGCCGG CAGATCCTGG AACGCGACCC GTGCCGGGAG GACGCCCACC GCCGCCTCAT GCGCTCCTAC GCCCGGCAGG GGCAGCCGCA CCTGGCGCTG TTGCAGTTCC GGACCTGCGC CGACGTCCTC GCCCGCGAGC TGCGCGTCGC GCCGGGACCG GCGACAGCGC GCCTGCACGA GCGGATCCGC CGGCACGAGC CCGTCTGA
Protein sequence | MLAGKVLEPL GAEPVAAHSL FVDAVTGGVT VFVAPPGTLL TDGLAAALEA VGRTPVWLRL GVEDRDPATF LATMITAVRR RAPGFGATLL TRVRAWPGPV HGWDDIFRRL GAELAERLPP DSALVLEHAD RIDPHGPSLR LACGHVLGAL PEATPRVLIG HDALPSTEAL GDARRPPAEH ARLHPSEVEL PGRLGLWRRH AGVLLCRSAA LRHAVDFVCA TMPEGDLARA LRGRTLVGVL TALARAVLAG ATGAGRRMLA RLLHVEYIVP EADRHQLAEV AGPWFQPLIG DWSRLRTVWR APLLAALAED ASLDRETLRG TALELLRAGA PERAIPMLLE FDDRAVAARA LARVAADMIA AGQWVTVGGW LDQLPPEAVE AEPDLLDAAA HVASARGQQD PARHRSHAAA AAVSETDRLA RRLADIRRDR RLHAEAEAAL ARSEDETTAL LHAHIASVSD GPGPRAPRAD GATPGSVSFG LVPGPRHPGP EAIRRVDSPT GPPTGPPAGR LNVGGPVPPD AAGRVEMAVH VLGPLSVMVD GRPVRGWGAR PRSLLAYLVI HRADLPPREV VTEALWPGAD LPAARNNIQV AVYGARRALR EAVDRQVIVF ERGVYRLASD IAVAVDLDEF DGHVQAGQRL AGAGHVEHAI AELEAATALY RGDFLADARS EDWAVLRREQ LRLAYLEALD RLSSLYLETR QYSVCALLCR QILERDPCRE DAHRRLMRSY ARQGQPHLAL LQFRTCADVL ARELRVAPGP ATARLHERIR RHEPV
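
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the source database: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon (the gene sequence above ends in TGA). It also includes a simple GC-fraction helper that ignores the spaces used to group the sequence into blocks of ten.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return (bases.count("G") + bases.count("C")) / len(bases)


if __name__ == "__main__":
    # Values copied from the Start bp and End bp fields of this record.
    length = gene_length(6609507, 6611834)
    print(length)                  # 2328, matching Gene Length
    print(protein_length(length))  # 775, matching Protein Length
```

To compare against the GC content field, pass the full Gene sequence string to gc_fraction and round the result to a whole percentage.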