Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4217 |
Symbol | |
ID | 5672572 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5023016 |
End bp | 5024065 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641243090 |
Product | LacI family transcription regulator |
Protein accession | YP_001508507 |
Protein GI | 158315999 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.413404 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCGCTCG AGGACCGACG GGGCGAAGGG CTGCCGGGCC CGGCCGCGGT GCCGGAGCCG GCCCGGCCCA CCAGCGCGGA TGTGGCCCGG CGCGCCGGTG TGTCGCGGGC AACGGTCTCC CACGTCCTCA ACGGCACCGA CCACCCGGTC AGTGAGCAGA CCCGGCAGCG GGTCCTGGCG GCGGCGCGGG AGCTTCAGTA CGCCCCGAAC GCGTCGGCGC GGGCGCTGCG CGCCGGGCGC AGCAACATCG TCCTGGTGCC CATGTCGCGG TCGACGACCG TCCCCGGCCA GGACAGGTTC CTCGAGAACC TCGACCGGGA GCTCGCGAAG CGTGACCTGG TCCTGCTGAT GCACGGCGAC CGCACCACTA CCGGGATGAG CGGAGCCTAC GCATGGGCCG AGCTTCGGCC CGCCGCGGTG TACGTCAACG TGTCCCGCGG TACCCGGACC GCCGTGGAGC TGCTGCGGCG TGCGGGCGTG CGGGCGATTC TGCTCACCGG CGTCCCGAGC GTGACCTACG CGCCCACCGT GCCCCTCGAT CCCGCCGCGG TGGCCAGCCT CGCGACACGG CACCTGCTCA CCCGGGGCTG CCGGCGCCTG GCCTGCCTGG TGCCCCATGG GGCACTGGCG GAGGTGCCCG AGCGGCGTTA CGACGCGGTC CTGCAGGTCG CGACAACCGA GAACGTGCCG GTCGAACGGG TCGACTGCGA GTTGTCCCCG GACAGCATCG CGGCCGTGGT GGACCGATGG CGCGATCCGG AGCACCGACC GGACGCCGTG TACGCCAACA GCGACCAGTT CGCCATCCTG CTCATCGACA AGCTGCGCGA TGCCGGCCTG GAGGTTCCCC GGGACATCGC GGTGGTCGGT TCCGGCGACC ACCCGCTGGG CGCCGCGCTG CGGCCCGCGA TCACCACCAC CACCTTCGAC GTGCCGGCGA TAGCGCGGGT GGTCGCGTCC TCGATCCGAC GGCTACTCGA CGGACAGGAC CTGGACCCGG ACATGGCGGT CGCCCTGCGA CCACACCTGA TCAGCAGGGA GTCCGGATGA
|
Protein sequence | MPLEDRRGEG LPGPAAVPEP ARPTSADVAR RAGVSRATVS HVLNGTDHPV SEQTRQRVLA AARELQYAPN ASARALRAGR SNIVLVPMSR STTVPGQDRF LENLDRELAK RDLVLLMHGD RTTTGMSGAY AWAELRPAAV YVNVSRGTRT AVELLRRAGV RAILLTGVPS VTYAPTVPLD PAAVASLATR HLLTRGCRRL ACLVPHGALA EVPERRYDAV LQVATTENVP VERVDCELSP DSIAAVVDRW RDPEHRPDAV YANSDQFAIL LIDKLRDAGL EVPRDIAVVG SGDHPLGAAL RPAITTTTFD VPAIARVVAS SIRRLLDGQD LDPDMAVALR PHLISRESG
|
| |