Gene Information |
Locus tag | Franean1_1962 |
Symbol | |
ID | 5670363 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2358374 |
End bp | 2359366 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641240883 |
Product | helix-turn-helix type 11 domain-containing protein |
Protein accession | YP_001506305 |
Protein GI | 158313797 |
COG category | [K] Transcription |
COG ID | [COG2378] Predicted transcriptional regulator |
TIGRFAM ID | |
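The start and end coordinates, gene length, and protein length listed above are mutually consistent. A minimal Python sketch of that arithmetic follows, assuming 1-based inclusive coordinates and a single terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp rows of this record.
length = gene_length(2358374, 2359366)
print(length, protein_length(length))
```

With the values in this record, the sketch gives 993 bp and 330 aa, matching the Gene Length and Protein Length rows.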
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000800393 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCGCTCCA GCCGGCTGAT GGCTCTGCTG CTCCATCTGC AGGCGCACGG CCGGGCCACG GCGGGGGAGC TCGCCGCCCA GTTCGAGGTG TCCGTCCGCA CCGTGCGCCG GGACGTGGCC GCGCTCGCCG AAGCGGGTGT CCCACTGTGG TCTGAGCCCG GCCCGCACGG CGGGATCCGT CTCGTCGAGG GCTGGCGGAC GAACCTGGAC GGGCTGACCG GCGACGAGGC GTCCGCGCTG CTCATCGCCG GGGCGGGCGG GGACGTGCTC GGCGGCCTCG GCCTCGAGAC GGTCGCCGCG GCCGCGCAGA CCAAGATCCT CGCGACGCTG CCGCCGGAGC TGCGGGCACG GGCGGGCCGG GTCCGGGAAC GCTTCCACCT CGACGCGCCG GGCTGGTTCG GCTCCGAGGA GCCCGTGCCG CACCTCGCGG TCGTCGCGGG CGCGGTCTGG TCCGGGCAGC GGATCACCGT CTGTTACGGG CGGCCCGACC GGACGGTGGA GCGTTCTCTC GAGCCGCTGG GCCTCGTTCT CAAGGCCGGT GTCTGGTATC TCGTGGCCCG CGGCGGGTCC GCCGTCCGCA GCTACCGGAT CGGCCGGATC GTCGAGGCGG CGGTCCGGAG CGGGCCGGAG GGCCGCTTCA CCCGGCCCGC CGACTTCCAC CTGGCGCGGT GGTGGGCGTC GTCGAACGAG GACTTCGCGC GCTCGCTGCT GCGCTGGCCG GCGCGGCTGT GGCTGTCCCC GCGAGGCCTG CGGAGCCTGC CCGGAGTGCT CGGCCCGCTG GCCGGCCAGC GGGCGCTGGC CACGGCCGGC GAGCCCGACG CGGATGGCTG GCGGGAGGTG GAGGTCTGGT TCGAGGGCCC CGATGTCGCC GAGAGCCAGC TCTGGGCATT CGGCCCGCAC GTGCGGGTGC TCGCCCCCGA CTCCCTGCGC GAGGCCCTCG CGCGGACGGC ACAGCAGGCG GCGGCGAACA ACGGCGCTCC GAAGTCCGGC TGA
Protein sequence | MRSSRLMALL LHLQAHGRAT AGELAAQFEV SVRTVRRDVA ALAEAGVPLW SEPGPHGGIR LVEGWRTNLD GLTGDEASAL LIAGAGGDVL GGLGLETVAA AAQTKILATL PPELRARAGR VRERFHLDAP GWFGSEEPVP HLAVVAGAVW SGQRITVCYG RPDRTVERSL EPLGLVLKAG VWYLVARGGS AVRSYRIGRI VEAAVRSGPE GRFTRPADFH LARWWASSNE DFARSLLRWP ARLWLSPRGL RSLPGVLGPL AGQRALATAG EPDADGWREV EVWFEGPDVA ESQLWAFGPH VRVLAPDSLR EALARTAQQA AANNGAPKSG
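The gene and protein sequences above can be cross-checked against each other and against the GC content row. The following is a rough Python sketch under stated assumptions: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, the input is the space-grouped sequence as printed in this record, and the codon table is built from the standard NCBI amino-acid ordering, which translation table 11 shares for all codon assignments (table 11 differs in which start codons it permits, which does not matter here because the gene starts with ATG).

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI's TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
# Assumption: used here for table 11, whose codon assignments match the
# standard code; alternative start codons are not handled by this sketch.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-letter groups and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, dropping trailing stops."""
    seq = clean(seq)
    codons = (seq[i:i + 3] for i in range(0, len(seq) - 2, 3))
    return "".join(CODON_TABLE[codon] for codon in codons).rstrip("*")


# First three 10-base groups of the gene sequence in this record.
prefix = "ATGCGCTCCA GCCGGCTGAT GGCTCTGCTG"
print(translate(prefix))
```

Applied to the first three groups of the gene sequence, `translate` yields `MRSSRLMALL`, the first group of the protein sequence. Passing the full gene sequence to `gc_content` and `translate` allows the same comparison against the GC content and Protein sequence rows.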