Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4872 |
Symbol | |
ID | 5673212 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5845040 |
End bp | 5846245 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641243727 |
Product | hypothetical protein |
Protein accession | YP_001509143 |
Protein GI | 158316635 |
COG category | [K] Transcription |
COG ID | [COG2378] Predicted transcriptional regulator |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.136813 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGGCTAG CCTCATCCGA TGTGTCCCGC CGACGGCTGG AGCGCCTGCT CAACCTGACG ATGTGCCTGA TGGCGACATC GCGGTTCCTC ACCGTGTCCC AGATCGGGGA GATGGTGGAG GGGTACGACC CGGGGGAGTC GGAGGAGGCG CAGGAGGCCT ACCGGCGCAT GTTCGAGCGC GACAAGCAGT ACCTGCGCGA GCTCGGTATC CCGGTCGAGA CCGGCCAGGA TTCAGCGTTC AGCGACGAGC TCGGCTACCG CATCCGCCGC GGCGACTACG CGCTCGGGGA GATCTCCCTC GACCCGGACG AGTTCGCGGC GCTCGCGCTG GCCGCCTCGC TGTGGTCGAG CGCGACCCTC GCCGCCCCGG CCGCGAGCGC GCTGCGCAAA CTGGCAGCCG CCGGTGCCCC GGTGCGCCCC GCCGGGCTGG GCGGGGCCGA TCTCCTGGCG GGACCCTCCT CGGCGCTCGG GCCCGGCGGG TTCGAGGGCT TCGAGCCGCG GGTCGACACG GCCGAGCCGG CCTTCGAGGC GGTACTCGCC GCCGTCCAGG CCGGCCGGGT GGTGCGCTTC CCCTACCGCC GTCCCGGGCA GACCGAGGAC ACCGAGCGGC ACGTGCAGCC GTGGGGGGTG CTGTCCTGGC GGGGCCGCTG GTACCTGGTC GGGCACGACC TGCGGCGGGC CGCGCCACGG GTGTTCCGGC TCTCCAGGGT GACCGGCGGA GTGCGTGCGG TCGGCCCGCC CGGCGCGTTC ACCGTCCCCC CCGATGTGGA CCTGCGGGCG ATGATCTCCT TGACGGAGCC GCCCGAGCAC ATCCGCAGCG CGTTGCTGCG GGTCCGGCGC GGGTGCGGGC ACGCGCTGCG CCGCCGGGCC CGCCCGGCGG AGGCGCCGGG TCCTGACCTG CCCATGAGCC CGGTGCCGGA CACCGACCTG CTGAGCGTCG ACTTCTCCGA CGTGGAGCGG TTCGCCCACT GGATCGTCGG CTACGGGCCT GACGTGACCG TCCTCGGTCC CGCAGACCTG CGCGCCGAGG TCATCCGCCG GCTGCGGGCC ACCCTGGAAG CCACGGCGCC CGAACGCGGC ACCCGGCCCG CCGGAGCCAC GTCCACCGAC GGTGCTCCCG GTAGCGACGG TGCTCCCGGT AGCGATGTGC CGGGCGGCGA GCACGGCGGG CGGGCCGGAT GGCTGGCGAG GACCGGGCCG CGGTGA
|
Protein sequence | MRLASSDVSR RRLERLLNLT MCLMATSRFL TVSQIGEMVE GYDPGESEEA QEAYRRMFER DKQYLRELGI PVETGQDSAF SDELGYRIRR GDYALGEISL DPDEFAALAL AASLWSSATL AAPAASALRK LAAAGAPVRP AGLGGADLLA GPSSALGPGG FEGFEPRVDT AEPAFEAVLA AVQAGRVVRF PYRRPGQTED TERHVQPWGV LSWRGRWYLV GHDLRRAAPR VFRLSRVTGG VRAVGPPGAF TVPPDVDLRA MISLTEPPEH IRSALLRVRR GCGHALRRRA RPAEAPGPDL PMSPVPDTDL LSVDFSDVER FAHWIVGYGP DVTVLGPADL RAEVIRRLRA TLEATAPERG TRPAGATSTD GAPGSDGAPG SDVPGGEHGG RAGWLARTGP R
|
| |