Gene Information |
Locus tag | Franean1_2818 |
Symbol | |
ID | 5671207 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 3334629 |
End bp | 3336782 |
Gene Length | 2154 bp |
Protein Length | 717 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 641241727 |
Product | hypothetical protein |
Protein accession | YP_001507147 |
Protein GI | 158314639 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
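The coordinate, gene length, and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes the stop codon; neither is stated in the record, although both match its numbers. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Codon count minus one, assuming the CDS ends in a single stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(3334629, 3336782)
print(length, protein_length_aa(length))  # 2154 717
```

The strand field does not affect these lengths; it only determines which strand of the replicon is read.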
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.245022 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAGAA GCTCACCCGA TGTCGAGCCA GACAGGCGGA TCTGCCTCCC GGACTCGACC TTCGAGTTCA GCTCCGCTGC TGCGGCCGGT GCCGAGACCC TTCTCACCGG ACCTCCGCAG GCGCTCGTAC ACGCGCTGTT GGATGACCAT GAGTCGGAAT TTGTTTTGCT GGCCGCGCGT CGCATACTGT CCGCTCTTCT TGATGATTCA TTGGGCGAGC ACAGCGAACT CGGCTTGGAC GCAGTCGTCG AGATGGTGGC TACATTACGT CGTGAACTCG ATGCGAAGAT TGTCGGTATT TCCGGCGGTA ATTCGTCCGC ACGGGTCGTA ACGCTACTCG AGCGTGCGCC ATTAGCGCTG CTAGCCGGCT GCTGGCTGGA CACGGTCTCT CAGCCCGCGA CGCAGCCGGC GCTGATTGTC AATCGCCTGT TAGAGGACTG CGTAAGGCTT CGAGGTGGCG GTCATCCGAG GAAGGCCCTA ACGCATCTGC GACGCACCGC TCTAGAGTCC CAGGGCGTTC ACCTGCCTGT GCTTGGCGCC GAGGATTTTA TCTCCGCATC CCAGGCGCAC CCACTGACGG TGCGGCAGGC CATTTTCTAC CTTGCGTTGT CTCGCTTTCC CGCGACTTTT CTGCCCGAAG TCGTCGGCGT GCACTACGCG GTGTTTTCTC TAGGGGTGGA CGATCGCCTC TTGCGGATGC CACCAGCGTT GTCGGAAGCA TCGTTGCGTG CCGTTCTGGC CGAATATTTC ACTCTGGCGG ACGCGTCCGT CGACGGAGCC CGGGTTCGCC AACGACTGGC CGCAGCGGTC GGACTCGTGC TTCGTCTGGA GCGTGAGCAA GTTGATCTGC TATCGGAACT CGCCACGCGG AACGTCGATC AGTCACCGCA CAGCGAAGTC ACCGAGATCG TGCGTCGACA CCTGCCGTTC GCTGGTGAGC ATCATCGCGA TGTCGTCGTC GGTGGTCGGT CGTTGGCGGA CGTGGCTGCT TCCGATGACG CCGAGGTCGC CGAACTCATA CGCGAACTGC ATGCGTCCCC GTACGTCCGA TCGCCAACTC ACAGTGGGCC TCGGATCGTC AGGGCCATCA AGTTCGGTGG TCCAATGTTC GGCATCTTCG ATGAGGCCGA GGCAGCCGCC TTGACAGCGT GGGCGAGGGA ACCCGATGTC GCCGCTGCGG ATTCGGCGGC TCAGGTGCTC TACGGGTATG AGGTCTCCCC AAACCACGCG ATTTCCATTG GTGACGCCAT GCCGGCTGAC GTGGTCTGGG CTGACCGGGC ACCCGACCAC GATCGCCAGC TATTCCACCG CCTAGTGAAC GTTGAGAACT TTCCAAACAT CCGCCCGGTG GCGAGAGAGC GGGCTGCCCA GGTTCTCCAG GCGGCGGAGG TGTTATTCGA GCACGGCTCG TCTGGCAGGT ACACGGACGC CAGCTTCTTT CCCTACGAGT CCGGGGCTCT GCGCGAGCGC ATCGAGAGAA TTTACTTCGA TAAACGCCTT AGACGGTCTG AGGCGACGAC GGAGTTACCA TCCCGGGGCG CCGTCGTATC GAGCCGCAAG GAGCGCCTGA TCACCAATAT GGTGGACGGC TGCTGGCTTT ACAGAATTGG CGCAACAGGC CGGTACGGCA GGGACAGCGA CGGACAGCTG TTCGCCATCT ACGCCGATGA GATGGGCGGC GGGGACATCC GCAAGAATCA CATCATGCTC ATCCACGGGG CGCTCGCAGA CATGCGCATC TCGGTGCCAC ACATTAGTAA TGTCGACTTT CTTTCCCAGT GCGAGCTTCC GGACAGTTCC 
TACGCTCCTG CGATCTACCA GATCTGCCTG GCCTTGTTTC CTGACAGCTA TTACCCTGAG ATTCTCGGCT ACAACCTTGG CATGGAGATG GGAGGGATCG GCGAGCTTGG TATAAGCGAG ATCCGGCGAT TGCGCCATTA CGGATTCGAC GCCACGTATG AGGCGACCCA TCTGTCCATC GACAATATTT CCAGTGGTCA CTCCCGGCAG GCTGCGGACA TCATCGTCCG CTACCTTGAC GACGTACGCC GGGAATCGGG CGATGCCGCC GTTGCCGCAC GGTGGCGCCG TGTCTGGCGC GGCTATGCTT CGTTCGCCTA TTTCGCCGAG CGAGATTTGG TCCAGGACCT ATGA
|
Protein sequence | MKRSSPDVEP DRRICLPDST FEFSSAAAAG AETLLTGPPQ ALVHALLDDH ESEFVLLAAR RILSALLDDS LGEHSELGLD AVVEMVATLR RELDAKIVGI SGGNSSARVV TLLERAPLAL LAGCWLDTVS QPATQPALIV NRLLEDCVRL RGGGHPRKAL THLRRTALES QGVHLPVLGA EDFISASQAH PLTVRQAIFY LALSRFPATF LPEVVGVHYA VFSLGVDDRL LRMPPALSEA SLRAVLAEYF TLADASVDGA RVRQRLAAAV GLVLRLEREQ VDLLSELATR NVDQSPHSEV TEIVRRHLPF AGEHHRDVVV GGRSLADVAA SDDAEVAELI RELHASPYVR SPTHSGPRIV RAIKFGGPMF GIFDEAEAAA LTAWAREPDV AAADSAAQVL YGYEVSPNHA ISIGDAMPAD VVWADRAPDH DRQLFHRLVN VENFPNIRPV ARERAAQVLQ AAEVLFEHGS SGRYTDASFF PYESGALRER IERIYFDKRL RRSEATTELP SRGAVVSSRK ERLITNMVDG CWLYRIGATG RYGRDSDGQL FAIYADEMGG GDIRKNHIML IHGALADMRI SVPHISNVDF LSQCELPDSS YAPAIYQICL ALFPDSYYPE ILGYNLGMEM GGIGELGISE IRRLRHYGFD ATYEATHLSI DNISSGHSRQ AADIIVRYLD DVRRESGDAA VAARWRRVWR GYASFAYFAE RDLVQDL
|
| |
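The sequences above are displayed in space-separated blocks of ten characters, so they need cleaning before any computation such as the GC content field. The following is a minimal sketch using only the Python standard library; the helper names are hypothetical, and the example input is just the first two blocks of the gene sequence rather than the full 2154 bp.

```python
def clean_sequence(raw: str) -> str:
    """Remove the display spacing and line breaks, and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence shown in this record.
prefix = clean_sequence("ATGAAGAGAA GCTCACCCGA")
print(len(prefix), gc_fraction(prefix))  # 20 0.5
```

Running the same two functions over the complete gene sequence gives a value to compare with the GC content field.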