Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5035 |
Symbol | |
ID | 5675748 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6033731 |
End bp | 6036532 |
Gene Length | 2802 bp |
Protein Length | 933 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641243886 |
Product | hypothetical protein |
Protein accession | YP_001509301 |
Protein GI | 158316793 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.103553 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCGGCGC GCCTCGTCGC TGTTGCACCG CCGCTGCTCG CGCGCTGGCG GGAGGCCCGG CAGCGCCGGC TGTGGGCGGC GGCGGACCAT CCAGCCCCCC CGGGGGCCAC CACGCCACTG GTGATCGATC CCGACCTGCT GGTCGCGGCC GACCTGCGTA ACCGGGTGGC CGGTGATCCG GCGTTCGACC TGTGGACCGC CCGCCGGCAG TGGGTCGACG CCACCTTCGC AGCCGTCCGC GCCAGCCGGG CGGCACTCGG CGGGGCTGCC CCGGCCGACG TATTCGACAG GGTCGTCGCC GACGTCGTGG GCGCACCCCT CAGCGAGCTG ACGGAGCTCG ACCAGCGGCG TCGGCGCGCG AGCTCGATTG ATGCCGAACT GGACGCGCTC CATCTGCTCC CGGCGGAGCT GGCGCGGCTG GTGCGGCTGC GTGAGATGGC CGCCACCGGC ATCGTCACCG ACGACGAGTG GCGGGAACTC GACCACCTGC TCACGCAGGT GCGTAAGCGC CAGCGGAGGG CGGACTGGCT CGCAGCAGAG GCCGCGATCG CGCTGACGCC GGAGCACTTC GTGCTGTCGA CCGGGCCGTG GTCGCCGGTT GCGTGGCGCG CCAGCGCCGT CGAGCGGGCG GATTGGCTGG ACCGCCTGCA GGCGCGGATG GACCAGGAGA GCGCGGTCCG CGAAGCCCTG CGGACGGCGG TCAGCGAGGT CGAGCAGGCG GCGCTGCCCC GGTGCCGGGA CGGCCTGATC TCGCTGCTCG GCACGACCGG ACCGGCCGAG GAGGCTGACC GGCTGACCGA ACTGCTGCTG GTCGACTTCA GCGGCGGCGG CACGGAACTG ACGACCCGTG TGGACCAGGC AATCGAGACG GTGCAGTTGC TGTTCCTCGG CCTGCGGTCC GGCCGACTGC CGTCGGCGCA CCCGGCGCTG GCCTGGACCA TCGAGCCGGG CGGCGCGGCC AGTCTCGACG AGGAATGGGC GTGGATGGGG GGCTACGACT CCTGGCGCGC GGCCCTGTTC CTGTTCACCT ATCCCCAGGA CCTGCTCGCC CCCTCGGTCC GCGACACCAG CGCCGCGGTC CCCGAGACGG CCCGCCCCAC GCCCCCCTTC CGAACGCTGG TCACCGCGCT GCGCGGCCAG CAGCGGCCAC GTCGCGACAT CGTGCTGACC GCGGCAAACG AGTTCCTGGC GACGATCCGG CCGACGCTGG TGAACTTCCC GATCCGGCCC CTGCCCTACA CCGCCGGCGA CGAGGGCCTG CGCGACGGCC TGCTGGCGTT CGGGTACACC GACCGTCCGA CCGCGGCTGA CCTGGCCCGC ATCCGCGACC TGTCCGCGCG TGTGCTGATC ACCCTTGCGG TCGGACACCC GGGACTTCCG AACTACCTGC AGGAGCTCTT CTACTATGTG CCGCTGCTGG TGGCGCAGAA CCTGCAACGT GGCGGCGACT ACGTGGCCGC GCTCGACTGG TACCGCACGA TCTACGCCTT CGACCAGACT GACCTGCCCG ACTTCTTCGT ACCCTCGGAC GAACGCAAGA TCTACTACGG GCTTCAGCAG GAAGGGAACG CGCCGGACGC CCTGTCGCGT GGCATCCACT GGCTGCGCGA CGAGGTCAAT CCGCACGCCC TGGCCGGTCA GTGGTCCAAT CCCTACACGC GGTACACCTT CACCTCGATC GCGCGCTGCT TCCTGGAATA CGGGGACTTC GAGTTCACCC GGGACACCGG CGAGTCCCGG GCCAGGGCCC GGTCGTTGTA CCTGACGGCG CGGTCGCTGC TGCGCGCGCC GGAGCTGGCG AACCTCGCGG GCCCGGGCCT GGGCGTGGCG GTCCCGAGCC CCGTGCTGAC CGCGTTACTC ACCCGGGTTG ACAACCAGCT CCGCAAGCTT CGGCAGGGCC GTAACATCGC CGGCCTGCGC CGTCCGGTCG AGCCTCCGGA CGCGGCCGCG TCGACGTCGG CCGGCATGCC CTATATCGGC GCTGGCGGTC AGCTGGTCAT CCCCGCGCTG AACCCACCGC GACCCACGCC GTACCACTTC ACGGTGTTGC TGGAGCGAGC CCGGCAGCTT GCCGCCACCG CCGCCCAGGT GGAGGCGTCC TACCTGAGCG CGCTGGAGAA GCGCGACGCC GAGGCCTACG GCCGGTTGAC CGCCGGCCTG GATCTGGACG TCGCTCGGGC CGGCGAGGCC CTCCAGTCGT TGCGGGTCAG CGAGGCCCAG AAGGGCACCG AGCTTGCCCG CCGGCAGAAG GCCGCCAGCG ACGTACGCGC GACCACGTTC CAGCAGTGGA TCGACGACGG CCCGAACGAA TGGGAGCGCA GCCTCGTCCG GGACTACGAC GAGGCCCGGG TCTACCGTGA CTGGATGGTG GGACTGGACG CGGCCATCAC GGCGGCGCAG GCTGTCGCCT CCGTCTCCTC CATCCCGGGT GCGGCCGCCG CGTCGACGGT TGGCGGGCTT GCCGTGGGCC GGGCCGTGAA CGCGACGAGC CTCAACCGCA CCGAGCAGGA CATCGCGCTG AACACGCTGC GCGCCAGCCA GGAACGCCGC CAGGACGAGT GGGAGCTCCA GCTCGCGACG GCCACCCAGG ACGGCCTGGT CGCGCAGGAG GCGATCCTGC TCGCGCTCGA CCACGAGGCC ATCGTCGGAC AGGAGGCCCG GATCGCCGGC CTCCAGGCGT TCGAGGCCCA GTCGGTGACC GACTTCCTGA CCCGCAAGTT CACCAGTGCC GAGCTCTACG AATGGATGAG CGGCGTCCTG GGGAACACTT CCTCCAGCAG GCCACCGGCA CGGCGCTGCT AG
|
Protein sequence | MSARLVAVAP PLLARWREAR QRRLWAAADH PAPPGATTPL VIDPDLLVAA DLRNRVAGDP AFDLWTARRQ WVDATFAAVR ASRAALGGAA PADVFDRVVA DVVGAPLSEL TELDQRRRRA SSIDAELDAL HLLPAELARL VRLREMAATG IVTDDEWREL DHLLTQVRKR QRRADWLAAE AAIALTPEHF VLSTGPWSPV AWRASAVERA DWLDRLQARM DQESAVREAL RTAVSEVEQA ALPRCRDGLI SLLGTTGPAE EADRLTELLL VDFSGGGTEL TTRVDQAIET VQLLFLGLRS GRLPSAHPAL AWTIEPGGAA SLDEEWAWMG GYDSWRAALF LFTYPQDLLA PSVRDTSAAV PETARPTPPF RTLVTALRGQ QRPRRDIVLT AANEFLATIR PTLVNFPIRP LPYTAGDEGL RDGLLAFGYT DRPTAADLAR IRDLSARVLI TLAVGHPGLP NYLQELFYYV PLLVAQNLQR GGDYVAALDW YRTIYAFDQT DLPDFFVPSD ERKIYYGLQQ EGNAPDALSR GIHWLRDEVN PHALAGQWSN PYTRYTFTSI ARCFLEYGDF EFTRDTGESR ARARSLYLTA RSLLRAPELA NLAGPGLGVA VPSPVLTALL TRVDNQLRKL RQGRNIAGLR RPVEPPDAAA STSAGMPYIG AGGQLVIPAL NPPRPTPYHF TVLLERARQL AATAAQVEAS YLSALEKRDA EAYGRLTAGL DLDVARAGEA LQSLRVSEAQ KGTELARRQK AASDVRATTF QQWIDDGPNE WERSLVRDYD EARVYRDWMV GLDAAITAAQ AVASVSSIPG AAAASTVGGL AVGRAVNATS LNRTEQDIAL NTLRASQERR QDEWELQLAT ATQDGLVAQE AILLALDHEA IVGQEARIAG LQAFEAQSVT DFLTRKFTSA ELYEWMSGVL GNTSSSRPPA RRC
|
| |