Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5349 |
Symbol | |
ID | 5673683 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6448589 |
End bp | 6452914 |
Gene Length | 4326 bp |
Protein Length | 1441 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641244207 |
Product | hypothetical protein |
Protein accession | YP_001509613 |
Protein GI | 158317105 |
COG category | [S] Function unknown |
COG ID | [COG4995] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.598215 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCTGCAA CGCACCGCGG ACTGGCTGGC CGAGCATGCG ACCGGCCGGG TGGTCTTGGC CGGGTAGCAC CGCCCATCGC TGGCCGAACG AAACACCGCC CCACCGGACC TTCCAGGGCG GGGCTGTTCA CTGATCATCG ACTGTACGAT GACTCGCCCG GCACGGCGTT CGTGGGGACG GGGGAGTCGC CGATGGTACT GATGCCGTGG GAGCGTGAGC TGCTGGCCAG GCGGCAGGAC CGCGCCCTCG AACTGGCCGT GCGTGCGAAC GCGCTGTTGG ACGCGGCACG TCTCACGGGC GACGAAGCCG CGCTGAACAC GGCGATCGCC CTGTTCCGCC GCGCCAACGG GACCATCCCC ACCGGTCATC CCGGCCAGGC CGCGGTCAGC CACAGCCTCG GCAACGCCCT GCAGACCCGC TTCGGTTGGC ACGGGCAGGG CCGGGACCTG CAGGAGGCCG TCGACCTGCA TACCGGAGCC GTGGCTGCCA CCGCGCACGA CGATCCGGCC CGGCCCGTGG CACTGGCGAG CCTCGGCGCG GCCCTGCAGA CCCGTTTCGA ACGGATCGGC GATCTCGCCG ATCTCGACGC CAGCATCGAC CTGCAGCGAG AGGCGGTCGC CGCAGCGACC GATCCGGATC ACGCTTCCGC GCGGAGTCGC CGTCCGGCCG TATTGCCCGA CACCGCGATT CTGTGGTCCA ACCTCGGCAA CGCGCTGCGG CTCCGCTTCA CCTGGACAGG CCGCGAGGAC GACCTTGACG CAGCGCTCGG CGCCGGCTGG ACGTCGCTGG CGGAGCCGGC GTCCCGCACC ACCGACCACG CCCGGCACCT GTCCAACCTC GGCAACACCC TGGCGACCCT GTTCGAGGCG CGGGGCCGCG CGGCGAACCT GGACGAGGCG ATCACCTGCT TCCGCGACGC CGTCGCCGCC GTCTCCGCGG ATCATCCCGA CGCGGCCGGA TATCTGGCCA ATCTGTGTGG CGCGCTGTGG ACGCGGGCCC GTCACACCGG ACGCCAGACC GACCTCAACG AGGCACTGGA GCAGGGCCGA CGAGCGAAGG CCGCCATCCC GCCAGACCAC CCCGCCAGGC CCCGAATACT GGCGGACCTC AACGGCGCGC TGCAGACCCA CTTCGAGTGG ACGGGCAGCG ACACCGACCT GGACGACGCG GTCGCCGTCA GCCGTGACGC CGTCGCCGCG GCTCCGCCGG GCCATCCCGA CCGGGCCGGG CACCTGTCCA ACCTCGGCAT CGCGCTGCAG CGCCGGTTCG AACACCGAGG CGACCCGGCT GACCTCGACG AGTCGGTCGA CCTCCTGCGC GAGGCCGTGG CGGCGACCCC CGACGGACAT CTCAAAGCCA CCCCCCACCT GGCGAATCTC GCCAACCTCC TGCGTCGTCG GTTCGAGCTG ACAGGTGGCA GGGAGGATCT CGACGCCGCG GTCGGCCTGC TGCGCCGGGC GGTCGCCTCA TCGTCGGAAG GCAGCTCCGG CCACGCCGGG CTGCTGCTGA ACCTGAGCAT CGCCGAGCAG GTCCGGATCG CCTCCCAGGG GCGCGACGGA GACCTGGCCG GCCTCGACCA GGCGATCGCA CTCGTGCGCA CGGCACTCGC CGTCACGCCG CCGGACCGCC CGGATCACGC CCGGCTGCAG TCGACACTCG GCCTGGCCCT GCGGGCCCGC TTCGAGTGGA CCGGGCGGCA AACCGATGTC GCCGAGGCGA TCGGCGCCCT GGAGACCGCG GTGGCGGCGA CCCCCTCGAC GCACCCCGCC CGGGCCAGCT ATCTGTTCAA CCTCGGCTGC GCGCTGCAGA CCCGCTTCGA GCGCACCGGA CTGGCCACCG ACCTGGAACA GGCCGTGGCC GCGATGGAGA CGGCGGCCGC AGCCAGCCCC GCGCAAGGCG CGGACCGCGG CCGGATCCTC TCCGGTCTCG GCCTCGTGCT GCAGCACCAC GCCCAGCAGT CAGGCGACCG CGCCCAACTG GAGAGGTCGA TCGAGGCTCT GAGGTCAGCG GCCGCCGCGA CCCCACGGAC AGGCCCCGAG GCGGCCGGAG TCCTGTCGAA CCTCAGCATC TCGCTGTGGC GGCGCTCGGT CTGGGACGAG GATGGCAACG CCGACCTGGA CGAGGCTGCC GCGGCCGGCC GCGCAGCGGT GGACGCGGTG GTCACCGACC ACCCGCTGCA TGCCCGGTAC CTGTCGAATC TCAGCAACCT GCTGCAGACC CGTTTCGAGC GTTGCGGCGC CCCGGCCGAC CTCGACGACG CCATCGACCA TTGCCGGGCG GCGCTGGCGG CCACCCCGAC CGACCACCCG GACACCGCCC GTTACCTGGC CAGTCTGGGA CTCGCGCTGG GTCTGCGGTA CGCGCGCATG AAGCAGCCGG ACGACCTGGA TGCGGCGGTC GGCGCCGGGC GCCGGGCGGC GGCGGTCGAG GCCGCGTCGC CGCGGGTCCG CGCGCACGCC GCGCGTGGGT GGGCGATGAC CGCCGCCGCC GCAGGGCGGT GGGCCGAGGC CGCGGACGGC TTCACCGCCG CGATCGACCT GCTGGGACAG GTCGCGCCCC GCGGGCTGGC CCGCGGGGAC CAGGAGGGCC TGCTCGACGA GCAGGCCGAT CTGGGCTCCG ATGCGGCGGC CTGCTGCGTC CGCGTCGGTC ATCCCGCTCG GGCGGTGGAA CTCCTGGAAC AGGGCCGGGG AGTGCTGCTC GCACAGGCCT TGGACAGCCG CACCGACCTG GGCGCGCTCA GGGAGATCGA CCCGGCGCTG GCGGAGCGGT TCGCGCAGTT GTGCGCGCGC CTCGACCGGC CCGGCCCACT GGACGCGGGC TTCGCCAACA TCGCTGGCGC ACTGGGCGGA GCCATGGGCC TTCCCGGTGC CGGGGCTGGC GTCGATTCCG GGGCTGGGGC TGGGGCTGGT GGCATGGCTG GGTCGGCTGC CGGTGCCGAT CCGACGCTTC GTCTCGACCG GGTCGACGCG GAGGAACGCC GCCAGGCCGG CCTGGCGCTC ACCCGCACCA TCGAGGAAAT CCGTGTGCTG CCCGGCTTCG ACAGCTTCCT GCGGCCTCCC ACACTGGCCG ATCTGCGCAC TGCGGCCTCG GCCGGGCCGG TCATCCTCCC TGTGGTCTCG TCGTTCGGCT CGTACGCGTT GATCGTCACC GAACGCGGGC TGCTGGACCC GGTGGAGCTG CCCGACCTCA CCCCGGACGC GGTCAGGGAC CGGATCATCG CCTTCCTGAC CGCGCTCGAC GATCCGCAGG ACCAGGAAGG CCACAAGCGG CTGGCCGAGA CCCTTGGTTG GCTGTGGGAC GCCCTCGCCG GCCCGGTCCT CCAGCGCCTC GACGCCCTCG GATTCCTCGC CCTGACACCG GCGGCCTCCC CCGCGTGGAA GCGCCTGCGT ACGAAGTGGC AGCCGGAGGC AACCGGCCCC GAGCGGTCCG CGGGTCACGC GGCCTCCGCC GGGCGGCCGC GGCTGTGGTG GTGCGCGTCC GGCATGCTGG CCTTTCTGCC CCTGCACGCG GCCGGGCACC ACGAGACCCG GCACCGCGGC GCTCCCCACA CCGTGCTGGA CCACGCGATC AGTTCCTTCA CCCCCACGCT GCGCGCGCTC ATCCACGCCC GCCGGCCGGT GCCTGCGCAC CCTCCGGCCA CGGGCGCTCC GGCGGTCACC ACGGACGCGC GGGCAACCGG AACGCCGGTC ACGCCGGTCG CGCCGGTCGC GTCGGGACCG CCGGCCGCGT CGAGACGCAT CGTGGCCGTC GGCATGCCAC GCACTCCGGG CGCCGGCGAC CTGCCGGGCG CCGAAAAGGA GATCCACCGA CTCGAAACAC TCTTCCCGGG GCAGGTCCGC ACGCTGATCG CGCATGAAGC GACGCACGCC GCGGTGGCCG CGGCCCTCCC GCACGCCCAC TGGGCGCACT TCTCCTGTCA CGGCTACGCG GACCTGCGCA ACCCCTCGAA CAGCCGACTG CTGCTCACCG ACCACGAACG CGACCCGTTC ACGGTGGTGG ATGTCGCCCG GCTGCGCCTC GACACCGCCG TGCTCGCATT CCTGTCCGCC TGCGAGACCG GCCGACCCGG CGGCCCGGCC GACGAGGGAA TCCACTTGGC ATCGGCGTTC CAGCTGGCCG GCTTCCGGCA GGTTATCGCC ACCCTGTGGC CCGTCAGCGA CAGTGCCGCG GCCGAACTCG CCGAGGAGCT CTACGACGCC CTAGCCCTGG CACCACCCGA CTCACTGGAC GTTGCCGCTG CCCTGCACGA GGTGACGCTC GGCCTGCGCG GCGTGTGGGC CGACCAACCG GAGGTGTGGG CGTCCCACAT CCATTCCGGG GCCTGA
|
Protein sequence | MAATHRGLAG RACDRPGGLG RVAPPIAGRT KHRPTGPSRA GLFTDHRLYD DSPGTAFVGT GESPMVLMPW ERELLARRQD RALELAVRAN ALLDAARLTG DEAALNTAIA LFRRANGTIP TGHPGQAAVS HSLGNALQTR FGWHGQGRDL QEAVDLHTGA VAATAHDDPA RPVALASLGA ALQTRFERIG DLADLDASID LQREAVAAAT DPDHASARSR RPAVLPDTAI LWSNLGNALR LRFTWTGRED DLDAALGAGW TSLAEPASRT TDHARHLSNL GNTLATLFEA RGRAANLDEA ITCFRDAVAA VSADHPDAAG YLANLCGALW TRARHTGRQT DLNEALEQGR RAKAAIPPDH PARPRILADL NGALQTHFEW TGSDTDLDDA VAVSRDAVAA APPGHPDRAG HLSNLGIALQ RRFEHRGDPA DLDESVDLLR EAVAATPDGH LKATPHLANL ANLLRRRFEL TGGREDLDAA VGLLRRAVAS SSEGSSGHAG LLLNLSIAEQ VRIASQGRDG DLAGLDQAIA LVRTALAVTP PDRPDHARLQ STLGLALRAR FEWTGRQTDV AEAIGALETA VAATPSTHPA RASYLFNLGC ALQTRFERTG LATDLEQAVA AMETAAAASP AQGADRGRIL SGLGLVLQHH AQQSGDRAQL ERSIEALRSA AAATPRTGPE AAGVLSNLSI SLWRRSVWDE DGNADLDEAA AAGRAAVDAV VTDHPLHARY LSNLSNLLQT RFERCGAPAD LDDAIDHCRA ALAATPTDHP DTARYLASLG LALGLRYARM KQPDDLDAAV GAGRRAAAVE AASPRVRAHA ARGWAMTAAA AGRWAEAADG FTAAIDLLGQ VAPRGLARGD QEGLLDEQAD LGSDAAACCV RVGHPARAVE LLEQGRGVLL AQALDSRTDL GALREIDPAL AERFAQLCAR LDRPGPLDAG FANIAGALGG AMGLPGAGAG VDSGAGAGAG GMAGSAAGAD PTLRLDRVDA EERRQAGLAL TRTIEEIRVL PGFDSFLRPP TLADLRTAAS AGPVILPVVS SFGSYALIVT ERGLLDPVEL PDLTPDAVRD RIIAFLTALD DPQDQEGHKR LAETLGWLWD ALAGPVLQRL DALGFLALTP AASPAWKRLR TKWQPEATGP ERSAGHAASA GRPRLWWCAS GMLAFLPLHA AGHHETRHRG APHTVLDHAI SSFTPTLRAL IHARRPVPAH PPATGAPAVT TDARATGTPV TPVAPVASGP PAASRRIVAV GMPRTPGAGD LPGAEKEIHR LETLFPGQVR TLIAHEATHA AVAAALPHAH WAHFSCHGYA DLRNPSNSRL LLTDHERDPF TVVDVARLRL DTAVLAFLSA CETGRPGGPA DEGIHLASAF QLAGFRQVIA TLWPVSDSAA AELAEELYDA LALAPPDSLD VAAALHEVTL GLRGVWADQP EVWASHIHSG A
|
| |