Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2647 |
Symbol | |
ID | 5671040 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 3129662 |
End bp | 3133573 |
Gene Length | 3912 bp |
Protein Length | 1303 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641241562 |
Product | hypothetical protein |
Protein accession | YP_001506982 |
Protein GI | 158314474 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.958765 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTTGGTC GTAGCCTAAT CGAGGGTCGA CTTCCGTCAA GTTCCATCCT GACTCTCGTT AAACCGCTCG TAGGTGGCCA TACCGACGCT CTTGTCTTTC TCTGCGACAT AGCAGGAAAT CAGACTGCGG AAGAACTCAA TGGACAATTC ATCCTGAAGA TCGGGCGTGC GGAGCATAGT CAGGGTATCG CCCATCAAGA GTTTTGCCAG AATCTGGGGG AGTTTGGGAT TGATCACGTC CCGCGCCATC TAATGGGCGT GCTAGACATA GGCGTCAGTG TGGACCTGTA CGACGTTGCA GGGTTCAGCC TGGACAGCGT ACGTACCGCA GAACAGCTAG ACCACGAAGA CCTTGTAAAG GTGTGCTTCA GAGTCGCCAG CGAACTACTC CCTGCTCAAC TCCCCACAGG TCGTAGTCCA CAATACACAG AGACAGTGGG CCAGGTGTTT CATCGCTGGC TCGGCTCCAA TTTTCCTGCC AATAGGCGTG GTAAGTCACT CCGTCAGCTG GCTCGACGTA TAGGGGTGAC TGGACGAACT TTTCGATTCG ATGGCGAGTT GCTACCAAAC CCACTCTCGG TGCTAGAGCG AAACTCCACA CTTGAACGAA CAAAAATCCC GTACTTCCGA GGCTATGCGC ATGGAGATCT ACACCTCCGC AATGTGCTAG TCGGCGGTTC AATCTACACG CGCAACCTAA GTTATTGGTT GATCGACGTG AGCTGGGATG AACCGTCTCC GCTGCTATTT GATCAAGCAT ATCTTGAGCT TTCCGTTCTC CTTCACACGC TGCCAAACGC AGGTAGCGGG CGGGTACTCG GCTTGCTGAG CAAGATAGAC AACGAGCATC TAACCGTGCC CGCCAGGCTT AGCAGTACGG ACAGCGCCAT AGTCGATCTG ATAGAAGGGA TCCGCTCTAC TACGATTGAG AAGCTGGAGG AACAACAGCC GAGGCGTCGG GACGTTTGGC GTCAACAGTA CGTTCTTTCA CGTATCGCAG CAGGTCTAAA TTGGGCCGCT AAGCCGTTGG GTGATATAGC CCTTCAGCGA GCAGCCTTCA TTGCTGCTGC CTGGGCTACC AGGGGACTAC TACGCGACTT CGATGACTAC AATTCCTTGT GGAATGATCT AGCCAGAGAT GATCTTCAAA CTCAGACAGG ATTCGCCCCT CTCGGCGAAC CAGTACTTGC TGCAGAGGCC ATCGAGCGCT GGACTCCTTT TAGTGCGCTA GATACGGGAT CGGATCTCTT CCTTATCGCC GACCGTACCT CTGCTGACGA GCAGCTAAAC TCCTTCGCTG CATGCCAATG GGCAGCGGTC ATGGATCTTA ACCCCGAAAG CGATGAAACT GGTCTCGCGC GGGCGATCTT GCCATCCCTG CGATCAAGAC GCCATGTAAG CATGTTTGGT GAGAACCGGC AGTTGACCTC GCCGGGAGAG TCGACTAACT GGCTGATGGC CAACGGTTGG CGTAGCCGCA ACGAGCCGAC GGCAAGTTCA GACGCAGAAT GGAGACGGCG GGGATATCGC CCGCGGGTAA GGCAGCTTGT CGATGATATT GTGGTCGGTA CGCCGAACCG TGGTGCGGCC GTTCTATGTC TGCGCTCCGG GGATAACGAC ATGCTGATCG ATTATATTCT TGATTATATT GACGAGAAAT ATGACGGCAT CGCGGCTCGA TTGGACCTTG CTACGACGCC GGGTGCGGAA GGTCTGGACC TCGATGTCTT TCTTGCTGCT GTAGCCACGT CGCTGCCGAT CGCGAGTATC GGGCGGGAGG CTTCCATACC GGGTGCCGAT GGTCCGTTCC GCCTGGATCG TGCAGACTTG CATCGCCTAT CGGTAGATTT GGAGGTGCTC CATTCGCATG TACTTGCCGA AGGGCAGGCA ACTATTAGGG AAACGGATGC CTTCTGGCGA GGTCGTCCTC CAACATGGGC TGATCTGGAA GCCTGGATCG ATGTACAGCG CGACGCTTAT CCGGACCTTC TTGGAGAGCT GAGAGATCGC CTTGAGGATC GCCAGCTTGC ATGTGTCGAA TTTGAGCACT CACCCGGAGC TGGTGGTACA ACGCTCGCGC GTCGGGTTGC ATGGAACCTT CACAGAATTC ATCCTGTTGT TCTCCTGCGA AACTACACTC CGACGACCGT TGAACGTGTC AATGAGCTCT ACCAGGACGC GGGTCGTCCC CCTTTGGTGG TGGCGGAGTC GGCAGACCTG CCCGAGTCCG ACCGGGATGA GTTGCTCCAT GATCTGCAAC AACGGAACAG TCGTGCGGTG GTGCTTTGGG TCAACCGAAC GAACGCGCAG CGGAATCTAC GACATCAGCT AATCGATCCA GTGTCGGGCT CCGAACGGCA GCGATTCATC ACAGAGTATC TTCGCCGTGC TACGACTCCC AAAGCGCGCA GACTGCTTGG CGAGATCGCG GAAAGTGATT CGGCGTCGCT TCCAGCGCAA CGCCTATCCC CCTTCTACTT CGGACTCTGT GTTTATGAGA GTCAGTTTGA AGGCGTTGAA CCATACGTCC AATACCACAT GGCAAAGCTC GCGGGAACGC ACCTAAAAAT AGCAGGATAT CTTGCTCTGG TGACCCGGTA CGCCCAGGTT GGTATCCCAA TCGACCTCGT CAGGCGGTGG TTGGCCGAAT CGCCACCCCA GGCCGGGGGC TATGGGGATA AGGAGCTTCG CGCTCTGCTT GGACCGGACC TACGTAACCT GGTGGTCAGC GAGCGCCATG GTCTGCGTTT GCTACATCCG CTACTAGCCG AACACGTCCT TGCCGGCGAC CCAAGTAGAC CACGATTCGG ACTTGCCCAG ATCTCGGTGG AGCTCATAAG AAAAACTACC GAGTACCTTG GCCCTGAGAA CCAGGCGACT CGCCGTCTAC TTGAAGACCT ATTCGTGCGA CGCAAGGGCT GGAGTGAAGG TAGGCAGCGG CCAGATCTCT TCTCGGAGCT TGTTCAGGAT ATGCCCACGG ACGCCGCAGA ATGGGTTTTT GAGGAGCTGA CTACACGTTG TCCGAAGGAG CCCCACTTCT GGAATCACCG CGGTAGGTAT CATATCTATC GAGTAAGAGG AGATTTCGGG CGGGCCGAAG GCTTCCTTCT GCGCGCCGTG GAGGAGTCCT ACGGAAGGAA CTCTACCCAC CTTCACACGC TTGGTATGGT TCGACGGATC TGGATTGAAA ACGAAATGGA GGAGCTAGCT AAATCGAGGA TGCAAGTCAG ACCCGAACAG ATACTGGAGC ATTTCCGTCC ATTGTTTGAT AGTGCAATGG ATGCATTTGC GCGAGGCCGA GATGATCCGA ACAGCGCCCA TAGTTGGGTG ACACCTATTC AGCTCATTGC AACTGTAGTG GAATATCTGG TGAAATTTTC CGGAGCTCGA AACCTAGTAG AGTTCCTCGA AGGTCGCTAC CAAACTAGCA ATTGGGTGGG GCAGCAGCTA GCCCAGGCCG AAGTGCTTCT AGACGGGCTC CGGTCGAATT TTGCAGACGA CCGCCGTCAG GCCAAATACT ATGCTGAACT AGCCGAGCGG TTCGACTTGC TTTACGGTGA CCTCAATGCG CTAGTTGAAC AATGGAGGAA TCTTCGGCAT GCCATCAACG GGCGAGCCGC CGGACTCGGT GTCGCAATCG CTCGAACTTT GTACGCCCAT GCAGGCCGAG ATCTTTCCAG GCTTTCCGAG GATGAGACGC GAGAGATTGT TTCTATGGTC GAAGGGTTGG TGGAATCCGG CGAGGCGACC GATGCCGATC TGCGCCTGTG GTATCAGGCG TATCGTAGAC TTCCTGAGTA CTCGGAGACG AGGTCTCTGG AGCGGTTCAG TTGGTACGCG TCAACGCGCG GCAGCTTTGG ATTCAAATTA CTATCTGTAT GTAATGCATT TCTTGAGATG GTTCCGTGGT GA
|
Protein sequence | MVGRSLIEGR LPSSSILTLV KPLVGGHTDA LVFLCDIAGN QTAEELNGQF ILKIGRAEHS QGIAHQEFCQ NLGEFGIDHV PRHLMGVLDI GVSVDLYDVA GFSLDSVRTA EQLDHEDLVK VCFRVASELL PAQLPTGRSP QYTETVGQVF HRWLGSNFPA NRRGKSLRQL ARRIGVTGRT FRFDGELLPN PLSVLERNST LERTKIPYFR GYAHGDLHLR NVLVGGSIYT RNLSYWLIDV SWDEPSPLLF DQAYLELSVL LHTLPNAGSG RVLGLLSKID NEHLTVPARL SSTDSAIVDL IEGIRSTTIE KLEEQQPRRR DVWRQQYVLS RIAAGLNWAA KPLGDIALQR AAFIAAAWAT RGLLRDFDDY NSLWNDLARD DLQTQTGFAP LGEPVLAAEA IERWTPFSAL DTGSDLFLIA DRTSADEQLN SFAACQWAAV MDLNPESDET GLARAILPSL RSRRHVSMFG ENRQLTSPGE STNWLMANGW RSRNEPTASS DAEWRRRGYR PRVRQLVDDI VVGTPNRGAA VLCLRSGDND MLIDYILDYI DEKYDGIAAR LDLATTPGAE GLDLDVFLAA VATSLPIASI GREASIPGAD GPFRLDRADL HRLSVDLEVL HSHVLAEGQA TIRETDAFWR GRPPTWADLE AWIDVQRDAY PDLLGELRDR LEDRQLACVE FEHSPGAGGT TLARRVAWNL HRIHPVVLLR NYTPTTVERV NELYQDAGRP PLVVAESADL PESDRDELLH DLQQRNSRAV VLWVNRTNAQ RNLRHQLIDP VSGSERQRFI TEYLRRATTP KARRLLGEIA ESDSASLPAQ RLSPFYFGLC VYESQFEGVE PYVQYHMAKL AGTHLKIAGY LALVTRYAQV GIPIDLVRRW LAESPPQAGG YGDKELRALL GPDLRNLVVS ERHGLRLLHP LLAEHVLAGD PSRPRFGLAQ ISVELIRKTT EYLGPENQAT RRLLEDLFVR RKGWSEGRQR PDLFSELVQD MPTDAAEWVF EELTTRCPKE PHFWNHRGRY HIYRVRGDFG RAEGFLLRAV EESYGRNSTH LHTLGMVRRI WIENEMEELA KSRMQVRPEQ ILEHFRPLFD SAMDAFARGR DDPNSAHSWV TPIQLIATVV EYLVKFSGAR NLVEFLEGRY QTSNWVGQQL AQAEVLLDGL RSNFADDRRQ AKYYAELAER FDLLYGDLNA LVEQWRNLRH AINGRAAGLG VAIARTLYAH AGRDLSRLSE DETREIVSMV EGLVESGEAT DADLRLWYQA YRRLPEYSET RSLERFSWYA STRGSFGFKL LSVCNAFLEM VPW
|
| |