Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1486 |
Symbol | |
ID | 5669890 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 1778098 |
End bp | 1780215 |
Gene Length | 2118 bp |
Protein Length | 705 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641240406 |
Product | hypothetical protein |
Protein accession | YP_001505832 |
Protein GI | 158313324 |
COG category | [S] Function unknown |
COG ID | [COG2852] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCGCCGG AGGACCGGTG GGACTTTTTT CTTTCCTATG CCGCATCGGA CCGTGCCTGG GCCGAGTGGG TCGCCTGGCA GCTGGAGTCC GCGAATTATC GAGTTCTGAT CAAGGCATGG GATTCCGTGC CGGGATCGAA CTGGAGTTCG CATATTCAGG CAGGCATCGT GGGCTCGAGA CGCACGATTC TCGTGCTCTC CGCGGCCTAC CTGCGGTGTC TTGACGACGC GCCCGAATGG CAGGCGGCCA TCGCGGCGAA CCCGTCCGGA TCCGACCGGC GCCTGCTGCC CGTCCGCGTC GAGGAGTGCG AGGCGCCGGG CCTGCTGGCT CGCCTGGCCC CGATCGACCT TTTCGATGCC TCCCCCGAGT CGGCCCGGGA CACCTTGCTG CAGCTCGTCC GCCATGCCCT GGACGGCCGC GCCAAGCCCG CCGCCGAACC GGCATTCCCC CGTCAGACGC CGGCCGCGTC CGCGACTCGA CCCGCGCCGG CCTTTCCCAC CGGCACTCCG AGGCCGGCTG GCGCCCCCAG CCCGGCTGGT GTTCCCAGCC CGGCTGGTGT TCCCAGCCCG GCTGGCGTCC TGGGTAGGAC CGACAGGCCG GACCGGTTCG TCACGCCGGT GCCGAACCCC GCCGTGCGGG ACGCTGGCGT TCCCACCTCT CCGGCAGCCC GCCCGGCCCG GCCCGGCGTC GTGCTCGCCC GGCGGGAAGC CTCCGAGACC TGCGAGGCGC GGCGTCCGGC CGCCCAGGCC CGCGGCGGCG CGCGTACCTC GCGGGCCCGG CTGACCACCA CGCATTCCGG CGGGACGGCC GGAGATCCCC ACACCCCGCC ATCAACACCG TCGCCGTCGC CACCGCCCTC GCAGCAACTG CCACCGCGAC CGCCCTCGCC CGCTGGTGAG GCGAGCGCTC CTGGTTCGAA CGCCGACCGC TGGGCGACAG CGCCACCGTC CCGACCGGTC GGCGGCGCTA CCGCCCGGCC TGGCGCGTCC TGGGCCGCGT TGCCGACCGC GCGGTTGACC TGGCTCAAGG GGACTACGGA CGCCGCTGTG CGTCGCGCAC TCGACCCGCT GCCGGATGAC GCGCCCGCCG CCGTCGCCTA CCGCCCACCC GTGCTCGCCT CCATGCCCGC CCACCGGCAA GCGCTACTCG ACGAGCTGGA GACGGTGGCG ATGGCCATGC TGCCCGCCTG GCTGCCCGAG AGCGAGGCGA TCGAGTCCGC GGGTGGAGCG GCGCCGGCGG CGATACGGGA CCTCGCCTCG AAGACAGCCG ACGCCTGGGG GATCTCGTCG CGGTACCTGG CCGACCTCGC CGAGCGCTCG CTGCGTGGCG GACCGCGGCC ACGGACCAGC CGCTTTCCGC CGGATATTCG CGCGGTGGGG GCCGCCCAGG CCGTCGCCTC GGGGTTCGGC CGTGCGGACC TGGTGCTGGT CGTCGATCCA GCCGCGCAGG AATCGCGTCC TGCCGAGCAG CGAGCACTGC TGGACGCGTT GGAGTGGTTC GCTTACCAGG GCCGTGTCGG TGTCTGGCTG ACCGGTCAGC CGGACGCGCC ACACGCCGAC CGGATCCAGG CTCTGCCCTG TCCGGCGAGC CCCGCGAACA ACCGGTCCAC CTGTGCCGTC CCGGCCGGCT CGACGGCAGC GCCGGTCGCA CCGCCCACGG CGGCTCCGCG ATCCCCACGC GCGGCGCCGG GCACCTCCGC GGCGCCAGCC ATCTCCGTCC CGGGACCGCC GGCGGCGACC GTACCGGCGG TACGGGGGCG GCCACATCCG GCGAGCCGGG CGGAGCAGCA GCTCGAATGT GCGCTGAGCC GCCATGCCTG GGCGCGTGGC CGGGCCTGGA ACCAGCCGCA CCGGTCGGGA CCGCTCTCGG CGCTGATCTA CGTCGACCTG GCGTGGTTCG CGCAGCGGGT CGTCGTCGAG GTGGACGGCC CTGAGCACCG CGCAATGGCG CACTACGAAC GCGACCGCTG GCGGGACAAC GAACTCGGTC TCGAGGGCTT CACCGTCCTG CGGTTCACCA ACGACGCGGT TCTCGGCGAC GTGGACCTCG TCATCAGCCA GATCGAGCGC TGCATAACGG CACGCCGGAA ACCATCGGAA GAGGCAACCA ACCCATGA
|
Protein sequence | MSPEDRWDFF LSYAASDRAW AEWVAWQLES ANYRVLIKAW DSVPGSNWSS HIQAGIVGSR RTILVLSAAY LRCLDDAPEW QAAIAANPSG SDRRLLPVRV EECEAPGLLA RLAPIDLFDA SPESARDTLL QLVRHALDGR AKPAAEPAFP RQTPAASATR PAPAFPTGTP RPAGAPSPAG VPSPAGVPSP AGVLGRTDRP DRFVTPVPNP AVRDAGVPTS PAARPARPGV VLARREASET CEARRPAAQA RGGARTSRAR LTTTHSGGTA GDPHTPPSTP SPSPPPSQQL PPRPPSPAGE ASAPGSNADR WATAPPSRPV GGATARPGAS WAALPTARLT WLKGTTDAAV RRALDPLPDD APAAVAYRPP VLASMPAHRQ ALLDELETVA MAMLPAWLPE SEAIESAGGA APAAIRDLAS KTADAWGISS RYLADLAERS LRGGPRPRTS RFPPDIRAVG AAQAVASGFG RADLVLVVDP AAQESRPAEQ RALLDALEWF AYQGRVGVWL TGQPDAPHAD RIQALPCPAS PANNRSTCAV PAGSTAAPVA PPTAAPRSPR AAPGTSAAPA ISVPGPPAAT VPAVRGRPHP ASRAEQQLEC ALSRHAWARG RAWNQPHRSG PLSALIYVDL AWFAQRVVVE VDGPEHRAMA HYERDRWRDN ELGLEGFTVL RFTNDAVLGD VDLVISQIER CITARRKPSE EATNP
|
| |