Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_0052 |
Symbol | |
ID | 3903531 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 64448 |
End bp | 67876 |
Gene Length | 3429 bp |
Protein Length | 1142 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 637877382 |
Product | hypothetical protein |
Protein accession | YP_479175 |
Protein GI | 86738775 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03607] patatin-related protein |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGACC CGGCCACCCA GGAGGTACGG CTCGCGGTCG TCATGACCGG TGGCGCGAGC CTCGCCGTCT GGATGGGCGG GGTCGCCACC GAGATCAACC TTGCGACCAG CCCGGACCGT GACCGGGCCG ACGACGCGGA CGCCGCCGTC GCTGCCCGTT ACGCCCGGCT GGCGACCATC CTCGACGTCG AGGTCAGTGT GGACGTACTC GCCGGCACCT CGGCCGGTGG CATCAACGCC GCCATGCTGG GCTACGCGAA CACCCACCAC GCCGATCTCA CCCCGCTGCG GGACCTGTGG CTGTCCCTCG GCTCGTTTGA CGCGCTCATG CGCACCCCCC ACGAGAAGAC GTATCCCTCG CTGCTGCACG GGGACACCGC CGTGCTGCCC GCGCTGCACA CCGCCTTGAC AGCGGTGGGG GAGACGGCCC GAGCCGGCAT GGACCGAGCC GGCATGGACC GAGCCGGCAT GGACCGAGCC GGCATGGACC GAGCCGGCAT GGACCGAGCC GGCATGGACC GAGCCGGCAT GGACCGCCCG ACATCGGTCT TCATCACTAC GACGATCCTG CGCGGCGAGG TCGCCGAGCA CGCCGACTCA CTCGGCGGCA CCATGTACGA CGTCGACCAC CGGGGCCTGT TCGCCTTCGG CACGCAGGAC CTGACCGACC CTGGCGCGGT CCCCCGTCTC GCCCTGGCCG CACGGGCCAG TTCCGCCTTT CCGGGCGCCT TCGAGCCGGC CTACGTGCCG GTGGGGCACG CCGTCGACGC CGCGCACCCG GACATGGCCC GCTACGTCAA CGCCTCCCAC GGATTCCACG CATCGGACGG CGGGATCCTG GTCAACCGCC CGATCGGACC GGCGCTGGCG GCCATCTTCG ACCGGCCCGC CGAACGCCAA GTCCGTCGCG TGCTCGCCTA CGTCGTGCCC TCGCCGCGGA TGCCGGTACC GACGAATCCG GCGGCACCAA CGAATCCGGC GGCACCAACC AGGACGTCCA CACCGCCCGC CCTGCCTGTG CCACCCACGG TGATCCCCAC CCTTCTCCAG GTGCTCGGCG CGGCGCTGAA CCAGTCGATC GGTACAGATC TGGCGACGCT GCGCGACCAC AACAGCGCCG TACGCGGTAC GCGGGCGAAC CGGCGACGGC TGCTGCGCCT CGCCCCGGCC GGTGGTCCCC GATTAGCGGA CGAGAGCGTC TATGACGCCT ACCGCCGCAG CCTGGCAGAG GAGATCGCCC CACCGGTGAT CGAGGCGCTG CTGCGCGTCC TCGGTGGTCG GTTCGATCTG CCCCGTCCCC CGGACACCGA AGCGCTGCGC ACCGACGCGG CCGGACCCGC CCGGATGACG AACGCCGCGG TCAACGCCGT GATCCAGATG CTTCCCGACC GGCTGCCGAC CACGGCCGAC CTCGCCGACC TGTGGCGGCT GGGCCGACCC GCGCTCGACG CCGCCAAGGG TCTACTGATC AACATGATCA ACGAAGGGTA CGTCCTCTCC CCCGAGCCGG CGGACCGGAT CCGACTTGCC CGGCTCGCGG CGGCCGTGCA CGGCGCATCG CACGCCACCA TCCGGAACAC CGACACCACC GACACCACAG CCACCACAGC CACCGCGGCC CTGCGGCCCG GCGTCTTCAC GACGGTCAGC GAGACCCTCG GTGCGATGGC CGGGGCACCG CTGCTCGACG TCATCGCCGA GGCTGCCCGC CGCTGGTTGC GGACCGACCC GGAGGGCGAC GCGGGGCAGG ACGAGCTCAC CCGGGCCTGG CGCTCCCTGG AGCGGATGAT CGAGACCCTG CGGACCGAGC TCACCGATCT CGTGGACCGG CGGTCCCCCG CGGCCCGGCC TCCGGCGGGA CCAGAGACCA GGCCCGAGGG TCTCAGCGTC GGCCAGCGCC GGGCGGTGGC GGCCCGGACC CTCGCGGACT TCGTCGGGTA CCTCCCGTCG ACTCCCGCGG ACGCCCTGAT CGCCGTGCTC GACGTGCACC TGGTCGAACG AAGCACCGGC GCCGCGGTCC TCGACCAACC GGTTGAGCTC GTCCAGATCA GCGCGGATCT ACCGAACCGC CTCGATCCCG CCCGCGCACT GGCCGAGGAG AAGGTCACAG GCCTGCAGCT GGGAAACTTC GGCGCCTTCG CGAAGTCCTC GTGGCGGGCC AACGACTGGA TGTGGGGACG GCTGGACGGT GCGGGCTGGC TGGTCCGGAT CATGCTGGAT CCACGCCGGC TCGTGATCCG CCGGGACACC GCCGTCCCCG CCGGTCACGC CCACGCCGCG GCGCTGGCCC GCCGGGGCTG GCTGGTGGAT CTGGTCGACG ATCTCACCGA GGTGGCCGGC ATCCCGGCGC CGCGGGAGGT ACTCGACGAG CTTGGCTTTC TGACCGATCC GGACGCCCCC GTGCCGCCGA ACCTTCCGGT CACGGCGACC TGGGTCGCCG CCGGCATCCA ACGGGACATC GCGGCCAAGG AGCTGGTGGG GGTCGCCGAG GCGGTGCGGC GCGACAACAA GGCCGGGGTC GATCCGCGGC CCACCGCCGA CTTCCTCGCC GCCGTCGACC GGGCCCTGAC CGTGGAACCG AGCCGGCCCG GGACGACGAC CGGACCGGGG GCCTCGGCGG TACCGGAAGG GACAGGCTCC CCGAACCTGG ACGAGCCGCG CGGCGCCGAG GTCATCTCGC TGGCCGCGGC CGTGGCCGCC AGGATCGCCG CGGGACAGAC CGGTGCCCAC CGGCCTGCCG GCGTCGACAG TTCGGCTCTG GCGCGCCGAC CGCCCCGGCC ACCCGGGCTC AGCGGCGTCC CCGTCACGCC CGGCGCGCTG CCCCCGCGAG CGGTGGACAA GGTACTCGAG GCCTGCCGCG TCTCGGACGA GCGCATCACC GACGCGGCGC AGGGCCCGGT TCTGGTGACC GCCCTGGCCC AGATCGTGGC CGTGGTCGTC GCCTGGGCCA CATCGACCCG CCGGCTGCCA CGGCCCCTGC GGCCGGTCGC GTTCCTCGCC CGGACCGTCA CCCGTCTCGC CTTCGAGATG ATCCGGGATG TCACGCACGG CCGGCGGCGG ATGACGATCG CGGTGGGAAC AGCGCTGGTC GGCCTCGGCA CGGCCGGCGG GCTGACCGGT TCGGGCATCG TCGGCGGCCT GGGGATCGTC GTCGGCCTGA TCGGCCTGCT GATGATCGGA CTTACCGGCT GGCGGCACCT GCCCGGAGGA CTGGCGGTCG TGAGAGCGGG GCTCATCGCG GTGTTCGCGG CGGCCGGTGT CGTTCCGGTG ATCCATGACC GGCTCTTCCC CTGGCTACAC GACGATGTCG TGCCCTACCT GGCCGATCAT CCGTGGGCAT GGGCGACCGT CTTCGGAGCC CTGGTGCTGC CGGCTCTCTG GTCCGTCGCC GAGGCCCTCA CCACCCGCCG GGCCCGCCGC AACGGCTGA
|
Protein sequence | MRDPATQEVR LAVVMTGGAS LAVWMGGVAT EINLATSPDR DRADDADAAV AARYARLATI LDVEVSVDVL AGTSAGGINA AMLGYANTHH ADLTPLRDLW LSLGSFDALM RTPHEKTYPS LLHGDTAVLP ALHTALTAVG ETARAGMDRA GMDRAGMDRA GMDRAGMDRA GMDRAGMDRP TSVFITTTIL RGEVAEHADS LGGTMYDVDH RGLFAFGTQD LTDPGAVPRL ALAARASSAF PGAFEPAYVP VGHAVDAAHP DMARYVNASH GFHASDGGIL VNRPIGPALA AIFDRPAERQ VRRVLAYVVP SPRMPVPTNP AAPTNPAAPT RTSTPPALPV PPTVIPTLLQ VLGAALNQSI GTDLATLRDH NSAVRGTRAN RRRLLRLAPA GGPRLADESV YDAYRRSLAE EIAPPVIEAL LRVLGGRFDL PRPPDTEALR TDAAGPARMT NAAVNAVIQM LPDRLPTTAD LADLWRLGRP ALDAAKGLLI NMINEGYVLS PEPADRIRLA RLAAAVHGAS HATIRNTDTT DTTATTATAA LRPGVFTTVS ETLGAMAGAP LLDVIAEAAR RWLRTDPEGD AGQDELTRAW RSLERMIETL RTELTDLVDR RSPAARPPAG PETRPEGLSV GQRRAVAART LADFVGYLPS TPADALIAVL DVHLVERSTG AAVLDQPVEL VQISADLPNR LDPARALAEE KVTGLQLGNF GAFAKSSWRA NDWMWGRLDG AGWLVRIMLD PRRLVIRRDT AVPAGHAHAA ALARRGWLVD LVDDLTEVAG IPAPREVLDE LGFLTDPDAP VPPNLPVTAT WVAAGIQRDI AAKELVGVAE AVRRDNKAGV DPRPTADFLA AVDRALTVEP SRPGTTTGPG ASAVPEGTGS PNLDEPRGAE VISLAAAVAA RIAAGQTGAH RPAGVDSSAL ARRPPRPPGL SGVPVTPGAL PPRAVDKVLE ACRVSDERIT DAAQGPVLVT ALAQIVAVVV AWATSTRRLP RPLRPVAFLA RTVTRLAFEM IRDVTHGRRR MTIAVGTALV GLGTAGGLTG SGIVGGLGIV VGLIGLLMIG LTGWRHLPGG LAVVRAGLIA VFAAAGVVPV IHDRLFPWLH DDVVPYLADH PWAWATVFGA LVLPALWSVA EALTTRRARR NG
|
| |