Gene Information |
Locus tag | Francci3_1165 |
Symbol | |
ID | 3905276 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 1388110 |
End bp | 1392078 |
Gene Length | 3969 bp |
Protein Length | 1322 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 637878497 |
Product | hypothetical protein |
Protein accession | YP_480273 |
Protein GI | 86739873 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
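The coordinate and length fields above are consistent with each other. The short sketch below shows the arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1388110
end_bp = 1392078

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no residue.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")      # 3969 bp
print(protein_length, "aa")   # 1322 aa
```

Both results match the Gene Length and Protein Length fields reported in the record.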
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0368185 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGCCG GCTTTATGGT GCTCTTTCTG CTCCAGGCCC CCGGCAACCT GACCGCCGAC ACCAAGCTGG ACGTCCCCCT TGACCCGTGG GGCTTCATGG GGCGGGCCAC CCACCTGTGG AACTCCCTCG CCGAGTTCGG CTACCTGCCC AACCAGTACG TCGGCTACCT GTTCCCGATG GGACCGTTCT TCGGTGCCGG TGACCTGCTC GGGATACCCG CCTGGGTCAC CCAGCGGCTG TGGATCGCGC TGCTCCTCAC CGTCTCCGCC TGGGGGGTCG TGCGGCTGGC CGAGGCCCTG CGGCTCGGCG TACCGCGCAC CCGGGTGCTG GCCGGACTCG CCTACACCCT GTCCCCGATG TTCCTCGGCA AGATCGGGGC GACTTCCGTG GCGCTCGCGG GCGCCGCCAT GCTCCCGTGG ATCACCCTGC CGCTCATCCT CGCGCTGCGT CCGGACGGCG CCTTCGGCGC CGACACCGGT ACGGCCGCCA CCACCAGTAC CACGGCCGTC GATGATGTCG ATGCCGCTGA CGATCCCGCC CTGGACCCGG GCCGGCGGCT GTCCCCGCGA CGGGCGGCTG CCCTGTCCGG ACTCGCCATC CTGGGCACCG GCGGAATCAA CGCGACGGTC ACGCTGTGCG TCCTGCTCTG TCCGGCGGCG GTCCTTGTGT TCGCCGGCGG CACCCGCCGG GCCTGGGCGC TGCGGGCCTG GTGGGTCGTC GCCGTCATCC TCGCCTCGGC CTGGTGGCTG CTGGCGCTGA TGGTGCAGGG CCGCTACGGG CTGAACTTCC TCGCGTTCAC CGAGACCGCC CAGACGACCA CGGCGACGAC GTCGGTGGCG GAAGCCCTGC GCGGCGCGAC GGACTGGCTG GCGTATGTCC AGTTACCCAG GCCCTGGCTG CCGGCCGCCA CCGAGTACGT GTCCCGGCCG GTGCCTATCA TCGGATCGGC AGCCGTGGCG GCGATCGGGC TGTGGGGGCT GTCCCGGGCC GATCTGCCCG CCCGCCGGTT CCTGCTGGTG ACGTTGGCCG CCGGCACGGT CTCGGTGGCT GCGGCCTATC CGGGGCATCC GGGCAGCCCG CTGGCGGGCG TGGTGCGGGA CCTGCTCGGT GACCAGCTGG GCTTTCTGCG CAACGTCTAC AAGTTCCAGC CGGTGGTGCG CCTGCCGATA GCCCTGGGGA TCGCGCACGC CCTGACGGTC CGACCTGTCA GTCTCACCGC CCGTGTGCGG AGATGGACGA CCCGCGGGGC TTCCCGCCGG CACGGCAGGA CCGCGGACAT GATCGTGACC GTCGCGGTGA ACGTCCCCGT CGTGCTGACC GTGGCCGCGC TGGTCTGCGG CATGACGCCA GCGCTCGCCG GCAGGGCGCT GCAGCCCCGG CCGTTCGCCC GGGTGCCCGA CTACTGGGTG GCCGCCGCGG ACTGGCTGGC CGCCCATCCC CAGGACGGCC GGGCACTGGT GCTGCCCGGC TCGCCGTTCG CCGAGTACGA GTGGGGCCGG CCGCTCGACG AGCCGCTGCA GTGGCTGGCC CGCACGCCGT GGGGGGTCCG CGGCCTGATC CCGCTCGGTG GCACGGGCGT GACCCGGCTG ATGGACGGCA TCGAGCACCA GCTCGCCACC GGGTCCGCCG CGGGCCTCGC CCCGGCGCTG GCCCGGGCCG GGATCGGCCA GATCCTGCTC CGCAACGACC TGGAACAGAA GAACTGGGAC GTCCCCCCGT CCACCGACGA GCTCCACCGG GCGTTGCGTT CCTCCGGTCT CACCCTGACC GCCGCGTTCG GCCCGCAGGT GCCGGCGCGC 
GCCTCGGCGA AGGAGAGGCT CGTCCCGGAG CTGCGTAACC CGACCGCGCG GGTACCCGCG CTGGAGGTCT GGACCGTCCC GGGCGGCGCG AAGCTTGTCG GCTCCTACCC GGCGGACACC GCCGTCGTCG TCTCCGGCGG CCCCGAGGCG ACGGTGCAGC TCGCCGGGCA GGGCCTGCTG AGCAGCGACC GGGCCGCCGT GCTCGCCGCG GACCTCGCCG CGGATCGCAC CGATCCGTCC GCCGCCGTGG CGTCTTCGAC GAGCCCGGCG GCGATCAAGG TTGCGCCGGG GGAGGTGATC GGCCCGACGA CCGCCTGGGT CGACACGGAC ACCCTGACCC GGCGCGACAG CACCTTCGGA CTCGTCCACG ACGCCGCTTC GTACCTCCTC GGGCCCACCG GCACCGCGGT CGGCAAGACG GGGGAGCCCC ACCAGTGGGC CGAGGCCGAC GTCGCCGGCC ACCAGACGGT CGCGGGCTAC GTCGGCGGCA TGTCCGTCAC CGCGTCCTCC TACGGCTACG ACCTACTCGC CGCGCCGGAC CTCGCGCCAC CGGCGGCCGT CGACGGCTTC GCCGCGACCG CGTGGACGGC ACAGCGCACG AAGGGCACGA CGTCGGCCGG GCAGTGGATC CAGCTCGATG CCGGCCGGCG GATGACCGTG CCCTACCTCG ACGTGCGGCT GCTGTCCGAG GGTTCGTGGC GGCCGGCCGT CGAGGCGCTG CGGGTGTCCA GCGAGGCGGG CTCGGTGGTC ACCGAGGTGC GTCCGGTGGA GGACATCCAG CGCCTCGCCG TCCCAGCGGG GCCGAGCCGC TGGTACCGCA TCACGTTCGC GCGGGTCGGC CAGGAGACCG ACGACACCCT CGGCGCCGGT ATTCGGGAGA TCACCATCCC CGGGGTCACC TTCCGCCACT ACGCGCAGCT CCCCACCGAC GTGGCCCGGC GCTTCGCCGC GCCGGACGGG GGGCTGGTCG CCTTCTCGAT GGCCCGGGAG CGGGTCGACC CGGCGCAGCC GTTCGGCGGC TCGGAGGAGC TGGCGCTGTC CCGGCGCTTC GAGGTGCCCC GGGACATGAC CTTCACGATG AGCGGCTCGG CGAGCTTCAT CCCCCCGCCG CGGGGGACAC CCCCAGCGGG TGACACTCCG TTGCTGGTCG ACTGCGGCCA GGGCCCCACC CTGGTCGTCG ACGGGACCCG TTACCCCCTG CGGATCTCCG GCCGGGACAG CGACGTCACC TCGGTGCGGC CGGTGCGGGT GTCGCTGTGC ACCGACGGCG GCACGCTGCG GCTCTCGGCG GGGCCGCACC TGCTCGGCGT CGACCAGGGA GCGACCTCCA TCCTCGTCGA CGCGATCGCG CTGGTCGGAT CGGGCGCCCA GGTGAGCGCG GCCACGCCGC GGGCGACGAC GGTGCACGAG TGGACAGCGG AGCACCGGAC GGTCCAGATC GGCGCCGGTG ACCGCGCCTT CCTCGCCGTC CGGGAGAACG CCAACTCCTC GTGGACGGCG AAGCTGAACG GCGTGGCGTT GACCCCGCTG CGCCTCGACG GCTGGCAGCA GGGCTGGATC GTCCCGGCCG GCACCGGCGG CACGATCGTC ATCGACAACG CGCCCGGTGC GGAATACCGC CGCAACCTCG TGGTGGGGCT GTTGCTGGTC GTGGTGCTGG TCGTGCTCGC CGTGCTGCCC GCCCGGTTCC GGCTACGGCC ACGATCCGAC GAGAACGGCT ATCCGGCGCT GCTGAGGCCG CGCCGGCTGG GCCGGCTGGG CCGGGCGACG GTGTCCCCGC CGGTGGCCGG TGTGGGGTGG ACGGTGCTCG 
CCATGCTGGC GGTGGCGCTG GTGGCGGGCC CGGTGGCGCT GGCCGTACCG GTGTTCGTCC TGGTCGGTCG GCGCTGGCCC GCGGCGGGGG GCCGCTGCGC CGCGGTCGCC ATGATTGGCG CGGGAATCGG GGTGGCGGCG CAGCCCGGCA GCCAGCCCGG ATCCGGGCAG GGGGCGTTCG GGCCGCTGGT CCAGGTGTTC GGCGCCTTGG CCTTGGCGGC GGTGCTGGCC GCGTTGGCCG GGCGGGCCTG GGACCGTCCC GCCGGCTCGG GCGCCGGCGC CCACTTCGGC GCCCGAGCCG GGTCCGATGT CGGGCATAGG ACTCGCAATC CCTATACCGA CAGCAAGGGA CTCGGTTAG
|
Protein sequence | MAAGFMVLFL LQAPGNLTAD TKLDVPLDPW GFMGRATHLW NSLAEFGYLP NQYVGYLFPM GPFFGAGDLL GIPAWVTQRL WIALLLTVSA WGVVRLAEAL RLGVPRTRVL AGLAYTLSPM FLGKIGATSV ALAGAAMLPW ITLPLILALR PDGAFGADTG TAATTSTTAV DDVDAADDPA LDPGRRLSPR RAAALSGLAI LGTGGINATV TLCVLLCPAA VLVFAGGTRR AWALRAWWVV AVILASAWWL LALMVQGRYG LNFLAFTETA QTTTATTSVA EALRGATDWL AYVQLPRPWL PAATEYVSRP VPIIGSAAVA AIGLWGLSRA DLPARRFLLV TLAAGTVSVA AAYPGHPGSP LAGVVRDLLG DQLGFLRNVY KFQPVVRLPI ALGIAHALTV RPVSLTARVR RWTTRGASRR HGRTADMIVT VAVNVPVVLT VAALVCGMTP ALAGRALQPR PFARVPDYWV AAADWLAAHP QDGRALVLPG SPFAEYEWGR PLDEPLQWLA RTPWGVRGLI PLGGTGVTRL MDGIEHQLAT GSAAGLAPAL ARAGIGQILL RNDLEQKNWD VPPSTDELHR ALRSSGLTLT AAFGPQVPAR ASAKERLVPE LRNPTARVPA LEVWTVPGGA KLVGSYPADT AVVVSGGPEA TVQLAGQGLL SSDRAAVLAA DLAADRTDPS AAVASSTSPA AIKVAPGEVI GPTTAWVDTD TLTRRDSTFG LVHDAASYLL GPTGTAVGKT GEPHQWAEAD VAGHQTVAGY VGGMSVTASS YGYDLLAAPD LAPPAAVDGF AATAWTAQRT KGTTSAGQWI QLDAGRRMTV PYLDVRLLSE GSWRPAVEAL RVSSEAGSVV TEVRPVEDIQ RLAVPAGPSR WYRITFARVG QETDDTLGAG IREITIPGVT FRHYAQLPTD VARRFAAPDG GLVAFSMARE RVDPAQPFGG SEELALSRRF EVPRDMTFTM SGSASFIPPP RGTPPAGDTP LLVDCGQGPT LVVDGTRYPL RISGRDSDVT SVRPVRVSLC TDGGTLRLSA GPHLLGVDQG ATSILVDAIA LVGSGAQVSA ATPRATTVHE WTAEHRTVQI GAGDRAFLAV RENANSSWTA KLNGVALTPL RLDGWQQGWI VPAGTGGTIV IDNAPGAEYR RNLVVGLLLV VVLVVLAVLP ARFRLRPRSD ENGYPALLRP RRLGRLGRAT VSPPVAGVGW TVLAMLAVAL VAGPVALAVP VFVLVGRRWP AAGGRCAAVA MIGAGIGVAA QPGSQPGSGQ GAFGPLVQVF GALALAAVLA ALAGRAWDRP AGSGAGAHFG ARAGSDVGHR TRNPYTDSKG LG
|
| |
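The sequences above are printed in blocks of ten characters separated by spaces. The following is a minimal sketch, not the database's own tooling, of how such a string could be cleaned, checked for GC content, and translated. It assumes that translation table 11 assigns amino acids to codons exactly as the standard genetic code does (the two tables differ in which start codons they permit, which does not matter here because this gene begins with ATG). The helper names clean, gc_fraction, and translate are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
# Assumption: table 11 shares these codon-to-residue assignments with table 1.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three ten-base blocks of the gene sequence, copied from the record.
prefix = "ATGGCCGCCG GCTTTATGGT GCTCTTTCTG"
print(translate(prefix))  # MAAGFMVLFL
```

The translated prefix agrees with the first ten residues of the protein sequence in the record. Passing the full gene sequence to gc_fraction would give a value to compare against the reported GC content field.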