Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_4164 |
Symbol | |
ID | 3907129 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 4965578 |
End bp | 4967083 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637881492 |
Product | hypothetical protein |
Protein accession | YP_483241 |
Protein GI | 86742841 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0103438 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAATTTC CTTCGTTGCT CCCCTTGCCC GAATGGCCCG ATGCCGAGCC GGCCCTGTGG TACGGCCGGA TGTCCGATGT CGACAATGAC GCACGTATCC CGGACCAGTT CGCCCGAGGT CAGCGTTACG CGCGGCTGAC GGGTGAGTAC TGGATAGCCC GGGCGTGGGC CGATGACGGG ATTTCCGCCT GGCGTGAGGA TGTGGTCCGG CCGGAGTTCG AAGGATTCCT TACGACGCTA CGAACGGGGA AGCACCGGGT CGTGGTGGCG TGGGAGGAGA GCCGGATTAC CCGTGACCCA GTGGTCGGCG CGGAGTTCGG CAAGATCATG CAGCGGGTCA GCGGCCGACT AATCGTCACT GATGGTGAGA AGGCCACGAC CTACGACTTC CGCAGGCAGC GGGACCGCGA TGCCTGGCAT GGCGCGGTCG GGAAGTCGGT CAGCGATTCC GGGCTGAAGT CGGAGCTGGT AAAGCGGAAG TTGGATGCCA AGCGGGAGGC CCGGGAGTTC CTTGGTGGCC CGGTCGGCTT CGGCTGGTCT CAGACGATCA GCCGGAAGGG CAAGAAGATC GTGACCGAGT GGTCGGTCAA CGAGGAGCAG GCCCGTTGGC TACGCGAGGC ATCGCAGCGG ATCCGGGAAG GTGAAGCCGT ACTCAAGGTC GCGGACGACT TCTATGACCG TGGTCTACGC ATTCCGCACC GGCGGACCAG GCCCGACGAC ACCATGAAGA CCGGCACCCT GACACGGGCC AGTCTGACAG CGATGCTGCG TAACCCGCGC ATCGCTGGAC TGTTCGCAAC GGGAAACGTC CACCGCGGGT GGACCGTGGT CGGTCCGATG GCGAACTTCC CTGCCATCCT CACGGAGGAG GAGTGGCGGG AGACCTGTGC GGCGTTGGAA GCCGTGAAAA CCCGCAAAGG CACCGGAACG GCCGTCAAAC ACGTGTTCGC CGGCTACTAT GTGTGCCACA AGTGCAAGCG ATCATTGATC AGGAACTCTC CCCGCGCGTA CGCGCTGTGG CGGCATCGTC TCGGTAAGAG CCGTGAGCAT GTCGAGTGTG ACCAGTCGTT CCACATCAAC GCCGCCGATG CCGACGACCT GATGACCCGC CTGGTTGACG CCTACCTTGT GCGCCGGGAC TGGGAGAAGG CGGGTGAGGT CGCCGACGCC GAGGAGTTGA AGGCCGAACG GACCGAGAAG GAACAGGAGC TTGCCGACCT CCCCCGCGCC ATCACGGCCA AAGAGATCAG TCTGCGGTTG GGTGGCCAGG TCGAAGCCCA GATCGAGGCC CGGCTACGGG AGATCGACGC CGAGTTGGCC CGCCGGGCGC GTCTCGTGAC CGTCCTGGAC GGCCAGGAGG CACTACGGCT ATGGCGTAAC GGCACCCTGA CGGAGAAACG CCGCGTCCTA TCGACGATCA TTGAACGGAT CATCGCGATC CCCGGGAAGG ATCTTCCGTT GCGTGACCGG TTGGACCCCC AGTGGCGCAA TCCCAGCCCT GCCTAG
|
Protein sequence | MEFPSLLPLP EWPDAEPALW YGRMSDVDND ARIPDQFARG QRYARLTGEY WIARAWADDG ISAWREDVVR PEFEGFLTTL RTGKHRVVVA WEESRITRDP VVGAEFGKIM QRVSGRLIVT DGEKATTYDF RRQRDRDAWH GAVGKSVSDS GLKSELVKRK LDAKREAREF LGGPVGFGWS QTISRKGKKI VTEWSVNEEQ ARWLREASQR IREGEAVLKV ADDFYDRGLR IPHRRTRPDD TMKTGTLTRA SLTAMLRNPR IAGLFATGNV HRGWTVVGPM ANFPAILTEE EWRETCAALE AVKTRKGTGT AVKHVFAGYY VCHKCKRSLI RNSPRAYALW RHRLGKSREH VECDQSFHIN AADADDLMTR LVDAYLVRRD WEKAGEVADA EELKAERTEK EQELADLPRA ITAKEISLRL GGQVEAQIEA RLREIDAELA RRARLVTVLD GQEALRLWRN GTLTEKRRVL STIIERIIAI PGKDLPLRDR LDPQWRNPSP A
|
| |