Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_2728 |
Symbol | |
ID | 3904718 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 3212637 |
End bp | 3216959 |
Gene Length | 4323 bp |
Protein Length | 1440 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637880051 |
Product | hypothetical protein |
Protein accession | YP_481817 |
Protein GI | 86741417 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.935327 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTACCG TGGGTGCCAC CACGCGGGGC CGGGGCATCC CCGGAGGTCG CGGCAGCGGT GGCGGCAGCG GCAGCGGTGG CGGTCGGCGC CGGCCGCCCG CGCCGGACGG CTCCGCGCAG CACCGCGAAT GGCTGAGCCT CATCGAGGTC AGCGGGCCGT TCCTCAGCCT GCCCGTGCTC CGGCGGGTCT GGCCGACGCT CGACCCGCTG GAGAAGAAGA CCCGGGAACG CCTGCGCCGG GAGCACACCG CCTGGCTGGA CGATCCCGTC GGCGGCCAGG CGGCCTGGGT GCGTTTCGTT CTCGGCGAGC TACTCGGCTG GGGGAACACG CTCGTCCTGC CCGATGGCAC GAACGGCGCG GGTGGCGCGG ACGGCGCGAA CGGCGCGAAC GGCGCGCTGC GCTCGTTCAC CGTCGACGTC GGCGAGCACG ACGCGCACCT CGTGCCGTCC TTCATCCTCG TCGATCCGGA CACAGCGGAC ACAGTGGCCG CGGGGGACAC AGTGGCCGCG GGGGACACAG TGGCCGCGGG GGACACAGTG GCCGGGGGGT CGTCCGAGGT GAAACCGGAC GCGGTCCGCG TGCTCGGTCT GGTCGTCGCG CCGTCACAGT CCCCGACGGC GCGGATCGCC GGGCAGGCCT GGGCTGCGAC GCCGGTGGAC CGGGCGGCGA TCGCCTGCCG GCATCACGGG ATCGAGCTCG CGCTGGTGAC GAACGGCCGG CAGTGGGCGT TGGTTTGGGC GCCGCGCGGC GGGGTGACGG CGACGGCGGT GTTCGACGCG GTGTCCTGGC CGGAGGCGGC CGAGCGCGAC GTCGTGCGGG CGTTCGTGTC GTTGCTGTGC CGGCGGCGCT TCTTCGGCGT ACCCGACGAC GAGACGCTGG TCCCGTTGCT GAAGGCGAGC CTCGGCAGCC AGGAGGAGAT CACCGAGGCG CTCGGCGTCC AGGTTCGCCA GGCCGTGGAG CTGCTCGTCG CCGCCTTCGG CCGCTGGCAC GTCGCCGAGC GGGACCGGGG TGGCGCCGGG CTGCGCGACG TCGGCGCGCA CGACGTCTAC CGGGGCGCGG TCGCGGTGAT GATGCGCATC GTCTTCCTGC TCTACGCCGA GGAACGCGGC CTGCTGCCCG CCGGCAACGA CCTGTACGCG AGTACCTACT CCGCCGGCCG GCTGTGCGCC GAGCTGGAAC AGCAGCTTTA CGACGGCGCG ACCGAGGACG ACCTGGAGCA GTCGACGGCG GCCTGGCACC GGCTGCGCGC CCTGTTCACC GCCGTGTACG CGGGGGTCGA TCATCCCCGG CTGCGCATGC ACGCCTACGC CGGTTCGCTG TTCGACCCGG CCGCCCATCC GTGGCTGCCA CTCGGCGTGG ACGACCGCAC GGTGCTGCAC ATGCTGCGTG CCGTCCAGTA CATCCAGGTC GGCAGCGGGA AGTCCCGGGA GCGGCGGACG CTGAGCTTCC GCTCGCTGGA CGTCGAGCAG ATCGGTTACG TCTACGAGGG TCTGCTCTCC TACGATGCCG AACGCGCCCC GGCCGCGGAG GTGGTCGTCG GCCTGGCCGG CCGGGAGGGG CTGGAGGCCG AGGCGAAGCT GTCCGAGCTG GAGGAGCTCG CCTCGCGTCA CCGGGGCCGC CCGCTGCTCG CCGCCGCGCT CGCCGAGAGG TTCGCCGCCA GCGGGATCGG CGCGCCGAAG GCGGTCGAGC GCCGGCTCGT TCCGTTGGAG GCCGGCGAGC GGGAGGAGGC CCGCAAGGCC CTGCTCGCCG TGACCCGCGG CGACTACCCG ACCGCCGAGC GGCTGCTGCC GTTCCACGGC CTGCTGCGCA CGGACCTGCG CGGCCAGCCC GTCGTGGTCC TGGAGGACGG GCTGTACGTC ACCGAGTCGC CGCTGCGGAA GAACACCGGC ACCCACTACA CCCCCAGGTT CCTGGCCGAG GAGGTCGCGG ACAACACCCT GGAGCCCCTG GTCTACGCCC CCGGGCCGTT GCAGACCGCC GACCGGGGCG CCTGGCGGCT GCGGTCCAGC GCCGACATCC TGTCGCTGAA GGTCGCCGAC ATCGCGATGG GCTCGGCCGC GTTCCTCGTC GCCGCCGCCC GCTACCTCGG CGACCGGCTG GTGGAGGCGT GGATCCGTGA GGGTGACCCG CGGGCCGCCG GGCACGCCGG GTACGCCGGG TACGCCGGTG ACGCCGACGC GGACGACGTC ACCGTCGAGG CCCGCCGGCT GGTCATCGAG CACTGCCTGT ACGGGGCGGA CATCAACCCG ATGGCCGTGG AGATGGCGAA GCTGTCGCTC TGGCTGGTGT CGATGGACCC GCGCCGGCCG TTCACCTTCC TGGACGACCG GCTCGTCTCC GGCGACTCCC TGCTCGGTAT CACCTCGGTC GAGCAGCTGG AGTACATGCA CCTCGACCCG GCCAAGGGCC GGAAGATCCA CTCCGACATC TTCGGCTGGA CCGCCGGCGT CCGTGAACTG GTCCGCGACG CCGCCGGGCG GCGCCGGGAC CTTGTCGAGA CCGACGGCTC GTCTCTGGCG GGACTTGCGA GGAAGCGCAG GCTCCTGTCG GGGGTGGAAC ACGACACCGA GCGGCTGCGG CTCTTCGCCG ACCTGACTGC CGGCGCCGCG CTCGTCGGCT ATCAGAAATG GTCGGGGACG AGTAACCCGC ATGCCCGCGC CGAGATCGAC GAGGAGAAGA AGGCCAGCCG GGAAAGCTTC TCCCTGACCG CGGCCCGCCT CGCTGATGAG GTCACCGGCA AGCGGGATGA GACCGAGGCC CGGAAAACGG CACAGTCCTG GTTGGCCACC GATTACGTTC CCGGCGGTTT CCGGCGGAAA CCCCTGCACT GGCCGCTGGT CTTCCCCGAG GTCTTCGAGG AGGGCGGTTT CGACGCCGTC CTCGGCAACC CCCCGTTCCT CGGTGGCCAG AAGCTGACCG GTGCACTGGG CGTGGCGTAT CGCGAGTACC TCGTCGAGGA GATCGGCGGG GGAGCACGGG GGAGCGCCGA TCTGGTCGCC TACTTCGCGC TGCGTGCGCA CTCTCTGCTC AACGGTGGCG GTCAGACGGG GCTGATCGCG ACGAACACCC TGGCCCAGGG CGATACCCGT GAGGTCGGCC TTGACCAGAT TGTCGCTGAC GGGAACGTCA TCCGCCGGTC GATCAAGAGC CGGCCCTGGC CGTCGAAGTC CGCGGTCCTG GAATTCAGCG CCGTCTGGAC CACAGGGCAG ATCGGGCCGG CCGCCGCCCG GTTCGCCGAC GGCGTCCAGA CCCCCGTGAT CGCGCCCTCC CTGGAGCCCG AGGCACGGGT GTCCGGAAAT CCTCACCGAC TGGAAAAAAG CGCAGGAATC TCCTTTATTG GAAGTTATGT CCTGGGTCTT GGCTTCACGA TGGAACCCGC CGAGGCCCGC GCGTTGATCG CGAAGGATCC GCGTCATACC GACGTGCTGT TCCCCTATCT GAACGGTCAG GATCTCAACT CGCGCCCCGA CTGCTCGGGC AGTCGTTGGG TCGTCAATTT CCACGACTGG CCTGAGGACC GCGCAAAGCG GTACCGCGAC TGTTACGACC AGGTCCGCCT CCTCGTCAAG CCAGAACGTG ACCGAAATAA TCGAAAAGTA TATCGCGACT ACTGGTGGCA GTACGCTGAG AAGCGTCCCG CCATGCTCGA CGCGATGGGG GATCTCGAAC GGCTTGTCGT CATCACTCTT GTGAGCCGCA CGGTGATGCC GGTGATGGTG CCGACCGGGC AGGTGTTCGC GCACAAGCTT GGTGTGTTTG TCACTGACGA CGGGGCAATG CTGTCACTAT TGTCGAGCGC GCCCCATTTT TGGTGGGCGA TGAGTCGTAG TTCGACGATG AAGGCGGATC TCAACTACTC GCCGTCGGAC GTGTTCGAGA CCTTGCCGTT GCCTGAACTG ACGGGGGAGA TGCGGGAATT GGGGGAGGGG TTGGACGGGT TCCGGCGGGA GGCGATGCTG GCGCGGCAGT CAGGGCTCAC CAAGACGTAC AATCTGGTTT TCGATCCGGG CTGCAACGAT GCCGATATCG CCGAGCTGCG GGAGATCCAC CGGCTCATCG ACGAGGCGAC CTTTCGGGCG TACGGGTGGG TCGATATGAT CGACCGGGGC CTGGACCACG GCTTTCATCC GGCCGGGAAG TACACTCGGT ACACGATCGG GCCGTGGGCG CAGCGCGAGG TGCTCGACCG CCTCCTCGAA CTGAACCACG CCCGGTACGC CGAGGAGGTC GCCGCCGGCC TGCACGAGAA GGGCGCGAAG AAGAAAGGCG CCCGCGCCAG GGCGGCCGCA TCGCCGGACC AGGGGTCGCT GCTTGACCTG TAG
|
Protein sequence | MSTVGATTRG RGIPGGRGSG GGSGSGGGRR RPPAPDGSAQ HREWLSLIEV SGPFLSLPVL RRVWPTLDPL EKKTRERLRR EHTAWLDDPV GGQAAWVRFV LGELLGWGNT LVLPDGTNGA GGADGANGAN GALRSFTVDV GEHDAHLVPS FILVDPDTAD TVAAGDTVAA GDTVAAGDTV AGGSSEVKPD AVRVLGLVVA PSQSPTARIA GQAWAATPVD RAAIACRHHG IELALVTNGR QWALVWAPRG GVTATAVFDA VSWPEAAERD VVRAFVSLLC RRRFFGVPDD ETLVPLLKAS LGSQEEITEA LGVQVRQAVE LLVAAFGRWH VAERDRGGAG LRDVGAHDVY RGAVAVMMRI VFLLYAEERG LLPAGNDLYA STYSAGRLCA ELEQQLYDGA TEDDLEQSTA AWHRLRALFT AVYAGVDHPR LRMHAYAGSL FDPAAHPWLP LGVDDRTVLH MLRAVQYIQV GSGKSRERRT LSFRSLDVEQ IGYVYEGLLS YDAERAPAAE VVVGLAGREG LEAEAKLSEL EELASRHRGR PLLAAALAER FAASGIGAPK AVERRLVPLE AGEREEARKA LLAVTRGDYP TAERLLPFHG LLRTDLRGQP VVVLEDGLYV TESPLRKNTG THYTPRFLAE EVADNTLEPL VYAPGPLQTA DRGAWRLRSS ADILSLKVAD IAMGSAAFLV AAARYLGDRL VEAWIREGDP RAAGHAGYAG YAGDADADDV TVEARRLVIE HCLYGADINP MAVEMAKLSL WLVSMDPRRP FTFLDDRLVS GDSLLGITSV EQLEYMHLDP AKGRKIHSDI FGWTAGVREL VRDAAGRRRD LVETDGSSLA GLARKRRLLS GVEHDTERLR LFADLTAGAA LVGYQKWSGT SNPHARAEID EEKKASRESF SLTAARLADE VTGKRDETEA RKTAQSWLAT DYVPGGFRRK PLHWPLVFPE VFEEGGFDAV LGNPPFLGGQ KLTGALGVAY REYLVEEIGG GARGSADLVA YFALRAHSLL NGGGQTGLIA TNTLAQGDTR EVGLDQIVAD GNVIRRSIKS RPWPSKSAVL EFSAVWTTGQ IGPAAARFAD GVQTPVIAPS LEPEARVSGN PHRLEKSAGI SFIGSYVLGL GFTMEPAEAR ALIAKDPRHT DVLFPYLNGQ DLNSRPDCSG SRWVVNFHDW PEDRAKRYRD CYDQVRLLVK PERDRNNRKV YRDYWWQYAE KRPAMLDAMG DLERLVVITL VSRTVMPVMV PTGQVFAHKL GVFVTDDGAM LSLLSSAPHF WWAMSRSSTM KADLNYSPSD VFETLPLPEL TGEMRELGEG LDGFRREAML ARQSGLTKTY NLVFDPGCND ADIAELREIH RLIDEATFRA YGWVDMIDRG LDHGFHPAGK YTRYTIGPWA QREVLDRLLE LNHARYAEEV AAGLHEKGAK KKGARARAAA SPDQGSLLDL
|
| |