Gene Information |
Locus tag | Francci3_1030 |
Symbol | |
ID | 3906272 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 1221773 |
End bp | 1223491 |
Gene Length | 1719 bp |
Protein Length | 572 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 637878363 |
Product | hypothetical protein |
Protein accession | YP_480142 |
Protein GI | 86739742 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.16283 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCCTCC CTCCCCCTCC CGGCCCTGAC CCGGCCGGCC CCCACGACCC CACCAACCGT GACGATCGGC CCGGCTCTCG GGCGGCGCGG ATGCGGATGC CGCTCGCCCG TGCGGTCGTC GAAGCGGTCG CCGTCGAGAA CGGCGTGTGC GTCCGGCCGA TGGCGATGCG GCGCACGAAC CTCGACACCG GCGAAACCGA GATCATCCCC GTACCCTGCG GCGCCACCCT GGCCAGCAAG TGCCCGACCT GCGCGGAGAA GGCCCGGCGG CTGCGGATGG CGCAGTGCAG GGCAGGGTGG CACCTCGACG ACGAACCGCT TCCCGACCCG GACCCGCCGT CCGATGACGC GAAGGTCCTT GCCGGCTTCC GCGCGGATCT CGAAGTCGCC CGGCAGGACG CGGAACGGGA CAGTGACCCG GCCGGCGTCG CCGAGATCGA CGCGCTGATC GGCCAGGTGG ACGAGGAACT CAACGCCCTG GGCGTGCGCG GCAAGACTGC CCCGGACAAC CGGGACCGGC CCCGCCGTGC CCGCTCGACC CGCCGGCGGC AGGACGCTGC CGACCTGCCG CGGTTGCCCG TGGAGAAGCG CACGATCGGC CGGACGTATG AGGCGGCGGA CGGCACGACC TGGCGGCCGT CGATGTTCCT CACGCTGACC TGCGACACCT ACGGGCGGGT GAACTCCGAC GGAGCTCCGG TGGACCCAGC GTCGTATGAC TACCGGCGGG CGGCCCGCGA CGCGATCCAC TTCCCGAAGC TGATCGACCG CTTCTGGCAG AACCTCCGCC GTGCGGTCGG CTGGGACGTG CAGTACTTCG CCGCGCTCGA ACCGCAACGG CGGCTCGCCC CGCACCTGCA CGCGGCCCTC CGCGGAACCG TGCCACGGGC CCTGCTGCGG CAGGTGGCGG CGGCCACGTA TCACCAGGTC TGGTGGCCAC CGTCTGGCCA GCCCGTCTAC CTGGACACGG CACTCCCGAC CTGGGCAAGC GAAACGGGCG GATACGTCGA TCCGGCTTCC GGCCGGCCGC TGCCCACCTG GGATGAGGCG CTCGACGCCA TCGGAGACGA GGACGAACCG TCCCATGTGG TGCGCTTCGG CCCGCAACTG CAAGCGGACG GCTTCACCTC GAACTCGGCG CACACCGGCC GGATGATCGG CTACCTCTGC AAGTACCTGA CCAAAAGCCT CGACGCCTGC CACACGGCCA CCACCGACCG GCAACGGCGC CACGTCGACC GGCTCGCCGA AGCCCTGCGC TACGAACCCT GCTCACCCAC CTGCGCGAAC TGGCTCCGCT ACGGCGTCCA GCCGAAGAAC GCGAAACCGG GCCTCGTCCC GGGACGCTGC CGCGGCAAGG CACACCGACG GGAGACCCTC GGCTTCGGCG GCCGGCGGGT CCTGGTGAGC CGCAAGTGGT CCGGCAAGTC GCTGACCGAC CACAAGCATG ATCGCGTCGC GTTCATCCGG GAGCAGCTCG AAGCGCTCGG CCACACCGCC ACCGGCCCGG CCGCGGCAAC CGACACCGAC CCGGCTCGCA CCGCCTGGAC GATGCTCCGG CCCGGCGACC CGGCCGCACC TCGCCGCGAA CACCTGCTGT TGCAGGCCGT CGCGCAACGC CACGCCTGGC GCGCACAGCT CGACGCGGCC CGACGCGCCG CACCCGACGA ACTTCCGGCA ATCGGCCTCG GCCCACCGGG CAGGGCACAA GCCGCCTGA
|
Protein sequence | MTLPPPPGPD PAGPHDPTNR DDRPGSRAAR MRMPLARAVV EAVAVENGVC VRPMAMRRTN LDTGETEIIP VPCGATLASK CPTCAEKARR LRMAQCRAGW HLDDEPLPDP DPPSDDAKVL AGFRADLEVA RQDAERDSDP AGVAEIDALI GQVDEELNAL GVRGKTAPDN RDRPRRARST RRRQDAADLP RLPVEKRTIG RTYEAADGTT WRPSMFLTLT CDTYGRVNSD GAPVDPASYD YRRAARDAIH FPKLIDRFWQ NLRRAVGWDV QYFAALEPQR RLAPHLHAAL RGTVPRALLR QVAAATYHQV WWPPSGQPVY LDTALPTWAS ETGGYVDPAS GRPLPTWDEA LDAIGDEDEP SHVVRFGPQL QADGFTSNSA HTGRMIGYLC KYLTKSLDAC HTATTDRQRR HVDRLAEALR YEPCSPTCAN WLRYGVQPKN AKPGLVPGRC RGKAHRRETL GFGGRRVLVS RKWSGKSLTD HKHDRVAFIR EQLEALGHTA TGPAAATDTD PARTAWTMLR PGDPAAPRRE HLLLQAVAQR HAWRAQLDAA RRAAPDELPA IGLGPPGRAQ AA
|
| |
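The start and end coordinates, gene length, and protein length listed under Gene Information can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length counts one terminal stop codon; the function names `gene_length` and `coding_length` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information table in this record.
start_bp = 1221773
end_bp = 1223491
protein_length_aa = 572


def gene_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length(protein_aa):
    """Nucleotides needed for protein_aa residues plus one stop codon (assumed)."""
    return (protein_aa + 1) * 3


length = gene_length(start_bp, end_bp)
print(length)                                      # 1719
print(length == coding_length(protein_length_aa))  # True
```

Under these assumptions the 1719 bp span corresponds to 573 codons: 572 amino acids plus the stop codon, which matches the listed protein length.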