Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_1434 |
Symbol | |
ID | 3903165 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 1726274 |
End bp | 1728592 |
Gene Length | 2319 bp |
Protein Length | 772 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 637878771 |
Product | hypothetical protein |
Protein accession | YP_480540 |
Protein GI | 86740140 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.709183 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.106581 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGGGG ACGTACCGGC CGACGAACAC GCGCTCATCA TCTCGGCCGG AGGGGAGACG GGCCGAGCAC TCCGTGCGGC GTTCGCGGCG ATCGCACCCA CCGACGCGTC CTCGGATGAC GGGGTAGACC AGACCGCCGA GGGCGACGGC TCGCCAGCCG ATCCCACCCT GGCCAACGAA CGACGGCCGC TGCACCGCTA CGGCAGGGTC CACGTGTTGG GCCAGCCTCG GGGCACCTCC ACGGCGACGG CGGAGCGCAC CGCCGCCCGG CTGGCGGCGG CGGTACTCCC CCCCGCGGAC GGCAGCGTCG TGTCGACCAC CGGGGTGTCG CCCGGGCTGA ACCGCGCCGA GACGCTCGGG CTGGACGCCT TCCGCCTGCG GGCCAGCGCC GCCTACCGGC AGCTCAAGCA GGAACGTCCC CGCGACGGCC GGCCATGGGA CATGGACCGG CCCTGCACCG ACATCCCGCC GCCCCGGGGC AGCTCGCCTG GTCCGGACCA GCCGACACCA CATGCCCAGC CGACACCGCA TGCCCAGCCG ACACCACATG CCCAGCCGAC ACCGCATGCC CAGCCGACAC CGCATGCCCA GCCGACACCG CATGCCCAGC CGACACCGCA TGCCCAGCCG AACGGGCGGG CCGAGGCCCA GAGCCCGACC TCCGCGCCCA CGGACACCCG GGCCACGGAC ACCCGGGCCA CCGCGCAGAT CGCCCAGGCG TCCTCCCCCA CCAGCGCGGC GCTGGAGGGA TCGGTGGCCG TCGGGCTGAT CCTCGTGGAA GGGCCGACCA CCGCGCTGCA ACTGACCGCC GCGGAACGCA CCAAGATCGT AGCTGAGGTG CAGAACGGGT TGTCCTGGTA CGCAACCACG AACCCGGCTG CCGACCTGAC CTTCCACTAC GACATCCAGA TCGTGCGGCT GTCCGTCCCA CCGGACCCGA ACGCGCCCGA CCTCGAGGCG CTGTGGCGTG ACCCCACCAT GAGCCGGCTC GGCTACGCGG CCAGCTTCGA CGGCGTCTAC GACTACGTCG ACGCGCTCCG GTCCCGGCTC GGCACCCGGT CCGCCTACTG CGCGTTCTTC ACCAAGTATC CGCTGGGATA CTTCGCGTAC TCCTCGGTGG GTGGCCCCCG GCTCGTGATG TCCGTCGACA ACGACGGCTG GGGACCGGAC AATATCGACC GGGTCTTCAC CCACGAGACC GGCCACATCT TCGGCGCTCC GGACGAGTAC GCGGGCGCCC AGTGCGACTG CGGCGGTCGG TGGGGGGCGT TCCACGCCCC GAACGGCAAC TGCGACGCCT GCGCGCCCGC GCCCGTCGAC TGCCTGATGC GCTCGAACAG TTTCGCGCTG TGCCGCTACA CCCCCAGCCA CATCGGCTGG GGCCACGGAG TAAGCGGCAA CCCGGCGCTC GTCCAGGCCA GGGGGCTCGG CCAGATCGGC AACTTCGACG CCGTCGTCCC GTCGGCCTTC GCCGGGCTGA CCCACGTCTG GCGGGACAAC GACGCGGCTG GCTTCCCCTG GATGGCCCCG TGGCAGACGG CTCAGGAGCT CGGCCGGATC GATGCCGCCA CCATGATCCA GAGCACCCTG GCCAGGCCGG GACCCCTCGA GGTCGCCGTC CGGGTCGGCT CGACGCTGTA CTTCCTGTGG CGGGACTCCA CCGGCGCCTT CGCCTGGCAC CCGCCGACCC GCCTCGTCCA GGGGGTGGGG GGCGTTCCGT CGCTGGTGCA GAGCCGGCTG GGCACCAAGG GCAACTTCGA ACTGCTCGTG CCCGCCGCGG ACGTCGGCAT CCTGCACCTG TGGCGCAACC ATGACATCCA CGGATTTCCG TGGAGCACTC CGAAGCTGTT CGGCGCGAAC CTCGGGCGCG TCGACGCCGT CAGCCTCATC CACGGCACGC TGGGCGGCGG CAACGGGATG CTGGAAGCGG TCGCCCGGGT CGGCAACCGG CTCGTGCACC TCACCCGCGA CAACGGGGCG GTCTGGCGCA CCGGCCCCGT CTTCGCCGAG GGCGTGACGG GCAATCCGGC GCTCATCCAG AGCGCCTTCC CCGACGGCTC CCGCAACTTC GAGGTGGTCG TCCCCGCCGC GGACCGCGGG CTCATCCACT TCTACCGGAA CAACGGCGCG CCCTCCCCGG GCTGGAGCGG ACCGCGGCCG TTCGCACCGG AGCTGGGCCG GGTGGACGCC GTCTCGATGA TCCAGAGCAA CTTCGACGGG CATCTGGAGG TGCTCGCCCG CGTCGGCGAC CGGCTCCACA TGGTCTGGCG TTCGTCGGGC CCCGGTGCAA GCTGGTCGGT CCCCCGGCGC GTGTTCTGA
|
Protein sequence | MAGDVPADEH ALIISAGGET GRALRAAFAA IAPTDASSDD GVDQTAEGDG SPADPTLANE RRPLHRYGRV HVLGQPRGTS TATAERTAAR LAAAVLPPAD GSVVSTTGVS PGLNRAETLG LDAFRLRASA AYRQLKQERP RDGRPWDMDR PCTDIPPPRG SSPGPDQPTP HAQPTPHAQP TPHAQPTPHA QPTPHAQPTP HAQPTPHAQP NGRAEAQSPT SAPTDTRATD TRATAQIAQA SSPTSAALEG SVAVGLILVE GPTTALQLTA AERTKIVAEV QNGLSWYATT NPAADLTFHY DIQIVRLSVP PDPNAPDLEA LWRDPTMSRL GYAASFDGVY DYVDALRSRL GTRSAYCAFF TKYPLGYFAY SSVGGPRLVM SVDNDGWGPD NIDRVFTHET GHIFGAPDEY AGAQCDCGGR WGAFHAPNGN CDACAPAPVD CLMRSNSFAL CRYTPSHIGW GHGVSGNPAL VQARGLGQIG NFDAVVPSAF AGLTHVWRDN DAAGFPWMAP WQTAQELGRI DAATMIQSTL ARPGPLEVAV RVGSTLYFLW RDSTGAFAWH PPTRLVQGVG GVPSLVQSRL GTKGNFELLV PAADVGILHL WRNHDIHGFP WSTPKLFGAN LGRVDAVSLI HGTLGGGNGM LEAVARVGNR LVHLTRDNGA VWRTGPVFAE GVTGNPALIQ SAFPDGSRNF EVVVPAADRG LIHFYRNNGA PSPGWSGPRP FAPELGRVDA VSMIQSNFDG HLEVLARVGD RLHMVWRSSG PGASWSVPRR VF
|
| |