Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_0878 |
Symbol | |
ID | 3903861 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 1022447 |
End bp | 1023736 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637878211 |
Product | phage integrase |
Protein accession | YP_479991 |
Protein GI | 86739591 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.964674 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGACCTG ATCCCACGCA ACCCGGCGGA CCGGACCAAG CCGCCGCAGC GGAAGGCCCA GGCCGAGCGG CACACGAAGA TCCGCACCTG GACGAACGCC GAACTCCGGA CCTTCCTCGA CGCCACCCGC GACGAACCGA CCTACGCCCA GCGCCTCACC TCGCCACCAC CGGCGTGCGG CGAGGTGAGG CGCTGGGGCT CGCCTGGTCG GCGGTGGATC TGGACGCGGG GCGCATCTCG ATCCGCCGGA CGCTCGTCAA CGTCACCACG TCTGACACCG GCCGGGTGCC GGTGTTCAGC GACCCGAAGA CCGTACGCGG CACGCGGGTC ATCGCCCTGG ACACGGCCAC CGTAACGGCG CTGCGTGACC TGCGCGAGAG CAAGCTGAAG GAGCTGGCGT TGCTCGGCAG GGAGCCAGAG GCCGATCTTG TCTTCACCCA CTGGGACGGC CGGGCCATGC ACCCGGAACG GAACTCCCGC GCGTTCCTCC GGCGGGTGAG GCGCCTCGGC CTGCCGGTCA TCCGGCTGCA CAACCTGCGG CACACCTGGG CAACCCTCGC GCTCGCGAGC AACGTGCATC CCAAGGTCGT CTCCGAGCGG CTGGGGCACG CCAGCATCAC GATCACCTGG AGATCTACAG CCACGTTCTG CCCGGCATGC ACTCCGATGC CGCGGAAGTC GTCGCCGGGC TCATCCTCGG CACTGGCGGG CAGGCCGACG AGGACCAGGC GGACCGGCCC GACGGCGGTG GCGACGCGAC CGACTCGGAC GACTCCGATG GCCTGGGTGA TGACGATGCG GAACCGGGTG CGCCTGCGTC AGACTGCCCG GCCACCGGCC CGCGGACTCC TCGTCCGTCC ACTTCGGTCA CGGCCCGGTC GTGACCAATC TGTGACCAAA CGGGCCCAAA CGATCAGAGG AGGAACGGTG GGCGAGCATC TGACCTGCGA GAACGCTGCC GTTCAACGGG TGCACCTCCT TGGGCGCTAC TCGAACACGC CAGAGGTACT GACCGATCTT CAAACCGTCT GGGCCGTCGT AGCTGAAACC CCTGGTCAGG GGGAGACACA GGAGCTTCCG GGCCTCACAG GTTCCGGTCA GGTGCCCCGA CGACACGCCA TCGTTGACCG CCTCCCGGCC TCGGACATCG AGACCTTGAT CAGCCTGTAC CTGGCCGGCA GCACCGCTCG CGCCTTGGCG GCCAGGTACT CGATCAGTCT CACGGCGGTC AAGACGCTGC TGCGCAAGCG TGGCATCCGC CGCAACCGGC GATCAGCTGA ACCCTCGTGA
|
Protein sequence | MGPDPTQPGG PDQAAAAEGP GRAAHEDPHL DERRTPDLPR RHPRRTDLRP APHLATTGVR RGEALGLAWS AVDLDAGRIS IRRTLVNVTT SDTGRVPVFS DPKTVRGTRV IALDTATVTA LRDLRESKLK ELALLGREPE ADLVFTHWDG RAMHPERNSR AFLRRVRRLG LPVIRLHNLR HTWATLALAS NVHPKVVSER LGHASITITW RSTATFCPAC TPMPRKSSPG SSSALAGRPT RTRRTGPTAV ATRPTRTTPM AWVMTMRNRV RLRQTARPPA RGLLVRPLRS RPGRDQSVTK RAQTIRGGTV GEHLTCENAA VQRVHLLGRY SNTPEVLTDL QTVWAVVAET PGQGETQELP GLTGSGQVPR RHAIVDRLPA SDIETLISLY LAGSTARALA ARYSISLTAV KTLLRKRGIR RNRRSAEPS
|
| |