Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1073 |
Symbol | |
ID | 5669487 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 1269206 |
End bp | 1270945 |
Gene Length | 1740 bp |
Protein Length | 579 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641240002 |
Product | hypothetical protein |
Protein accession | YP_001505435 |
Protein GI | 158312927 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.542041 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCGCGG AAGAACTCAT GGTGGCTGGG CCGCCCCTGA CCCTGTCGGG CCCGATCCGG CCGGGCCGCC CGTCGGCCCG TATCTCCCCG GACCACACCT CCCCGGAGAT CATCCCGCCG GACCGCATCT CGACAGCGCA GTCCGCCGCC GACATCCCGG GCACGCACAT CCCGGCCACC CAGCTCCCGA ACAACCAGGT CCCGGCGCCG TCGATGGCGT CCACCGGGCA CGCCAGGCCG ACGGCGAAGG CTGCCGCGGG CAAGGCCGCC GGCGGAACGT CCACCGGCGC GGGTGGCGCT GGCGGCGCCA GCAGCACCCG CGCAGACGGC ACCAGGGGTG CCGGTACCAG CCCCAACACC GGCGAATCCG GCACGCCGAC CCTGTTCTCC CGCCTGCTCG CCGAGGCCAA CGTCTCCGAC ACCCGCTTCG CACGTCAGGT CAACAACCGC GCCCGATCCC AGCGCCGCAT CGAGCTCGGC CTAGCCCGGA CGACCGTCGG GCACTGGCGC CGCGGCATGC GGCCCCGGGA CCCGATGGTC GCCGAGCTGG CCGCGGCCGA GCTCTCCGCC CTCGTCGGCT ACCCGGTGAC CCCCGCCGAC CTCAGCTGGC GGGGCGAAGC CAGTGAACGC GACGACCTCG GCCTCGCCGT CGCGGACATC CCCGACGACA CCCTGCGGAC GCTCGCTGGA CTTTCGGGAC GAGACATGCG GCGACGTGAC GTCCTACATG ACGGGGCGGC CTTCGTCGCC ACCGCCTTCG CCGACCCCGT GCTGTCCAGC CTCACCGGCA TGATCCGCCG GATCAGCGCG GACGTCCCGT CCTCCCCGTC CGGCGGAGCG ATGATCCGGG ACATGACCGA GACGTTCCGC CGCCTGGACG CCCGGTTCGG CAGCAGTGAG ATCCGCCCCC AGGTCGTGAC GTTCCTGCAC GACCGGACGC GGGCGGCTGT GGCGGGACCG GCCGACACCG ACACCTTCGG CGCCCTCGCC GAGCTGGCCC AGTTCAGCGG CTGGCTCGCC CAGGACTGCA ACCGCCAGGC GCTGGCCCAG CGCTACTACA TCCAGGCACT CACGCTGGCC GAGCACGCCG ACGACGTCAT GATGGCCGGC CGGGTGCTGT CAGCGATGAG CGACCAGTCC GCGGCCCTGG GGCACAACCG GCACAGTCTG TCCCTGGCGC GCGCGGCGAT CGACCGGTCC GCCCGGCAGT CCGCGCCGGC CGTGCAGGCG ATGCTGCAGG ACAAGCTGGC GTGGGCCCTC GCCCGCAACG GCGACGAGGC CGGCTGCATG CGTGCCCTGG ACGCGCTGGA GCGCACGATC TCCCGCGAGC CCGGCGACGC CCCGTCCTGG GCCGGGCACT ACAACATCGG CGACGTGGCC GAGTGTCAGG GCCACTGCCT CCTCCTGCTG GGCCGGGCGG AGATGGCCGA GAAGCGGCTG TTGGAGGCAC GTGACCTGCA GGGTCCGGCG CGGGCCCGGA CCCGCGCGTA CGCGGAGGCG GACCTGGCGC TGTCCTACCT GAAACGCCCG CGCCCCGAGC TCGAGGCGGC CCTCGAAGCC GGGTACCGGG CGGTGGAGGT GGCCGGCCCG GTGTCCTCCA CCCGGATCGT CAACAAACTC TCCGAGCTGG ACCGGACGAT CGCCGGCTTC TCGAAAGCCG TCGCGGCCCG TGAGTGGCGC TCACGCGCCG CTGGTCTCGT GCGACCTTCC CCCCAGCGGC CGGAACCCGC CGTCGGCTGA
|
Protein sequence | MVAEELMVAG PPLTLSGPIR PGRPSARISP DHTSPEIIPP DRISTAQSAA DIPGTHIPAT QLPNNQVPAP SMASTGHARP TAKAAAGKAA GGTSTGAGGA GGASSTRADG TRGAGTSPNT GESGTPTLFS RLLAEANVSD TRFARQVNNR ARSQRRIELG LARTTVGHWR RGMRPRDPMV AELAAAELSA LVGYPVTPAD LSWRGEASER DDLGLAVADI PDDTLRTLAG LSGRDMRRRD VLHDGAAFVA TAFADPVLSS LTGMIRRISA DVPSSPSGGA MIRDMTETFR RLDARFGSSE IRPQVVTFLH DRTRAAVAGP ADTDTFGALA ELAQFSGWLA QDCNRQALAQ RYYIQALTLA EHADDVMMAG RVLSAMSDQS AALGHNRHSL SLARAAIDRS ARQSAPAVQA MLQDKLAWAL ARNGDEAGCM RALDALERTI SREPGDAPSW AGHYNIGDVA ECQGHCLLLL GRAEMAEKRL LEARDLQGPA RARTRAYAEA DLALSYLKRP RPELEAALEA GYRAVEVAGP VSSTRIVNKL SELDRTIAGF SKAVAAREWR SRAAGLVRPS PQRPEPAVG
|
| |