Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5393 |
Symbol | |
ID | 5673725 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6505394 |
End bp | 6506470 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641244249 |
Product | hypothetical protein |
Protein accession | YP_001509655 |
Protein GI | 158317147 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.896467 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCATTTCG ATGCCACTGG CAAAGTCTCA CTAGATCACA TCTACGACCA GCCTGACCCG CGAGCGTATT TCACGACGCT GCGTACGCTC GCCTATCAGG TCCCCCAAAT CGCCAGGCCG TACTTCATCG AGACCGTGGC CGACTACCGT GCAGCAGGCC GCCCGATGCC CGGTGATCCT CCCGGACGTC ACCCGTCGCC GCTGAAGATC CTCGACATCG GTTCGTCCTA CGGGATCAAC GCGGCACTGC TGCGCTGCGG GGTTACCCTC GAACACCTGT ACGAGCGCTA CGGTGCGCCC GAGGCCCACG GCTACACCCG TGACGAGCTG CTGGCCCGCG ACCGCGAACT TGTCCGTTCC ACCAGAGTGT CCCCGCCACC CACTTCCAGG CGTCCGAGCT CCGACGGGTG TTCGGGTGGC TCCCTGCCGC CTGGTGCGGT GTTCGTGGGC CTCGACGTGT CGAACGAGGC CCTCTCCTAC GCTCTCGCGG CAGGCTTTCT CGACGATGCC GTGCACGCCG ACCTGGAGGC TGCGGATCCC ACCGAACGGC AGCGACGGCA GCTTGCCGGC ACCGATCTCG TCATCTCGAC CGGCTGCCTC GGCTATGTTG GCGAACGAAC CATCTCCCGG GTTCTCGACG CCATCGAGGC GGCTGACGGA AAGCGACCGT GGATGGCGCA CTTCGTACTG CGGATGTTCC CGTTCGACGC CATCGGGGCG AGTCTCGCGG AGAGGGGATA CGAGACCGTC ACGCTCGACC GCATGTTCCG CCAGCGCAGA TTCGCTTCCG CGCAGGAACA GCAGCTCGTG CTGGACAGTC TCACCACGGT GGGCATCGAC CCGCACGGCC TGGAGAGCGA CGGTTGGATG TACGCCCAGC TGTACGTGTC ACGCCCGCGC GGCAGCTCCG CCGCACACAC CGCCAGCGCC TCCGCCGCGG CTACCGCCAG CACCTCCGCC GCGAGTATCG CGAGTACCGG CACTCCGGCA TCGGTATCCC CGGCGCTGGT TCCGCCATCA GCAGGAGCCC GACGGCCGTC GGGCCACCCG TCCCACGCCC CCCGAGGTCA CGAATGA
|
Protein sequence | MHFDATGKVS LDHIYDQPDP RAYFTTLRTL AYQVPQIARP YFIETVADYR AAGRPMPGDP PGRHPSPLKI LDIGSSYGIN AALLRCGVTL EHLYERYGAP EAHGYTRDEL LARDRELVRS TRVSPPPTSR RPSSDGCSGG SLPPGAVFVG LDVSNEALSY ALAAGFLDDA VHADLEAADP TERQRRQLAG TDLVISTGCL GYVGERTISR VLDAIEAADG KRPWMAHFVL RMFPFDAIGA SLAERGYETV TLDRMFRQRR FASAQEQQLV LDSLTTVGID PHGLESDGWM YAQLYVSRPR GSSAAHTASA SAAATASTSA ASIASTGTPA SVSPALVPPS AGARRPSGHP SHAPRGHE
|
| |