Gene Information |
Locus tag | Franean1_2700 |
Symbol | |
ID | 5671091 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3194321 |
End bp | 3195517 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641241612 |
Product | hypothetical protein |
Protein accession | YP_001507032 |
Protein GI | 158314524 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
|
|
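The coordinate, length, and protein-length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS ends in a stop codon that is not counted in the protein length. The record does not state either convention, and the function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 3194321
END_BP = 3195517
GENE_LENGTH_BP = 1197
PROTEIN_LENGTH_AA = 398


def span_length(start, end):
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons minus one, assuming a single terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for this record under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 3195517 - 3194321 + 1 gives 1197 bp, and 1197 / 3 - 1 gives 398 aa, which agrees with the table.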
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.75707 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGACCCC TGGGGGTCGT CGCGGCAGCA CTGGTAGTTT CTGCCGCTGC GGTCGGCTGC TCGGCAGGTG AGTCCGAGAA CAACCAGGCA GGTGCCTGCG AAAGCCCTGG TGTGACATCC GACCAGGTGA AGATCGGATT CGTCGTCTCG GACTCCGGTC CCGGCCATGA AGCGCTGTCC TCGGCACGTG CAGGTGTGGA GGCCAGAATA GGTCTCGTCA ATGCGGCCGG CGGAATAAAT GGCCGTGAGA TCACCTATGA TTGGCGTGAT GACGGGAGCT CCGCCTCGGT AAACGTGGTG GCGACCGAGG AGCTTGTGCG GAGTGAGCCG GTTTTCGGTC TTCTGGCGGT GACGACTGCG CTCGACCCGT CGATGGATGC CCTGGAGGCG GCGGGGATTC CGGTGGTGGG GCTGGCGGCC AGTCCCGGCT GGGCTAAGCA TCGGAACATG TTCTCGTACT CGTACGAGGC TTCGCCCGTG ACGATCGGTC GCTATATTCA GTCTCTCGGC GGAAGAAAGG TGGCAGTGCT GGTCGCGGGA GCCATTGACT CTGTACCGGA GACCGTGGCG AAATATATTG CGGAGATGCG CGCGGCCGGC GTCAACGTCG TCGGCTCGAT TTCTTATGCG AGTGCGGCCG AGAGTCCATC CCGGGTCGTG CAAAGGATTG CCAGTCTTGG AGCTGATTCC ATAGTTGGCT TCACCACGGC GGAAGAATTT GCGGAAATCA TACAAGCCGC TCGCTCCGCC GAGCTGCGCA TCGTGGCCAG CGTTGCCCAG GCCGTGTATG ACCGTGCGTT GCTGCCCACG TTCGGGCCGG CGCTTGCAGG AGTTTCGGCT CCCGTGTACT TCCGCCCTTT CGAAGCAGGT GGCGAGGCCA TGGACCGCTA CCGCGATGCG ATGGCCCGGT TCGCCCCGGA GACCGGGGTT CCCGAGCGGC AGTTTGCGTT GCTTGCATAC GTCTATACTG ACTTGTTCAT CCAGGGCCTT GAGCTGGCTG GCGCCTGCCC CACCCGCGCA GCCTTCATCG AGGGTCTGCG AAAAGTCACC TCCTATGATG CCGGCGGTCT GATCGAGCCG GTGAGCCTCA GGGACAACCT CGGCGTGCAG CTCAGCTGTT ATGCGTTCGT ACAGGTCAGT CCGGCGGGTA ATGAGTTCCA GGTCGTCCAG GAGCGCGTGT GCGCTGACGG TAAATGA
|
Protein sequence | MRPLGVVAAA LVVSAAAVGC SAGESENNQA GACESPGVTS DQVKIGFVVS DSGPGHEALS SARAGVEARI GLVNAAGGIN GREITYDWRD DGSSASVNVV ATEELVRSEP VFGLLAVTTA LDPSMDALEA AGIPVVGLAA SPGWAKHRNM FSYSYEASPV TIGRYIQSLG GRKVAVLVAG AIDSVPETVA KYIAEMRAAG VNVVGSISYA SAAESPSRVV QRIASLGADS IVGFTTAEEF AEIIQAARSA ELRIVASVAQ AVYDRALLPT FGPALAGVSA PVYFRPFEAG GEAMDRYRDA MARFAPETGV PERQFALLAY VYTDLFIQGL ELAGACPTRA AFIEGLRKVT SYDAGGLIEP VSLRDNLGVQ LSCYAFVQVS PAGNEFQVVQ ERVCADGK
|
| |
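The gene sequence begins with GTG while the protein begins with M. This is consistent with translation table 11, where GTG can serve as an initiation codon and is then read as methionine. The sketch below is a minimal pure-Python illustration, not the database's own pipeline. The helper names are hypothetical, the start-codon set is only a commonly used subset of the table 11 initiators, and the codon table is built from the standard amino-acid assignments that table 11 shares with table 1. Only the first and last few codons of the record are used as input.

```python
from itertools import product

# Standard amino-acid assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the initiation codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq):
    """Remove the spaces that group the sequence into 10-letter blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate a CDS; a recognized start codon at position 0 is read as M."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence in this record.
    print(translate("GTGAGACCCC TGGGGGTCGT CGCGGCAGCA"))  # MRPLGVVAAA
    # Last 9 bases of the gene sequence, ending in the TGA stop codon.
    print(translate("GGTAAATGA"))  # GK
```

Passing the full Gene sequence field to `translate` and `gc_content` would allow a comparison with the Protein sequence and GC content fields.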