Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2245 |
Symbol | |
ID | 5670644 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2683309 |
End bp | 2684508 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641241165 |
Product | hypothetical protein |
Protein accession | YP_001506586 |
Protein GI | 158314078 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.350034 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0195881 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGATCGGA CGAACCGGGA CAGTCGTGCG GAACGGGAAG CTCCAACGGA CCCGGACAGT CAGGCGGCCC GCGTGGATGA AGATGAAGCG GACCGTGCCG GCCTGTCGGA GGAGAACGAC CTGACGATTC CGAGTGACGG GCCGGCTCGG AGTGGCGGGG CCGCTACCGC CGGTGGGGTG GCGCGGAGCG ACGGGACGGA TCGGATGGAC CGGGGGGAGC GGGTGGACCG GGAACGTCGC GCGGAGGCGC CGGCGGGCCT GCCGACGCCT CCGCAAACAG GCCGGTTCCG TAACCCGGCG ACCGGCCTGC CCAGGCTGTG GGTCGAGGCC GTCGTCCTGG TGGGGCTCTA CTACGTCTAC ACGGCCACCC GTGGCGTGGC GGGCTCGTCG GTCGGCGCCG CGACCGACAT GGGCTGGGAC ATCCTCCGCC TGCAGCAGCA CCTGCACATC GACATCGAGC TCAGCCTCAA CCGGTGGCTG CAGAGCATCC CGCCGCTGGC GGTCGCCTGC TGCTACTACT ACTCGACCCT GCACTTCGTC GTCACGCCGG CGCTGCTGGT CTGGATGTAC CGCCGCCACC CCGGCCGCTA CATCCGGGCC CGGTGGGCCC TGGTCTTCAC CACTCTGATC TCGCTGTGCG GCTTCTTCCT GTTCCCCACC GCCCCGCCCC GGCTCCTGCC CGGCACCTCG TATGTCGACA CGATGTCGCA CTTCGAGGCC TGGGGCTGGT GGAGCGGCGG CGCCAGCGCC GCTCCGGACG GCCTCGAGGG GCTGGCCAAC CAGTACGCGG CCATGCCCTC GCTGCACTGC GCGTGGGCGC TGTGGTGCGG CTTCATGCTG GCCCGTTTCG CCCGCACACC CCTCGTTCGA GTGATCGGCT GTCTCTATCC CGCTGCGACC GTGTTCGTCG TGATGGCAAC CTCGAACCAC TACATCCTGG ACGCCGTCGC CGGCTGGGCG GTGCTCGGCG TGAGCACGCT GCTCTCGCTG GCCATCACCG CCCGCGGCCG ACGCCGGCCG GCCGAGGCTC CGCCGACTCC CGCTCCGGCT GCCGCCGTGG TGCACCGGCC TGCCGCGGTG CCTCTGCCCG CCGTGGCGAA GAAGGCGACG GTCGACGTGG CGACCGCCGA CAGGGCCGCC GGGACCAAGG TGGCCGGGCG CCCGGGCCTG GAGCCTGACG TGGGCCAGGC CTCGGGCTGA
|
Protein sequence | MDRTNRDSRA EREAPTDPDS QAARVDEDEA DRAGLSEEND LTIPSDGPAR SGGAATAGGV ARSDGTDRMD RGERVDRERR AEAPAGLPTP PQTGRFRNPA TGLPRLWVEA VVLVGLYYVY TATRGVAGSS VGAATDMGWD ILRLQQHLHI DIELSLNRWL QSIPPLAVAC CYYYSTLHFV VTPALLVWMY RRHPGRYIRA RWALVFTTLI SLCGFFLFPT APPRLLPGTS YVDTMSHFEA WGWWSGGASA APDGLEGLAN QYAAMPSLHC AWALWCGFML ARFARTPLVR VIGCLYPAAT VFVVMATSNH YILDAVAGWA VLGVSTLLSL AITARGRRRP AEAPPTPAPA AAVVHRPAAV PLPAVAKKAT VDVATADRAA GTKVAGRPGL EPDVGQASG
|
| |