Gene Information |
Locus tag | Franean1_3923 |
Symbol | |
ID | 5672284 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4692495 |
End bp | 4693607 |
Gene Length | 1113 bp |
Protein Length | 370 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641242802 |
Product | hypothetical protein |
Protein accession | YP_001508219 |
Protein GI | 158315711 |
COG category | |
COG ID | |
TIGRFAM ID | |
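The coordinate and length fields in this record are mutually consistent, and the check is simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither convention is stated in the record itself, and the function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 4692495
END_BP = 4693607
GENE_LENGTH_BP = 1113
PROTEIN_LENGTH_AA = 370


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


gene_len = span_length(START_BP, END_BP)
prot_len = protein_length(gene_len)
print(gene_len, prot_len)  # 1113 370
```

Under these assumptions the computed values match the reported Gene Length (1113 bp) and Protein Length (370 aa).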
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00983972 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0399777 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCCTGG GCCTGGCGTT CACCGCCTGT TCCAGTTCCA GCGACGACAC CGACGGCGGG GAGACGACGG CCGCGCCCGA CCAGACGGTG AACGGGGCGC CGGGGGTCAC CCAGGACGAG ATCCGCTTCT TGGTTCTCGG CACGAAGACC AACAACCCGA CCGGCTCCTG CACGCTCGAC TGCTTCTCCC AGGGGATCAA GGCCTACTTC GCCTTCCGTA ACAGCACGGG CGGGGTGCGC GGCCGCAAGC TGACCGTCAC CACCGAGATC GACGACGCGC TCGGGCAGAA CCAGCAGGGC GCGCTGCAGA TCATCACCAA GAACGACACG TTCGCCGACT TCGGCGCCGC GCAGCTGCCC ACCGGCTGGG GTGACCTGGT CAAGGCCGGC GTCCCGCAGT ACGTGTGGGC CATCCAGCCG CAGGCCATGG CCGGCCAGGA CTCGATCTTC GGCAACGCCG GAGTGACCTG CCTGGAGTGC ACGAACCGGA CCTTCACGTA CGCGGCGGAG CTCGCCGGCG CGAAGAAGAT CGGCGCCCTC GGCTACGGGG TCTCGGAGAG CTCGAAGCGC TGCACGTCGA CCATCACCGA CACGATCGAG CTCTACCACG ACAAGACCAG CCAGGAGGTC GTGTACAAGA ACGACGACCT GGCCTTCGGC CTGCCGAACG GGATCGACCC CGAGGTCTCC GCGATGAAGC GCGCCGGCAC CGACATGATC ATTACCCGCC TCGACCTCAA CGGCATGAAG ACGCTGGCAC AGGAGCTCGA GCGCCAGGGC ATGGGCGACA TCCCGCTTTA CCATCCGAAC ACCTATGACC GGAAGTTCGT CGCCGCGACG GGCGACCTGT TCGAGGGGGA CTACATCGGC GTCACCTTCC GCCCGTTCGA CGCCGACCTC GCCTACCAGG GAATTCTCGC CGCCGGACCT TCCTTTGACC ACGCCAAGGT CATCGCCGCG AAGAACGCCA TGACCGAGTA CAGCGCGGAC GGCCTGATCA ATCCCAGCGA CTGGAGCCGC CAGCACGAAA GCCCGACGCA GGACGACCCG GCGACCCACG GCTACAGGCA GGAGCGCTTC GCCATGGTCC AGGTGCGGAA AGGAAAGTTC TAG
|
Protein sequence | MALGLAFTAC SSSSDDTDGG ETTAAPDQTV NGAPGVTQDE IRFLVLGTKT NNPTGSCTLD CFSQGIKAYF AFRNSTGGVR GRKLTVTTEI DDALGQNQQG ALQIITKNDT FADFGAAQLP TGWGDLVKAG VPQYVWAIQP QAMAGQDSIF GNAGVTCLEC TNRTFTYAAE LAGAKKIGAL GYGVSESSKR CTSTITDTIE LYHDKTSQEV VYKNDDLAFG LPNGIDPEVS AMKRAGTDMI ITRLDLNGMK TLAQELERQG MGDIPLYHPN TYDRKFVAAT GDLFEGDYIG VTFRPFDADL AYQGILAAGP SFDHAKVIAA KNAMTEYSAD GLINPSDWSR QHESPTQDDP ATHGYRQERF AMVQVRKGKF
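The gene sequence is displayed in 10-base blocks, begins with ATG and ends with TAG, so it appears to be given in the coding orientation even though the gene lies on the minus strand. As a rough illustration of how the protein sequence follows from it, the sketch below strips the display spacing and translates the first 30 and last 12 bases. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares for elongation; the alternative start codons that table 11 permits are not modeled, and the helper names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop the whitespace that separates the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(cds: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, as displayed in the record.
head = clean("ATGGCCCTGG GCCTGGCGTT CACCGCCTGT")
# Final 12 bases of the gene sequence (last three codons plus the stop).
tail = clean("GGAAAGTTC TAG")

print(translate(head))  # MALGLAFTAC
print(translate(tail))  # GKF
```

The two outputs agree with the first ten and last three residues of the protein sequence listed above.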