Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1642 |
Symbol | |
ID | 5670044 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 1957924 |
End bp | 1959321 |
Gene Length | 1398 bp |
Protein Length | 465 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641240560 |
Product | hypothetical protein |
Protein accession | YP_001505986 |
Protein GI | 158313478 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.121155 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGCAC AGGAGGGCCA GCAGGCGCTG GAGGCACAGC AGCCGCTGGA CGCGCTGGAG GCGCAGTTCG CGCAGTGGCG TCACTACGCG CAGCGCCGCC GGGAGCTGCG GACCGCCGAC GCCGACGAGC TCGAGGACCA TCTCCGCGGT TCCGTCGACG AGCTCGTCAT GGCCGGCCTG AGCGCCGACG AGGCGTTCCT GGTCGCGGTC AAACGGATGG GCAGCCTCGA CGAGCTGTCC CGCGAGTTCG CCCGGGAGCA TTCGGAACGG CTGTGGAAAC AGCTGGTCCT GACCGGCGGC CCGGCCGCGG ACACCCGCTC GCGGCGCGAC CTGCGGATGC TGGTGCTCTG CGCCGCGGGC GCGGCGGTGT CTGTCAAGGC GCCGGAGCTG TTCGGCGTGC GGATGACCGA CGACGGTTCG GCCTCGTTCT ACGGGCCGAA CCTCAGCCTG TTCGTCCTGC CGTGGCTGGC CGGCTTCCTC GCCTGGCGCC GCCAGGCCCC GCGCCCGCTG GTCGGGATCC TGGCGGCGCT GTTCGCGCTC GGCGCGGTGG CGGCCAACGT CTACCCGCTC GGCGACGATT CGCAGTCGGT GGTCGTCACC AGCATCCACC TGCCGATCGC CCTGTGGCTC GTGGTGGGCC TGGCCTACGC CGCGGACGAC TGGCGGTCGT CGCGCAGACG CATGGACTTC ATCCGCTTCA CCGGCGAATG GTTCGTCTAC TTCGTCCTCA TCGCCCTCGG CGGCGGTGTG CTCACCGTGT TCACGGCCGG CACCTTCGAA GCCATCGGAA TCGTTTCGGA CGACTTCATC TCGCAGTGGC TCCTTCCCTG TGGAGCGGCA GCCGGGGTCA TCGTGGCCGG GTGGCTCGTC GAAGCGAAGC AGAGCGTGGT GGAGAACATC GCCCCGGTGC TCACCAGGCT GTTCACCCCG TTGTTCACCG TGGTCCTGCT GGCCTTCCTC ATCGCCGTCT GCTGCACCGG CACCGGCATC GACGTCGAGC GGGACGCGCT GATCCTGTTC GACCTGCTGC TGGTCGTCGT CCTGGGGCTG CTGCTCTACT CGATGTCAGC CCGCGATCCG CTGGCCCCGC CCGACCTGTT CGACCGGCTG CAGCTCGCCC TGGTGGTGAG CGCGCTGGCC ATCGACGTGC TGGTCCTGCT GGCGGTCACC GGGCGCATCA CCGAGTACGG CACCACGCCC AACAAGGCCG CGGCGCTCGG GGAGAACGCC ATCCTGCTGG CGAACCTCGC CTGGTCGGCG TGGCTCCTGC TGAAGCTGGT CCGCCGGCAC ACGCCATTCG CGGCGCTCGA ACGCTGGCAG ACCTCTTACC TGCCGGTGTA CGCCGTCTGG GCCTGGATCG TGGTCCTCGC CTTTCCTCCG TTGTTCGGCT ATGCCTGA
|
Protein sequence | MTAQEGQQAL EAQQPLDALE AQFAQWRHYA QRRRELRTAD ADELEDHLRG SVDELVMAGL SADEAFLVAV KRMGSLDELS REFAREHSER LWKQLVLTGG PAADTRSRRD LRMLVLCAAG AAVSVKAPEL FGVRMTDDGS ASFYGPNLSL FVLPWLAGFL AWRRQAPRPL VGILAALFAL GAVAANVYPL GDDSQSVVVT SIHLPIALWL VVGLAYAADD WRSSRRRMDF IRFTGEWFVY FVLIALGGGV LTVFTAGTFE AIGIVSDDFI SQWLLPCGAA AGVIVAGWLV EAKQSVVENI APVLTRLFTP LFTVVLLAFL IAVCCTGTGI DVERDALILF DLLLVVVLGL LLYSMSARDP LAPPDLFDRL QLALVVSALA IDVLVLLAVT GRITEYGTTP NKAAALGENA ILLANLAWSA WLLLKLVRRH TPFAALERWQ TSYLPVYAVW AWIVVLAFPP LFGYA
|
| |