Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1061 |
Symbol | |
ID | 5669475 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 1250929 |
End bp | 1252746 |
Gene Length | 1818 bp |
Protein Length | 605 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641239990 |
Product | FHA domain-containing protein |
Protein accession | YP_001505423 |
Protein GI | 158312915 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00105103 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCTCGA CACCCGGCGG CCGACCGACC CCTCAGGTCG ACCGCACCTC CCGGTTCGGC TTCGGCCGGA ATCCCGCGGG CGCCCCGGCG TCCCGGCCGC CGCGGCCGCG CCGCCGGCAC CGCTTCGGCC GTGGGCCGCG GCTTGGCCTG GCAGCCGGCT TCGTGCTGGC GCTGTCCCTG CCGCTGGTGG CGCCGGCCGG TGCCGCGCCG ACGGCCGCCA CGCCGACACC GTCGCCCACC GTGGGCACGA CCCCGGAGGC TGACGGAACC TCGTCGCCCG TGCCGTCCTC GTCGGCCGGG GGTGTCGCCG ACCAGCAGGT CCCGGTCGCC GGCCTGACCT CCGGCCCGGC CACCGCCGGG CAGTCGAAGC TGACTCTCGT CCTGTTCGAC AGCGACATCG TGCTCGTCGG CGGGCGTGGC TTCGGGCCCG GCAAGGACGT GGCGGTCACC GCCGCGACCA CCGACCTCGG TGGCTCCGCG AGCGCCCGGG CCGGCGCGGA CGGCCGGTTC ATCCTCGGCT TCCAGGTGCC GCTCGGTTTC TCCGGCCGGG TGACCGTGAC GGCGAAGCAG GAGTCGGTGG AGGCCAGCGG CACGCTCGAC GTCGTCGACG CTGGCACGCC GGGCCTGGCG AACGGGGCGG CCAACGGCGC CGAGGTCCCC GCACCAGCCC CGACGGCGAG TCCTGTCCCG GGCCCCGCGG CCACCCCGGC GCCGACGACC GCCGAGCCCA CGACTTCGGA GCCCACGACT TCGGAGCCCA CGGCCTCCGG GCCCACCAGC TCGGAGCCGA CGGGCACCGC TCGGGCCACC ACCCCCCCGA GCACCGGCCG GCGGGGCTCG TCGGCGCCGA CCGCCGTACC CGCGCCAGCG CCCACGGCCG CGCCGTCGGC GGGGAGCGGG ACGGGCACCG GCACCACCAC CGGCGGCACC GGCCGGCTCT CCGGCCTGCC CTGGATGTCC GGCGTCTACC CGTCGCACGT CCTGTCGCAG GTCATGTCCT TCGGGACGTG GCGCGGGCGG GCGAACGACG TGGCGCACGT GTTCACCGTC CGTACCCAGG GCTGGAACGC GATGGTCGAG CCGCGCTGGC CGCTGGACCT GTACAAGGCC TTCCCGGGCA AGCTGATTAT CAGCCAGCCG ACCTATCCGA AGGGCCAGGG CAACAACGCG GCCTGCGCCC GCGGCGAGTA CGACAGCTAC TGGAAGACCT TCGGCACGTT CCTCAAGAAC AACGGCCGCG CCGATTCGAT CGTCCGCATC GGCTGGGAAT TCAACGGCAA GTTCATGTAC TGGCACTCGG ACCCGGCCGG GACGGAGTTC CGCGACTGCT TCCGCAAGAT CTCGACTGCC ATCCGCTCGA CGGACCCCGC GGTGAAGATC GACTGGACGT TCAACGCGCA CGCCTCGCCG GTTCCCAACG GGGGCACCCC GTGGGCGGCC TACCCTGGTG ACGAGTACGT CGACTATGTC GGCATCGACT CCTACGACTG GTACCCGCCG TCGCGGGACG AGGCCACCTG GAAGAAGCAG TGCGAGGACC CGAACGGCCT GTGCTACCTG CTCGAGTTCG CCCGCCAGCA CGGCAAGAAG GTGGGCGTGG GCGAATGGGG CGTGTCCTCG TGCAGCCGCA ACGGCGGCGG TGACAACCCC TTCTACATCC AGAAGATGTT CGACACGTTC ACGAAGTACG CGGACGTGAT GGCGTACGAG TCGTACTTCC ACGACGCGGC GCCCGGCAAC GTCTGCTCGA CCATCATGAA CGGCGGCCAG AACCCGAAGG CGTCCGCCCT GTACAAGAAA CTGTTCGGCT CGGTCTGA
|
Protein sequence | MASTPGGRPT PQVDRTSRFG FGRNPAGAPA SRPPRPRRRH RFGRGPRLGL AAGFVLALSL PLVAPAGAAP TAATPTPSPT VGTTPEADGT SSPVPSSSAG GVADQQVPVA GLTSGPATAG QSKLTLVLFD SDIVLVGGRG FGPGKDVAVT AATTDLGGSA SARAGADGRF ILGFQVPLGF SGRVTVTAKQ ESVEASGTLD VVDAGTPGLA NGAANGAEVP APAPTASPVP GPAATPAPTT AEPTTSEPTT SEPTASGPTS SEPTGTARAT TPPSTGRRGS SAPTAVPAPA PTAAPSAGSG TGTGTTTGGT GRLSGLPWMS GVYPSHVLSQ VMSFGTWRGR ANDVAHVFTV RTQGWNAMVE PRWPLDLYKA FPGKLIISQP TYPKGQGNNA ACARGEYDSY WKTFGTFLKN NGRADSIVRI GWEFNGKFMY WHSDPAGTEF RDCFRKISTA IRSTDPAVKI DWTFNAHASP VPNGGTPWAA YPGDEYVDYV GIDSYDWYPP SRDEATWKKQ CEDPNGLCYL LEFARQHGKK VGVGEWGVSS CSRNGGGDNP FYIQKMFDTF TKYADVMAYE SYFHDAAPGN VCSTIMNGGQ NPKASALYKK LFGSV
|
| |