Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0739 |
Symbol | |
ID | 5669155 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 860622 |
End bp | 862364 |
Gene Length | 1743 bp |
Protein Length | 580 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641239666 |
Product | FHA domain-containing protein |
Protein accession | YP_001505103 |
Protein GI | 158312595 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.002863 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | TTGGTCGGCC CACACCCGCC CGGCGGCCCG GACGACCCGC TCCACGGCCC CACCACCGTG CTCGGCAACT CCGCGGCCCC GCAGGCGCCA CCCGTCACAG CACCGCAGGC GCCTCCCGCC GCGGCACCAG AGCAATCACC CGCCCCGGCA CGGCAGGCGC GACCTGCCCC CGTCCCTGGC CCGCAGGCAC CACCCGCACC AGCCCGGGTG GGCGGGGGCG AGACTTATGG CGACCTCACC CGACGGCTGA TCGAGACGTC GGTCTGGACG GACCGCATGC TCGCCGACGC CGGCATCCCC GTCCACGACG CACCCCTGGT CACGGTGGGC GGTGGGATCG GCTCGTTCGT CCTGGTCGAC TACCTGCGCA TCGCGGGCGC GCCGACCTCG GCGATCCGGG TGCTGTCCAA CATCGACACC CCCTGGCAGA CCTACCGGTA CCTGACCCGG GTCTCGCAGA TCCCCGACCA CGAGCGCATC CGCTCGGACT CGAGCTCGAC CCCGGACAAC ATCTGGGCCT TCCCGTCCTA CGCGGTGCGG GAGGCGTTCG CGGCCCGAGG CCCGCGCGGG TTCGTCGAAC CGCTGTGGCG GGTCGCGACC GAGCCACTGC TGTCGGACTA CTTCACCCCC CGCATCAGGA TGGTCTTCGA CGGCATGGCC CGGGAGGCCG CCCGGATCTC GTACCCCGAG ATGCTGGTCT CCGGGCAGGT GCGGATGGTC CGCCGCCGCG CGGACGGGGG GTACCTGACC GTCCTCACCC CCCCGGCCGG ACGGTCCGCG ACGAAGCGGA TCGCCTACCG CAGCCGGTAC GTCCACCTGG GCGTCGGCTA CCCGGGGCTG AAGTTCCTCC CGGACCTGCA GGAGTACCGC TCCCGGTATG GGGACGTGCG GCGGGTCGTC AACGCCTACG AGCCGCACGA GCACGTCTAC GACGAGCTGA TCCGGCACCC GGCGACCGTC GTGGTGCGCG GGGCGGGCAT CGTGGCCTCC CGCATCCTCG ACCGGCTGAT CACCGACCGC GACCGGCACG GGGCGAGGAC GCACATCGTG CACCTGTTCC GCACGTATGT GCGCGGCTCG CACGGCCCCA GCGTGTTCAT GCGCCGCCGG GGCGGCGACG GCTGGGCCTA CCAGGGGTTC AACTACCCGA AGTCGGTCTG GGGCGGCCAG CTGAAGGCAC GCATGCGCAC GCTGGAGGGC GACGAGCGGA AGGCGCTCTA CGACACGATC GGCGGGACGA ACACCCCCCG CCGGCGCCGG TGGCAGGCTC AGCTCGCCCG CGGCCGGCAC GAGGGCTGGT ATGTGACCCG GGTGGGCGAG GTGGAGCGGC TCACCCCCGG AACGGACGGA ACGGTAGTCA CCCGGGTCCG CACCGCCGAC GGGATGCTGG AGGTGCCGGC GGCGTACGTC ATCGACGCCA CCGGCCTGGT GGCGGACATC CGCGAGCACC GGGTGCTGGC AGACCTGCTC GACCACTCCG GAGCCGGGCA CAACCCGCTC GGCCGGCTGG ACGTGGAGCG GACCTTCGAG GTCCGCGGGA CCCGCAACGG GCCGGGGCGG CTCTACGCCT CGGGCGCGGC GACCCTCGGC GGCTACTTCC CGGGCGTCGA CACCTTCCTC GGTCTGCAGA TCGCCGCCCA GGAAATCTGT GACGACCTGG CCGCGGAGGG ATTCGTGCCC CGGATAGGGG TGGCACGTTC GGTGTCGCAG TGGGTGCGTT GGATGCGCAA CCAACCGGTC TGA
|
Protein sequence | MVGPHPPGGP DDPLHGPTTV LGNSAAPQAP PVTAPQAPPA AAPEQSPAPA RQARPAPVPG PQAPPAPARV GGGETYGDLT RRLIETSVWT DRMLADAGIP VHDAPLVTVG GGIGSFVLVD YLRIAGAPTS AIRVLSNIDT PWQTYRYLTR VSQIPDHERI RSDSSSTPDN IWAFPSYAVR EAFAARGPRG FVEPLWRVAT EPLLSDYFTP RIRMVFDGMA REAARISYPE MLVSGQVRMV RRRADGGYLT VLTPPAGRSA TKRIAYRSRY VHLGVGYPGL KFLPDLQEYR SRYGDVRRVV NAYEPHEHVY DELIRHPATV VVRGAGIVAS RILDRLITDR DRHGARTHIV HLFRTYVRGS HGPSVFMRRR GGDGWAYQGF NYPKSVWGGQ LKARMRTLEG DERKALYDTI GGTNTPRRRR WQAQLARGRH EGWYVTRVGE VERLTPGTDG TVVTRVRTAD GMLEVPAAYV IDATGLVADI REHRVLADLL DHSGAGHNPL GRLDVERTFE VRGTRNGPGR LYASGAATLG GYFPGVDTFL GLQIAAQEIC DDLAAEGFVP RIGVARSVSQ WVRWMRNQPV
|
| |