Gene Information |
Locus tag | Franean1_0172 |
Symbol | |
ID | 5668597 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 205839 |
End bp | 208469 |
Gene Length | 2631 bp |
Protein Length | 876 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 641239101 |
Product | hypothetical protein |
Protein accession | YP_001504545 |
Protein GI | 158312037 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.211375 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCCGT CGACAGGCCG GGTCCCGGGC CCGCCCGGAA CGCTCGCCCA CTGGCTGCGC GCGCTGCCGG ACGACACCCT GGTCCGGCTG TTCAGACTCC GGCCCGATCT CGCCACCCCG CCCCCACCAG ACTTCGACAT CCTCGCGGCG CGGCTGGAGA TCCGCGGCAG CGTGAGTCGG GCGCTGGAGC GGCTGGACAC CTTCACCCTC GAGGTCGTCC AGGCGCTGAC GCTGCTGCCG AGCCCGGTCT CCGTGGCCGA GCTGACCGCC TTCTGCGGCG GGCTCGACCT GCGTGCCGCC CTCGAGAGCC TGCGGGAGCG ACTCGTCGTC TGGGGCCCGG ACGAGGCGTT GCGCCTGGTC GGCGTCGCCT TCGACCTGCT GGGCGAGCGT CCGCTCGGGT TGGGGCGGCC GGTGCGGGCG TGCCTGGCCG GCTACCGGCA GGCGCAGCTC GCCCGGGTGG CGGCGGCCGT GGGGCTGCCC GACGCGGTGT CCGACGGGTT GTCCGCCAGC TCGGGCGGCC TCGGCGACTC GGACGACTCC GATCTCGATC TCGATCTCGA CTCCGAATTC CACCTTGGTG CCGGCAGCCA TCCCGTCCGG CGCGAGAGCA TGATCGAGAT GGTGTCGGCC GCCTTCGCGG ACCCTCACCG CGTCGCCGCC CTGCTCGACG GGTGCTCGGC GCGGGCGCGG CGGCTGGCCG AACGGCTCGC TGCCGGCCCG GCGCTGGGCG CTACCAGCGA CGCCGAACGT CTGCTGAGTG TGTCCTCCGC CCGCAGCCCC GTCGAGGAGC TGCTGGCCCG CGGGCTGCTC ATCGGCATCG AGCCGGGCAC CGTCGAGCTG CCCAGGGAGG TCGGCCTGGT GCTGCGCGGC GCCGACCAGG CCGGGCCGCT GCATCCCGAG CCGCCTGAGG TCACCGGCCG CGAGGTGGGG GCGGGCGCCG TCGACCCGGC CGCCGCGCTG GCCGCCGACG CGCTGGTCCG CGCGGTCACC ACGCTGCTCA CCGCCTGGGG GAGCACCCCG GTCACCCCGC TGCGCACCGG CGGGCTCAGC GTGCGCGACC TCAAGAACAG CGCCCGGCTG ATGGACGTCC CCGAGACCGA GGCGGCCGTC GTCATCGAGG CGGCGGCCGC GGCCGGGCTG GTCGACCTGA CACCCGGCAC CGACGTCCAG TTCGTGCCGA CCAACGTCTA CGACCGGTGG TGCACCGAGA CGGTGGCCAT GCGGTGGGCC GTCCTCGCCG AAGGGTGGCT GCGCTCGCCG TCGGCGGCCT GGCTGGTCGG CGGGCGCGAC GAGCGCGGCC GGCAGATCGC CGCGATGTCC CTCGACGCCC GCCGTCCCGG CGCCCCGGAC CTGCGCGCCG ATGTGCTGCG CGTCGTCGCC GCCGCGCCGG AGGGCTTCGC GCCCACCCCC GAGTCGGTGC GCGCCCGCCT GGCCTGGCGT TCCCCACGCC GGACGGGCCC GCTGCTCGAC GGGATGATCG GCGGGACGCT CACCGAGGCC GAGGTGGTCG GGTTCACCGG GCGTGGTGCG CCCAGCACCC TCGGGCAGCT CGTCGCGCGC CGCCTCGCCG CCGCCGAGGC CGACGACCCG GGGCGTGGCT CCGGCAGGTC CCCGGCGGCC GACGAGGGGC TGTGTCGCCT GCTCGCCGAC GCGATGGCCC CGCTGCTGCC CGAACCCGTC GAGGAACTGC TCATCCAGAC CGACCTGACG GCCGTCGCCC CCGGCCCGCT CGTGCCGCGG GTCGCCGCCG AACTGTCCCG GATGGCTGAC ATCGAGTCCG CGGGCGCGGC CACCGTGTAC 
CGCTTCACCG AGAGCTCGCT GCGGCGCGCG ATGGACGCCG GCAGCTCCGC CGACGACCTC CACGACCTGC TGGGCCACCT CGCCCGTGGC GGCGTCCCGC AGTCGCTGAC CTACCTGATC GACGACACGG CCCGCCGGCA CGGACGGTTG CGGTCCGGGC CGGCCGCCTC CTACCTGCGC TGCGACGACA CCGCGCTGCT CACCGAGGTC GTCGCGTCCC GGCGCACCCA GGCCCTCGCG ATGCGCCGGG TGGCCCCGAC GATCGTCATC TCCCCCCTAC CGGTGTCCGA CCTGCTGGAA GGACTGCGCG CGGCCGGTTT CGCCCCGGTG GCGGAGGCCC CGGACGGGCG CATTGTGCTG GCCCGCCCGG AGGTGCACCG CACCCCCGCC CGCGCCCGCC CGCCCGCCGC GGAGTCCGTC CCGACCAGGT CGAACCAGCT GCGCGACGTC GTCCGTCTGG TGCGCCGGGG CGACGACAGC ACCCGCGCCG CGCGGGCGGC GCAGGACGCC GCGGGGGCGC AGCTCGGGCT CGCACGCTCC GCCCCGGTGA TCCTGGTGAT GCTGCAGGGC GCGGTCCGGG ACCGCCGGCG CGTCCTGCTC GGCTATGTCA ACCAGCAGGG GACTCCCAGC GACCGGGTCG TGCGCCCGAC GCTGCTGGAG GGCGGTTGGC TCACCGCGTG GGACGAGCGC AGCGAGGCTC CCCGCCGCTT CGCCCTGCAC CGGGTGACCG GGGTGGCCGA CATCGACGAC CCGTTCGGCG GCCCGCCGGT GCCCGACACC GGCGACTGGG TGGTTCCGCC GGCGGCCGAC GACCTGCCCG GCCCGCGCTG A
|
Protein sequence | MSPSTGRVPG PPGTLAHWLR ALPDDTLVRL FRLRPDLATP PPPDFDILAA RLEIRGSVSR ALERLDTFTL EVVQALTLLP SPVSVAELTA FCGGLDLRAA LESLRERLVV WGPDEALRLV GVAFDLLGER PLGLGRPVRA CLAGYRQAQL ARVAAAVGLP DAVSDGLSAS SGGLGDSDDS DLDLDLDSEF HLGAGSHPVR RESMIEMVSA AFADPHRVAA LLDGCSARAR RLAERLAAGP ALGATSDAER LLSVSSARSP VEELLARGLL IGIEPGTVEL PREVGLVLRG ADQAGPLHPE PPEVTGREVG AGAVDPAAAL AADALVRAVT TLLTAWGSTP VTPLRTGGLS VRDLKNSARL MDVPETEAAV VIEAAAAAGL VDLTPGTDVQ FVPTNVYDRW CTETVAMRWA VLAEGWLRSP SAAWLVGGRD ERGRQIAAMS LDARRPGAPD LRADVLRVVA AAPEGFAPTP ESVRARLAWR SPRRTGPLLD GMIGGTLTEA EVVGFTGRGA PSTLGQLVAR RLAAAEADDP GRGSGRSPAA DEGLCRLLAD AMAPLLPEPV EELLIQTDLT AVAPGPLVPR VAAELSRMAD IESAGAATVY RFTESSLRRA MDAGSSADDL HDLLGHLARG GVPQSLTYLI DDTARRHGRL RSGPAASYLR CDDTALLTEV VASRRTQALA MRRVAPTIVI SPLPVSDLLE GLRAAGFAPV AEAPDGRIVL ARPEVHRTPA RARPPAAESV PTRSNQLRDV VRLVRRGDDS TRAARAAQDA AGAQLGLARS APVILVMLQG AVRDRRRVLL GYVNQQGTPS DRVVRPTLLE GGWLTAWDER SEAPRRFALH RVTGVADIDD PFGGPPVPDT GDWVVPPAAD DLPGPR
|
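The Start bp, End bp, Gene Length, and Protein Length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the original record: it assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon (the gene sequence shown ends in TGA). The function names `gene_length`, `protein_length`, and `gc_percent` are hypothetical helpers introduced here.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq.

    Spaces and line breaks (as in the grouped sequence display) are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


# Values taken from the Gene Information table.
print(gene_length(205839, 208469))  # 2631
print(protein_length(2631))         # 876
```

`gc_percent` can be applied to the Gene sequence field after copying it as a string; the grouping spaces do not need to be removed first.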