Gene Information |
Locus tag | Franean1_1954 |
Symbol | |
ID | 5670355 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 2350034 |
End bp | 2351758 |
Gene Length | 1725 bp |
Protein Length | 574 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641240875 |
Product | hypothetical protein |
Protein accession | YP_001506297 |
Protein GI | 158313789 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
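The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and assumes the reported protein length excludes the terminal stop codon; both assumptions are consistent with the values above but are not stated in the record. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the CDS includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(2350034, 2351758)
print(length_bp)                  # 1725
print(protein_length(length_bp))  # 574
```

Both results match the Gene Length and Protein Length rows of the record.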
| |
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0113455 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.562335 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACAGAGG CGCTCCTCCT GCTCCTCAGC TTTGCGCTAG TCATCGTATG TGGCATCTTC GTCGCGGCCG AGTTCGCCTT CGTGACGGTG GACCGGCCCA CGGTCGAGCG CTCCGCGGCG GACGGCGACA GCCGCTCGGC GGGCGTCCTC ACCGCACTGC GAAGCCTCTC CACCCAGCTC TCCGGCGCAC AGCTGGGCAT CACCATCACC AACCTGGCGA TCGGCTTCCT CGCCGAGCCC GCCATCGCGG ACCTACTTGA AGGTCCCGTC ACGACTCTGG GTGCCTCCGA GGGCCTGGCA CGTGGTCTCT CCGTGGCCCT GGCACTCGTG CTCGCGACCG CGTTCACGAT GTTGTACGGC GAGCTCGTCC CCAAGAACCT GGCCATCGCC AGGCCGCTGG GGACGGCGCG CGCCGTCCAG CGTCCGCAAC GGCTGTTCAC CCGGGTGACC AGCCCGGTGA TCCACTCGCT GAACAGCACC GCGAACGCGC TGCTGCGGCG GGTCAACGTC GAACCCCAGG AGGAACTGGC GTCGGCACGT TCCCCGCAGG AGCTGTTCTC GCTGCTCGGC CGGTCGGCCG AGCACGGAAC GCTGCCGCGG GAGACGGCGA CGCTCATGCA GCGCTCGCTG ACCTTCGGCG ACCGGGTCGC CGAGGACGTG ATGACCCCCC GGATGCGCAT GCAGTCGATC GACGCGGACG CCCCCGTCGC CGAGGTGATC AGCGCCGTCC GGCGGACCGG GCACGCCCGG TTCCCCGTCA TCGGCGACGG CAGCGACGAC GTAGTCGGCC TGATCCATGT CAAGCACGCG GTGAGCGTGC CGGAGGAGCG GCGCGACACG GTACAGGTGC GCACCGTGAT GATCCCCGCG GCGACGGTGC CGTCCTCGAT GCCCCTCGAG CCGCTGTTGG AGACGCTGCG CTCGGGCGGC CTGCAGATGG CGATCGTCGT CGACGAGTTC GGCGGGGTCG ACGGCCTGGT GACGGCGGAG GACCTCATCG AGGAGATCGT CGGTGACGTC GTCGACGAGC ACGACCGGGT CAGCCCCCGG GCGCTGCGCC GCCGGGACGG CAGCTGGCTC GTCTCGGGCC TGCTGCGGCC GGAGGAGGCC AGTGAGGTCA CCGGCCTGCC GATCCCGGCC GACGACGCCT ACCAGACGCT GGGCGGCCTG ATGTCGCGCA CCCTCGGGCG TATCCCCGGC ACCGGCGACA CGATCGTCCT GGACGGCATC CGCTACGAGG TCGAGCGGAT GGACGGCCGC CGCGTCGACC GCATCCGGCT CGACCCGCGG GGCGACGCCG CGACGAACCC GGCGCGGGCC GACATCGCGG AACAGCCCTC GGGCGAGGCG CCAGCCACGA CTCCGCCACC CGCGACCCCG GCGGACACGA AGGCCACCGA CGAGAATCCA CCGACCGTAG CGCCAGCGAC CACAGCGCCG GCAGGCACGG CGTCGGCGAG CACAACGGCG GCGGACGCGG CACCGACGGG CGCGGCGCAG GCAGACACGG CCCCGCCGAC GGGCACAGCG CCAGCGGTGG GCACAGCGCC AGCGGCGGGC ACAGCGCCAG CGGACACAGC ACCGCCGGAC ACAGCACCGC CGGCGGGCAC GAAGCCGCCG GGCACCGGGC CTGGACGGGC CCGCGGCCGG GACCGCGCTG ACGACGCGGG TCGGCGCGGG CGCACGCGGT CGGTCTCAGC CGGGGTCAGG GAGTCCGAGC GATGA
Protein sequence | MTEALLLLLS FALVIVCGIF VAAEFAFVTV DRPTVERSAA DGDSRSAGVL TALRSLSTQL SGAQLGITIT NLAIGFLAEP AIADLLEGPV TTLGASEGLA RGLSVALALV LATAFTMLYG ELVPKNLAIA RPLGTARAVQ RPQRLFTRVT SPVIHSLNST ANALLRRVNV EPQEELASAR SPQELFSLLG RSAEHGTLPR ETATLMQRSL TFGDRVAEDV MTPRMRMQSI DADAPVAEVI SAVRRTGHAR FPVIGDGSDD VVGLIHVKHA VSVPEERRDT VQVRTVMIPA ATVPSSMPLE PLLETLRSGG LQMAIVVDEF GGVDGLVTAE DLIEEIVGDV VDEHDRVSPR ALRRRDGSWL VSGLLRPEEA SEVTGLPIPA DDAYQTLGGL MSRTLGRIPG TGDTIVLDGI RYEVERMDGR RVDRIRLDPR GDAATNPARA DIAEQPSGEA PATTPPPATP ADTKATDENP PTVAPATTAP AGTASASTTA ADAAPTGAAQ ADTAPPTGTA PAVGTAPAAG TAPADTAPPD TAPPAGTKPP GTGPGRARGR DRADDAGRRG RTRSVSAGVR ESER
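The sequences above are displayed in space-separated blocks of 10 characters, so they need cleaning before any computation. The following is a minimal sketch of how the Gene sequence field could be normalized and checked against the reported GC content and translation table; the function names are hypothetical, and the example input is only the first three blocks of the gene sequence, not the full record. Note that the gene starts with GTG, an alternative initiation codon under translation table 11, which is why the protein still begins with M.

```python
def clean_sequence(raw: str) -> str:
    """Remove display whitespace (blocks of 10) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a whitespace-free nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Stop codons of translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a table 11 stop codon."""
    return len(seq) >= 3 and seq[-3:] in STOP_CODONS


# Illustrative fragment only: the first three blocks of the gene sequence above.
prefix = clean_sequence("GTGACAGAGG CGCTCCTCCT GCTCCTCAGC")
print(len(prefix))                   # 30
print(round(gc_content(prefix), 3))  # 0.667
```

Applied to the full Gene sequence field, the same functions give values that can be compared against the Gene Length and GC content rows of the record.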