Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1058 |
Symbol | |
ID | 5669472 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 1241443 |
End bp | 1244232 |
Gene Length | 2790 bp |
Protein Length | 929 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641239987 |
Product | hypothetical protein |
Protein accession | YP_001505420 |
Protein GI | 158312912 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0721278 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCCAGC CACGGCGTTC CTCCCGGTCG GGCCGCTCCG GCGGCCCGGG TGAGGCGGGC GAGGGCCGGG GCGGCGCCGG TGGTTACCGC TGGTTCGCCC CCGAGGACCT GCCCGGCGAG GAGCTGCCCG ACACCCGGGA GCAGGGCGAG GACGATCGGG GCGAGCCCCG GTCGGACGGG CCGGAGCGGA CGTCCGCCTG GTACGACACG GACGACGGCA CCGACCCGGG CGACGGGGAG TACCTCCGTC CCAGGGGCTA TGTGTCACGG CTGCCGGGCG GCCGGTCGGG GCGCGAGCGC TCCGACTCCG GTGGCGCGGG CGCGCCGGCC GGGGGCGGTC GTGGATTCGC CCCGGACACC TACGAGCGCG CCGGGTACGG CCCACCGGAC CGCGGCGAGG GAGATCCGGA GGCGGGCTAC CGCTCCGCCT ACCCGGACGA GGCCTACGGC GACCCCGGGT ACGCCGGCGG CAGGTACGAG GAGAGCACGT ACGAGTCCGG GCCGGAGGGC GAGGACCGGT ACGCGGGCGA CGGCTACCGG CGGCCCGAGA CCGCCGACGG CGAGTACGAC CGCGGTCCCG GGTACCCGGG CGGGCCGCGG TACGCGGGCG GTGACGGGTT CGCCGACGCT CCCGGCTTCC CCGGTGAGGC GGCTTTCACG GGCGAGGCGG ATTACACCCG CGGTGGCGAG CCCCCGCTCC CGCGCCGGAC CAGCTACAAC GCCGCTCGCG GCTATCCCGA CGACGGCGGC TACCGCGGCG AGCCCCGGTA CGCCGGGGAC GGCGGCGGTG CCGCGGCGTA TGCCGGCACG CCGGGATACG CCCGCGACCC GGAGTTCGTC CCGGACGGCT CCGGCGACGC CGAGTACTCC GGTGATGCGC GGTTCGCTGA AGGTGACGCG CGGTTCGCGG GGGAGGTGCG GTACTCGGAC GACCCGCGGT TCTCGGGCGA CTCCCAGTTC TCGGGAGATG CGGAGTACCC GAGGGACACG GCGTTCGCCG GTGACGAGGA GTACTCGGGG GACGGCCCGC GTGCCGGTGG CGAGGGCTAC CCGGATGCCG CTCCGTACGT CCGCGGGTCC GACCGGTACG AACGCGAGGG CGACTTCCCG GACGGCCCGG CATACCGGCG CGACGCCGCC TACGCCGGTG GCGCCGACTA CACCGATGAT GCGGGCTACG CGGGTGGCGC CGCCTACGAG GCGAAGGCCG ATGCCGGCCG CGGCACCGCT TTCGCGCCCG GCGGCGGAAC CGGCTACGGC GGGGACACCG GCTACGGCGG GGACACCGGC TACTCCGGCG ACGGTGAGTA CACAGCCGCC GGCTACGACG GAGCCGACGA TCGCGGCGGT GCCGAGCTGC CGCCGATGCC CGCGCAGAAG ATCGATTTCT TCCGGCGCTA CGTGCGCCAG TCGACCGAGC CCGCCGGCAT GCCGGACGCC CCGGCCGGGG ACGGCGGGGA CTCGACGGGG GCCCGTCCCG CCAGCCGTCG GGACGAGTTC GACCTGCTCG ACTACGAGAT CCTCGGCGGG AGCGTGTCCG ACCGGCATGC CGGGGACGCC CCCGCCGGCC GCGTCCCCGG CCGGCTCGAC GACGCCGGGG ACCCGATCGG GGACTACCCT GCCGCGGAGC ACGCCTTCGC CGCGCGGGAC CGGGTGGATG TCCTCGTGCC GGGTGACCTC CCGACACCGG GTGACCTCCC GACACCGGGT GGCCACCTGG CGGACGACGC CTTGGCGGCC GACGACGTCG TGGCGACAGG CGACGTCGTG CCGGGCAGTG TTGTGCCGGA CAATGCCGCG GCGGGCGAGG TCGTGGCGTC CGGCGAGTTG CTGGAGCCCG ACGACGTCGT CGCGTCCGGT GATCTGTCGG CATCCGATGA TGTTGTGGCG TTCGATGACC TCCTGACGGG AGATGATCTC CTGACAGGAG ACGAGGCAGG CACCGGCAGG ACGGCGGACG GGGCCGGCGC GCAGGCGCTG GGCGCACCAC CGCCGGGCAC CGGCCTCGGC CTCGGTTTCG AGCAGGGGCC GTTCGACGGC ACCAGCCGGC CGGCCGACGC GGACGTGATC GACGGATCGG GCGCGGACGC CGTGCCCGCG CAGACCATGC CGGCGGGCAC TGTTCCGGCG GGCACTGTTC CGGCCGGCAC CGTTTCGGTG GGTGCCGGGG GCGTGCCGTC CGGCGCGGTC CCGGTGGGCC TGGCGGTGGG GGACACCGAT GGCCCGGCTG TGACCGGCGC GGTCCCGTCG GACGAGGCCC CGCTGGCCGC CGCCGTGGAC GGCGCTTTGA ACGGAGCCGC CATGAACGGA GCCGGAGCGG ACGTGTCCGC CGCGGCGCTG GTCGACCGGC AGGTGCTGGA ACGCCTCGCC GATCGGGTGG ACGAGCTCGT CCGCCTGCGC CGCCACGACG CCGAGCTGGT CGACCGCCTG CACGCGGAGA ACGGCCGGCT GCGCGGCGGT GAGCTGACCG AGGCGATGAC CCCGCTGCTG CGCGGGCTGG TCCGGCTGTA CGACCAGATG AGCAGCCTCG GTGCCGAGGA CGCCCAGAGC GTCGCCGGGA TCCTGCGCAA GCAGCTGCTC CAGATCCTGG ACCTGGCCGC GGACGTTCGT CCCTACGCGC CGGCGGCCGG CGACCCGTTC GATCCGGGGC GGTCGCTGGG GGTCCGCCGG GTCGGCACCG ACGATCCGGC GCTGGAGGGA ACTGTCGCCC GGACGGTTCG TCCCGGCTTC GTCCGCGGGG AGTCCATGGT CGTCCGACCT GCCGAGACCG AGGTCTACCG GGCCTTCTGA
|
Protein sequence | MFQPRRSSRS GRSGGPGEAG EGRGGAGGYR WFAPEDLPGE ELPDTREQGE DDRGEPRSDG PERTSAWYDT DDGTDPGDGE YLRPRGYVSR LPGGRSGRER SDSGGAGAPA GGGRGFAPDT YERAGYGPPD RGEGDPEAGY RSAYPDEAYG DPGYAGGRYE ESTYESGPEG EDRYAGDGYR RPETADGEYD RGPGYPGGPR YAGGDGFADA PGFPGEAAFT GEADYTRGGE PPLPRRTSYN AARGYPDDGG YRGEPRYAGD GGGAAAYAGT PGYARDPEFV PDGSGDAEYS GDARFAEGDA RFAGEVRYSD DPRFSGDSQF SGDAEYPRDT AFAGDEEYSG DGPRAGGEGY PDAAPYVRGS DRYEREGDFP DGPAYRRDAA YAGGADYTDD AGYAGGAAYE AKADAGRGTA FAPGGGTGYG GDTGYGGDTG YSGDGEYTAA GYDGADDRGG AELPPMPAQK IDFFRRYVRQ STEPAGMPDA PAGDGGDSTG ARPASRRDEF DLLDYEILGG SVSDRHAGDA PAGRVPGRLD DAGDPIGDYP AAEHAFAARD RVDVLVPGDL PTPGDLPTPG GHLADDALAA DDVVATGDVV PGSVVPDNAA AGEVVASGEL LEPDDVVASG DLSASDDVVA FDDLLTGDDL LTGDEAGTGR TADGAGAQAL GAPPPGTGLG LGFEQGPFDG TSRPADADVI DGSGADAVPA QTMPAGTVPA GTVPAGTVSV GAGGVPSGAV PVGLAVGDTD GPAVTGAVPS DEAPLAAAVD GALNGAAMNG AGADVSAAAL VDRQVLERLA DRVDELVRLR RHDAELVDRL HAENGRLRGG ELTEAMTPLL RGLVRLYDQM SSLGAEDAQS VAGILRKQLL QILDLAADVR PYAPAAGDPF DPGRSLGVRR VGTDDPALEG TVARTVRPGF VRGESMVVRP AETEVYRAF
|
| |