Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_7336 |
Symbol | |
ID | 5675637 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 8975238 |
End bp | 8977577 |
Gene Length | 2340 bp |
Protein Length | 779 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641246173 |
Product | hypothetical protein |
Protein accession | YP_001511561 |
Protein GI | 158319053 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.840083 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTGCTGC CGAGAGTGGG CGGGCCGCCA CCGGAACCGA CCACGACGCC CGTGACCATC CCCGAGACGC CCGTGACCAT CGCCGCCGAG CTCGGTGGGC TCGGTGGGCC GGCTGACCTC GGCGGTATCG GCGAACACCC GTCCGCGGCG AGCGGGCCCG CGCTGCTGGA ACGGGATTCC GCGTCCGCTG ACCTCCCTCC CCGGGCCCGG CCATGGGGCT GGCTGCGACT CCGGGCTCAG TCGCGGCGGT GGGCCCGGAG GGCTCGGCCG CGGCGGTGGG CCTGGTCGCG GCTGTGGCTG CCGGGATTCG TCGCGCTCGC GCTCATCGCT CTCGTGGTGC TGCTACGCGC CTACCTCGAC CGACCGTTCT GGTACGACGA GATCTGGCGC GGGCATTTCG TCAGTGAGCC GGTGGGCACC CTGTGGTCTG AGCTGGCCGC GGCGAACACG CCGTCGGCAC TGGGATGGCT GAGCGTCACC CGGCTCTCCG GGGATCTGTT CGGCTGGCAC AGCTGGGCGT TGCGGCTGCC GGGCTTCGCC GCGCTGCCGG CGCTCGGCGC GCTGACGGCT GTTCTCACCC GGCGGTTCTC CGGATCCGTG GCCGCCCTGT TCGCGGTGTG TTGGCTCTGC CTGAACGCGA CCTTCCTCGA CCTCGCCACC CAACTCAAGC CGTACACGAT CGAAACCCTT GCCGCGGTCG CCGTCGTCCT GCTGTGGATT GGCCCGGCGC CCGGTTCCGG AATCGGAGCC GGAGCCGGCT CTGAAGCAGG CGGAGCCGAC TCCGGAACCG GCGGTGTCCA CAGCGGCGGG GACGCCCGTC CGCCCGTTCG TCTCGGCTCG GTTGACCGTG GTCGTCTCGT CCGCCGGACG GCAGCTGGCG TGATCTCCCT GTTCTCCGTC CCGGCGATTT TTCTCGTCAT TCCGCTCGCG GTGATGGACG TCCTGCCCGG CCCGGACCGG CGGCGGAGGA TCGCCGAGAC GCTGCCGAGT GTGGTGCTCG TCGCCGCGCA CACGCTTCTC TTCATCGGCC ACCAGTCCGC GCAGCGCGCC GGGGGCTACT GGGACACGCA GTTCCTGGCC GGGCGGGACC CCTGGCAGGC CGTGAGTTTC GTCGCCGAAC AGCTGTGGCG GTTCGGCACC GGCTCACCGC CTGGGATCGA CCAGTTCGAT CCCAGTCTGG TGCCCGGATT TGTCGATCTT CCGGGGTACC TGCCGGGGCT GCTGGCCCCC GCGATCGTCC TGCTCGGGGC GACCGGCGCC GCCGCGCTCG CCCGCCGGTC GGACGGCCGC CGCGTGCTCG GCGCTCTCGC CGGGGCGGAG CTGCTCATGC TGGGCGCGAG CGCCGCCCGA TTCTGGCCGT TCGGGCCGAC GAGGACGAAC CAGTTCGTCG TGCCGATGTT CATCCTGGTC GTCGTGGTCG GTGGGGATCG CGTCATCCGC CTCGCCGCCC GCCGGGCGGG TCGGGCCTGG CACGCCGTCC GCGGTACGGG TGCCGTACCG CCCTTCTGCT CGGATGTCCT GCCGGTCTCC GGCTTGGGCG TCGTGCCCGC CTCCGACTCG GGTGCTGTGC CGGCCCCCCG TACGTCGCGA GACCCGGAGG TCACCGACGC CGCCGACGTG ACGGCCGTTC CGGATCACCC GGATAGTCCC GCCGATCCGC TCGTCCCGCA TGTCCTGGAT GTCGGGCGTG CCCCGGCCGC CCGGTGCGTG GTCGATGCGG GCGGTGCGGT GGCGGTGATC CTGCTCGTGG TCCTCGCGGC GGGCGTGGCG CTGTCGGGTG CTGTGTCCGG TGACAGGCTT GTCTGGGAGC GGCGGGATCG GATGCGCGGC CTCGACCTCA TGGTGGACGC GGCGGTGGCC ACCCGGCGGC TGGTTCGTCC CGGGGATCTT GTCGTGGTCG GCGGTCGGCT CGCCCGCCCG GGCTGGATCT ACGCGATGGA GGTCAGCGCG GACGCCCCCC GGGCGCCCGA AGGTCTCCCC CGGCCCCGCG TGGAGCGTGC CCCCGGAACC GCCGTGGGCG CCGCGGCCGG GACCGGCGCC GGAACCACCT CCAGGACCGA GTCGCCCCGG GTGGTCCGTG CGGAGACCGT TTTCGTCGAT CCCGGCCTCC GTGATTCCGT GGGCCGGCTG GTGCCACTTG CTGTGCAGCT GGAGAGGCCT CGCCGACCCG GACGCCTCGT CGTCTTCGTC TTCGATATCG ATGCGGGCGC GATGGCTCCG GGGCTGGCGG CGTTGCGGGA TGAAGGCTGG TGCCCCGGTG AGAGCTGGCG ATTCCGGCTG ACCGGTTCGG TCACGGTCTA CTCCGACTGC CCGAGGACTT CAGCCGCGAG CCGCGGATGA
|
Protein sequence | MVLPRVGGPP PEPTTTPVTI PETPVTIAAE LGGLGGPADL GGIGEHPSAA SGPALLERDS ASADLPPRAR PWGWLRLRAQ SRRWARRARP RRWAWSRLWL PGFVALALIA LVVLLRAYLD RPFWYDEIWR GHFVSEPVGT LWSELAAANT PSALGWLSVT RLSGDLFGWH SWALRLPGFA ALPALGALTA VLTRRFSGSV AALFAVCWLC LNATFLDLAT QLKPYTIETL AAVAVVLLWI GPAPGSGIGA GAGSEAGGAD SGTGGVHSGG DARPPVRLGS VDRGRLVRRT AAGVISLFSV PAIFLVIPLA VMDVLPGPDR RRRIAETLPS VVLVAAHTLL FIGHQSAQRA GGYWDTQFLA GRDPWQAVSF VAEQLWRFGT GSPPGIDQFD PSLVPGFVDL PGYLPGLLAP AIVLLGATGA AALARRSDGR RVLGALAGAE LLMLGASAAR FWPFGPTRTN QFVVPMFILV VVVGGDRVIR LAARRAGRAW HAVRGTGAVP PFCSDVLPVS GLGVVPASDS GAVPAPRTSR DPEVTDAADV TAVPDHPDSP ADPLVPHVLD VGRAPAARCV VDAGGAVAVI LLVVLAAGVA LSGAVSGDRL VWERRDRMRG LDLMVDAAVA TRRLVRPGDL VVVGGRLARP GWIYAMEVSA DAPRAPEGLP RPRVERAPGT AVGAAAGTGA GTTSRTESPR VVRAETVFVD PGLRDSVGRL VPLAVQLERP RRPGRLVVFV FDIDAGAMAP GLAALRDEGW CPGESWRFRL TGSVTVYSDC PRTSAASRG
|
| |