Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1198 |
Symbol | |
ID | 5669611 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 1430065 |
End bp | 1431828 |
Gene Length | 1764 bp |
Protein Length | 587 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641240130 |
Product | hypothetical protein |
Protein accession | YP_001505558 |
Protein GI | 158313050 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.441597 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.262975 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCGGGCT CGGGCGGCGG CGGCTTCGGC GGATTCGGTG GCTTAGGCGG TTCAGGGGGA GCCGGAGGTG CGGGCGGTGC CGGCGGCTCG GGGGCCGTCA GCCGGCTACG CCGGTGGAGC AGGCGGGATG ACGCCGACAG CGCCCCGGTG CTCGACCTCA CCGACGTTGA CGAGGTCACC ACGATGTTGC TCGCGCCCCC GGCCGAGTAC GCGGGGCTGC GCCGGCGGGT CGTCGACCAG TTCACGATTG TCAGTGCCAA CCGCTGTCGG CTTACCCGCT CGGTGAACTG GGGCCCGCTC GACGGGCTGC TGAACAGCAT TGTTCCGGGT ACCGGGCGGC TCGACACCGT CGGCCGGCTG CCGCCGGAGG TCTGCCTGCT GCTGCCGATC TCCACGTTGC CCAAGCGTGC CCTGGTCGGA TTCAATCTCG CCGGTCCGAG TGGCTCCGAC GCCCACCTGA TGCCGTACGG GACGTCCGTC GCGATCCAGG GCAACCTGAT CGCGCTGCTC GCCGACGCCA TCGGCTGCCC GCTGCCCGGC CCGGCGCGCC GCGTCGTGGA CGCGATCTCC AGGTTCCGCC CGGGGCGGCT GTCCGGCAAC CTGCCTGGGC TGGTGCCCCG CAAGGGACGT CCGCTCTCGC TCGAGGCCGT CGCGACCTAT CTTGCGCGTG AGGCGGATCT GTCGGTGCCG GATGTGACAC TGCGGTCGTG GCAGGCGCTG CTCGAGCCGG CCCAGCTTCT GCTGCGCTCG GCGCTGGCGG AGCCCTTCGA CCCGCTGAGC AGTGCGGACA CGATGCTGCT CGCCGCCGGC GAACTTTGGC GTGATCCGGA CGTCCCCACA CCGCTGGAGG TCGCGCACAT CGGTTACTAC CTGCGGGAGT TCACCGCCTG GATCGACGAG CTGAGCCTGG CGGGCCCGGC CGCGGTGCCG GTGCTGAGCA CGGTGGCGGA GTACGGGCGG CGCTGGGAGG CGCTCGCCGC CGTCACCCTG GACCCATACC GCCCCTGCTT GATCAAGATG TCCGAGGAGC GGCGGACGGT GCTGGCTCGG CGTTGCCCCG TCCGTGACGA GTCGCTTACT CTGCGGCAGC GGTGGCTGGC GCCGGTGGCC CTCGTCGACA TCGACCCGGG CGGCCCGGGC AGCTACCACG TGAGCGTGGG CACCGACGAC ACCAGTATCG AGCTGAGCAC GCCGATCACG GTCGACCTGG AGCACCGCCG GATCGCCCGG ACCTACGTCG AGGACGTCCA CCAGAACCGT GAGGTGTACG CGTTCTACAC CACCGATGCG CGTCGGGCGG CGCGGGCGAA GCTGGTGGTG GGCCTCAGCG TCTCGCCGGA CGTCGCCCGC GTCACGCTGG CGATCCTGGT CCTGATGTGT CTCACCGTGG CCTTGTCGGC GTTGCCGTTC GAGCTCGGAG CTGACGCGGT CGCGGTCGTC GCGGTGCCGT CGTCGTTCGC CGCGACCCTG CTGCTGACCA GGGAGCGGTC GAGCCTCGCG GCCTGGGTGC TCGGCCCGGC CAAGCAGGCA CTGCTCGCGC TGCTGGTGGC GCTGGCCGTG CTTTCGGGGC TGCGTGCCCT GGGCTGGCAC ACCCCGCCGG CGGACCCGGG CGGGTCGATC ATGTCGGTTC CGGGAGTGTC GTCCCAGCTA GGGGCGCTGG CCCCGGCCGT GCCACCGGGG CGTTCGGTGA GCTTGTCCGC ATTGGTGGCC GCGACGGACC GGCCGGGCAC GTGGCAGGTG GGAGGAGCTG GTCCGCGCCG GTGA
|
Protein sequence | MAGSGGGGFG GFGGLGGSGG AGGAGGAGGS GAVSRLRRWS RRDDADSAPV LDLTDVDEVT TMLLAPPAEY AGLRRRVVDQ FTIVSANRCR LTRSVNWGPL DGLLNSIVPG TGRLDTVGRL PPEVCLLLPI STLPKRALVG FNLAGPSGSD AHLMPYGTSV AIQGNLIALL ADAIGCPLPG PARRVVDAIS RFRPGRLSGN LPGLVPRKGR PLSLEAVATY LAREADLSVP DVTLRSWQAL LEPAQLLLRS ALAEPFDPLS SADTMLLAAG ELWRDPDVPT PLEVAHIGYY LREFTAWIDE LSLAGPAAVP VLSTVAEYGR RWEALAAVTL DPYRPCLIKM SEERRTVLAR RCPVRDESLT LRQRWLAPVA LVDIDPGGPG SYHVSVGTDD TSIELSTPIT VDLEHRRIAR TYVEDVHQNR EVYAFYTTDA RRAARAKLVV GLSVSPDVAR VTLAILVLMC LTVALSALPF ELGADAVAVV AVPSSFAATL LLTRERSSLA AWVLGPAKQA LLALLVALAV LSGLRALGWH TPPADPGGSI MSVPGVSSQL GALAPAVPPG RSVSLSALVA ATDRPGTWQV GGAGPRR
|
| |