Gene Information |
Locus tag | Franean1_3874 |
Symbol | |
ID | 5672237 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4603890 |
End bp | 4605032 |
Gene Length | 1143 bp |
Protein Length | 380 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641242752 |
Product | hypothetical protein |
Protein accession | YP_001508172 |
Protein GI | 158315664 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.104897 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0811727 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCTGGA GAGTCCGGGT TTCACTACCT GACCACCTGT TCCGCGGCGC GGGCGCGTGG GCGCCCGGCC GACCGAGATG TGAGGGAGAC ACCATGGCGA TCGACCTGAC CGGCGGTCTG AGCGACGACC GCGAGTACGT GTTCCCCGCC CAGCCCGACA ACCCGGACCT GCGCGAGTCG GTCAACGCGT GGGTGTGGGA CGACGGCGTC GAGTTCGGAT TACCCCGAAT CGGCATCGAG GCCGCCGCGG ACCAGTGGGA CACCCACGAC ATCCAGGTGA ACATCGCGTT CGCCGGCGGC CGGGTGCTCA ACATGTACGG CCCCAGGAAG GTGCACGACC CGCTGGGCGC GGACGGGAAG GCCCGCATCC TGGGCGCGGG GCCGCTGTCC TTCGAGCTGG TCGAGCCCTA TCGGCACTGG AAGATGCACC TGGAGGGCCC CGCCGTGGTG ACCTCCGCCG AGGATCAGAT CGGCGGCTGG AAGCGCGGCG TCACCGGCGG GCCGACCGTC GAGGTGCGCC TCGAACTGGA CATCAGGCCC GCGGTGCCGC CGTGGGAGAG CGGCACGCTC CTCGAGGAGG CCGACCGGGT CCTCGCCACC CAGGAGGAGG GCGACCTGAT GGGCGGCCCC CGCTTCGAGC AGCTCTCCCG GGTGACCGGC CGCCTGCAGG TCGACGACGA GGTCCACGAG CTCAACGGCG GCGGCCTGCG GATCCGCCGC GCGGGCGTTC GCCGGCTCGC CACCTTCCGC GGGCATGTCT GGCAGTCGGC GCTGTTCCCG AGCGGGCGCG CGTTCGGGCT GTGCCTCTAC CCGCCGCGCG CCGACGGCAA GCCGACCTTC AACGAGGGCT TCCTCTTCGA GGGCGACGGC GCGCTCATCC CGGCCTGGGT CGTCGACGCG CCCTGGCTGC GCGAGCTGCG GCCCACCGGC GAGGATGTCT CCGTCACCCT CGAGACCGAG GACGGCCGGA CGACGACGAT CCACGGCGAG TCGCTCCTGT CGACCTTCGC GGTGATGGGC GCGGATATCG GCTCCCCCCA GCGCCTGAAC CTGCAGCAGG CCATCGCTCG CTACACCTGG GACGGCGAGA CGGCCAACGG CATGATGGAG CGCTCCAGCG TGAGCGACAC GGTCGCCCAG TGA
|
Protein sequence | MAWRVRVSLP DHLFRGAGAW APGRPRCEGD TMAIDLTGGL SDDREYVFPA QPDNPDLRES VNAWVWDDGV EFGLPRIGIE AAADQWDTHD IQVNIAFAGG RVLNMYGPRK VHDPLGADGK ARILGAGPLS FELVEPYRHW KMHLEGPAVV TSAEDQIGGW KRGVTGGPTV EVRLELDIRP AVPPWESGTL LEEADRVLAT QEEGDLMGGP RFEQLSRVTG RLQVDDEVHE LNGGGLRIRR AGVRRLATFR GHVWQSALFP SGRAFGLCLY PPRADGKPTF NEGFLFEGDG ALIPAWVVDA PWLRELRPTG EDVSVTLETE DGRTTTIHGE SLLSTFAVMG ADIGSPQRLN LQQAIARYTW DGETANGMME RSSVSDTVAQ
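The coordinates and lengths in the Gene Information table can be cross-checked with a little arithmetic. The sketch below is illustrative only. It assumes the Start bp and End bp values are 1-based and inclusive (which agrees with the listed Gene Length) and that the CDS includes its terminal stop codon (the gene sequence ends in TGA). The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Protein length, assuming the CDS ends with one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the untranslated stop codon.
    return cds_length_bp // 3 - 1


# Values copied from the record above.
START_BP = 4603890
END_BP = 4605032

length_bp = gene_length_bp(START_BP, END_BP)
length_aa = protein_length_aa(length_bp)
print(length_bp, length_aa)  # 1143 380
```

Both results match the Gene Length (1143 bp) and Protein Length (380 aa) fields of the record.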