Gene Information |
Locus tag | Franean1_1339 |
Symbol | |
ID | 5669750 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 1611132 |
End bp | 1612844 |
Gene Length | 1713 bp |
Protein Length | 570 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641240270 |
Product | hypothetical protein |
Protein accession | YP_001505697 |
Protein GI | 158313189 |
COG category | |
COG ID | |
TIGRFAM ID | |
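The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the listed gene length) and that the final codon of this unspliced CDS is a stop codon that is not counted in the protein length. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1611132, 1612844)
print(length, protein_length(length))  # prints "1713 570"
```

Both values agree with the Gene Length (1713 bp) and Protein Length (570 aa) rows of the table.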
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGGCC CGCGTCGCCA GCGGGCGCAC GCCCGCCATC GAGCGGGGCA TCGACCCTGG ACTTGTCCCG GCCGTCGCCC GGGGTGTCGC GAGCACGGCC GCAGCGGTCC GGCACCAGTT AGCTACCGTG ACGCACGCCG CGGTGTACGG CTTACGGTCG GCGTGCTGGC GGTGGTGAGC ATTCTCGGAT TTGCGGTGTG GACGTTCGTC GACCGTTCGT CTTCGCCGGA GGCCCTTGCT GCTTCGGTTG TCGCCTCTAC GGCGGATGAT CCGGATTGCC GCACGGCCAC CGCGACCTTC ACGGACGTTT CGGTGCCCGG GTATGGCGGC CCGGCGCCCG CAGGCTGGCA GCCAGCTCAG CGCTGCCCAT CCGATAGTCG GCTGGGGGCG CGATCGCCGG AGAGCGGTTT CGCAGCCCAG CCGGCGCAGA TTCCGGGAAT CGGCGGGATT GTTGGTGGTC TGGTGGGCGG AGGCATCACG GGGGTTATGG AAACGGCGGT CGAAACCGTA GTGAACCGGG TGTCGCAGAA TCTGGTTGAC GCGGCCAAAA GTATGATCCT CGAGTTCCTC GGGGCATCGA CCCGGCCGCA GGTGACAGCA GAGGAGTTTA TCGGCCCTCA CGGTGCGTAC CACAGCACGG CATCGATGGC CACTCTGCTG CTCGTCGGAT GCGTGATGAT CGGTGTTGGG CAGGGACTCT GGTCTGGCGA GCCGGTCCAG GCCATGCTGC GACTTCTTGG CGATATACCC GTCGCGGTAC TGGCCATACT GGGTTTTCCG TGGGTCGTGG ATCAGCTGGT CACGATTTCC GATGTGATGG CCGACTGGGT GCTCGGGAAC GACCTTCGTA CCAGGAATGA GATCCTCGAT CTGGTCGTGC CTTTCTCTGG CGGCCCAGAC GGTAACGTCG GGTATCTGAT CCCCCGGATC TTCGTGTATC TCGGGGTCGC CTTGATATAC CTGGAACTCG TCGTGCGGAA CGGTCTGATC TACATGGTCG TCGCGCTTGC CCCGCTGTCA TTCATGGCGA TAACGATGTC AGGGGCTAAA TTGGCGGCGC GGAAGGCTGT CGAAATGGTT GTCGCCATAA TTTTGATCAA GCCGGCGGTA TTCGTCGAGC TACGGGTAGG GCTCGACCTC GCTCATCCGG GTCTCGGTAG CCCGGCGGCC GATGGTGATG CGTGGGGAGA AATCTTTGTC GGCATGGCGA TCGTGTTTAT CGCCGCGTTC ATGCCATGGA TCATCTGGCG CCTCATGCCC CTGATGGAAC ATGCGATGGT CGCACAAGGA GTTGCCCGAG CGCCGTTCCG CGCTGGGATG CAGGCCATGC AGATGGTGTA CTTCGGGTCC GCGCTGGCCG GGCGCGGCGC CCGCGGTGGA GCAGGCGGTG GACGCGGCAG GGTGTTCGGG CAGCAGCCAG CCGGCGCTGG AGGCGGAGGC GGGTTCGGCC CGCCTCGGAG TCTGACCGGT ACGGGTGCCT CCGCGAACGG CCCAACCCGC CCGATGACAC GTGACAGCTC TGGCGCGGGC AGTGAACGGC GCGGAACTAG GCGGGCAGAG ACACCTTCTT CGTCGAAGCC GCCGTCTGGT CCGGACCTGG GGGAGCGTTC GCTGGGTGAT CCGCGGTCTG GCGGGCGCGT TCGGCGGGGC GAGCCTCCGC CTGCTCCGCC CGGTGACCGG CGAGGACCCG AAAGTCCGCG GGGCCGGTCA TGA
|
Protein sequence | MTGPRRQRAH ARHRAGHRPW TCPGRRPGCR EHGRSGPAPV SYRDARRGVR LTVGVLAVVS ILGFAVWTFV DRSSSPEALA ASVVASTADD PDCRTATATF TDVSVPGYGG PAPAGWQPAQ RCPSDSRLGA RSPESGFAAQ PAQIPGIGGI VGGLVGGGIT GVMETAVETV VNRVSQNLVD AAKSMILEFL GASTRPQVTA EEFIGPHGAY HSTASMATLL LVGCVMIGVG QGLWSGEPVQ AMLRLLGDIP VAVLAILGFP WVVDQLVTIS DVMADWVLGN DLRTRNEILD LVVPFSGGPD GNVGYLIPRI FVYLGVALIY LELVVRNGLI YMVVALAPLS FMAITMSGAK LAARKAVEMV VAIILIKPAV FVELRVGLDL AHPGLGSPAA DGDAWGEIFV GMAIVFIAAF MPWIIWRLMP LMEHAMVAQG VARAPFRAGM QAMQMVYFGS ALAGRGARGG AGGGRGRVFG QQPAGAGGGG GFGPPRSLTG TGASANGPTR PMTRDSSGAG SERRGTRRAE TPSSSKPPSG PDLGERSLGD PRSGGRVRRG EPPPAPPGDR RGPESPRGRS
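The sequences above are displayed in space-separated groups of ten letters, so the spacing has to be removed before any analysis. The following is a minimal sketch of that cleanup together with a GC-fraction calculation of the kind that could be compared against the GC content row. The helper names are hypothetical, and the example uses only the first three groups of the gene sequence, not the full gene.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace that splits a sequence into 10-letter groups."""
    return "".join(grouped.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First three groups of the gene sequence shown above (not the full gene).
prefix = clean_sequence("ATGACGGGCC CGCGTCGCCA GCGGGCGCAC")
print(len(prefix), prefix.startswith("ATG"), round(gc_content(prefix), 2))
```

Applying the same two functions to the complete gene sequence would give a value to compare with the 66% reported in the table.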