Gene Information |
Locus tag | Franean1_2814 |
Symbol | |
ID | 5671203 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3329479 |
End bp | 3331239 |
Gene Length | 1761 bp |
Protein Length | 586 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641241723 |
Product | hypothetical protein |
Protein accession | YP_001507143 |
Protein GI | 158314635 |
COG category | |
COG ID | |
TIGRFAM ID | |
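
The length fields in this table can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; neither convention is stated in the record itself.

```python
# Values copied from the Gene Information table.
start_bp = 3329479
end_bp = 3331239

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # 1761 bp
print(protein_length, "aa")  # 586 aa
```

Both results agree with the Gene Length and Protein Length rows.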
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGGTTCG TCGAGGAGTC CGGCCGGCGG GAGCGGACGT TCGATTTCGC CGTTCTGCCG GTCGAGCGGG ACGTCCAGCG ATGGCTGGCC CGGGTGTTCG CCCGCCGAGC CGGTCCACGT TCGGCGACGA AACGGATGGG CACCGCCGTG GGGCACTTCG ATGTCCTTCG GGGCTTCGCC GCGTCGCTCG CCGGGGCGTC ACCGTCCCCG CGCTGCCCGG TCGATCTGCG TCCCGCGCAC GTCACCGCGT TTCTCCTGCG CTACCGCGGT CAGCCCAGCG AGCGGGAGTA TCTCAAGCGG CTGCGAGTGC TGCTGCGTGA CGATCCGGAG CTGCCCGAGC CGACGCGAAC GGCGCTGTTG TCCGCACGGC TCGCGCCGGC GGCGCCGGCG AACCCGCTTG TCGGCTACAG CGACACGGAA TGGCAGCAGA TCATGACCGC GGTACGGCGG GACGTGCGGC TGGCCCGAGA CCGTATCCGG GCGAGCCGCC AGCTGCTCGA TCGTTTCCGC GTCGGCGCCG TGCCGTCCGA GGGCCCGGAG AGCGGGCTCG CCGTGTTGCT GGACGTCTTC GACCGCACCG GTGATCTTCC TCGGATCGAC TCGGGCGGGC ATTCCCGAGC CGTTCGGGAC GCCGGTGGCA TGACCGCCAT CGGTGGGCGG CTGTGCCTGT CCAGCGACGA GGCGGTCGCG TTCTGCCTGC TGCTGGTCGC GTTGACCGGG GAGAACTTCG GCACCGTCGC CGCGTGGCCG GCGGCACACC ACCTGCCCGA CGGTGGCCAC GGCGACACCG GTATCGCCCT GGTCGAAGCG GTCAAACCGC GACGCGGACC CGACCGGGAG CACATGGTCA TCGCGCTGGA GGACCTGCCC ACCGGACTGG AGGCATCCGG TGAGGAGACA CGGCTGTTCC GTTCGCCGCT GCGGGTCTAC CGGCTGCTGG TGGAGCTGAC CGAGCTCTCC CGCCGGCACG GCGTCCACAC GTCGGCGTTC AGCGCCTTCG TCGCCCGGCC CGGCCGGCTC GGCTCCCGCT GGGCCGAGGG GGTCAACGCC ACGGACCTGC TCTGGTGGGC CCGACGCCGC GACTTTCCCG CCGCAGCCGA CGCAGGTCCG GGCACGAAAC CGGCGGTGCA CGTCGGACGC CTGCGCCAGA CCGTGATCGA ACGCCGTCGG CAGCCGGTCG CCCATACCCG GCAGACCATG AACGACCACT ATCTGCGGCG CAGCCGCACG GTCCAGGACG ACAGCCGCAT GGTGGTCGGT GCCGCGCTGC GCGAGCAGGT CGACAGCGCG CGGACAGCCC AGAGCATGCC CGTACTCACC GTCGCCTTCC TTGCCCACGC CCGCCGCGAC CCCGCCGCCG CGGCGGCCAC GGCCGGGATG GACCAAGACA CCCTGCGCCG CCTGATCTCC GGGGTGCAGG ACACCGCCCT TGCCTCCTGC GCGGACCACC GCAACGGCCC GCACACCACG GCGGGACAGC CCTGCCTGGC GTCGTTTCTG GACTGTCTGG ACTGCCCGAA CGCCCGCGCG CTGCCCCACC AGCTCGGCGT GCAGATGCTG GCCGCCGAGC GGCTGCGCGC GCTGCGACCG AACATCACCC CGGCTGTCTG GGAGGCGCAC TTGCGCCGGC GTCTCGACCA GCTGGAGGAG ATCCTGAACC ACTACACTGC GGCCGAACGC GACCACGCCC GCGCCACCGT GACCGCCCGC CAGCAGCAGC TCGTAGACGA CCTGCTCGAC GGCCGATGGG ACCTGCGATG A
|
Protein sequence | MRFVEESGRR ERTFDFAVLP VERDVQRWLA RVFARRAGPR SATKRMGTAV GHFDVLRGFA ASLAGASPSP RCPVDLRPAH VTAFLLRYRG QPSEREYLKR LRVLLRDDPE LPEPTRTALL SARLAPAAPA NPLVGYSDTE WQQIMTAVRR DVRLARDRIR ASRQLLDRFR VGAVPSEGPE SGLAVLLDVF DRTGDLPRID SGGHSRAVRD AGGMTAIGGR LCLSSDEAVA FCLLLVALTG ENFGTVAAWP AAHHLPDGGH GDTGIALVEA VKPRRGPDRE HMVIALEDLP TGLEASGEET RLFRSPLRVY RLLVELTELS RRHGVHTSAF SAFVARPGRL GSRWAEGVNA TDLLWWARRR DFPAAADAGP GTKPAVHVGR LRQTVIERRR QPVAHTRQTM NDHYLRRSRT VQDDSRMVVG AALREQVDSA RTAQSMPVLT VAFLAHARRD PAAAAATAGM DQDTLRRLIS GVQDTALASC ADHRNGPHTT AGQPCLASFL DCLDCPNARA LPHQLGVQML AAERLRALRP NITPAVWEAH LRRRLDQLEE ILNHYTAAER DHARATVTAR QQQLVDDLLD GRWDLR
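
The sequences above are printed in space-separated blocks of ten characters. As a rough illustration of how the gene sequence relates to the protein sequence under translation table 11, the following sketch removes the spacing, computes a GC fraction, and translates a coding sequence. The function names (clean_sequence, gc_fraction, translate_cds) are hypothetical helpers introduced here, not part of any database tool, and the handling of the first codon (any table 11 start codon is read as M) is a simplifying assumption.

```python
from itertools import product

# Codon order T, C, A, G with the matching amino-acid string used by
# NCBI translation tables 1 and 11 (the two differ only in start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons listed for table 11; at position 0 they are read as M.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean_sequence(text):
    """Remove the block spacing and normalise to upper case."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq):
    """Translate whole codons until the first stop codon (hypothetical helper)."""
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks and last blocks of the gene sequence in this record.
head = clean_sequence("GTGCGGTTCG TCGAGGAGTC")
tail = clean_sequence("GACCTGCGATG A")
print(translate_cds(head))  # MRFVEE
print(translate_cds(tail))  # DLR
```

The two fragments reproduce the first six and last three residues of the protein sequence above. Pasting the full Gene sequence value into clean_sequence would allow the same check over the whole record.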