Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6954 |
Symbol | |
ID | 5675267 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8471885 |
End bp | 8473405 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641245803 |
Product | hypothetical protein |
Protein accession | YP_001511194 |
Protein GI | 158318686 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.166118 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGCAT CGGCATCCTT CGACACTGCC ACGACCGCTG CTATCGCGGT CATTCGCTCG GAGCTGGAGG ACCGTATCGC AGGAATACTG CACGGCTGTC GCGAACTACA CGGCGACAGC AGCCGTCGCC TTCTTCTCTC CCAGGTCGAA CGGCATTCCG GCGTCGCGAT CCCGATTGCC GAGTACCCCA GCTCCCGCCA GTGGTTCATA GGGTTCGTCG AGGCATGCTG CGGCACGCCG GCCGGCATCC GGGCCATCAT CGCCGTCGCC CGTGTGTTCG GACTCGGCCC CGCGGTCACC GTCCCGCTCA CCTGCCTCCA CGACGAGTGG GATGCCGCAC CCACAGCCGT CGACGCGGGA GAGGATCTGT GGACCAGGCT CAGATCCGAG CTCACGGCGA CGACCCGGGC CGCGGCCGTC GCCGCCTGCC GGCTCTCACC CCAGGCGGGG CTGTCGCTCC CGCCGGCGCA CTGCTCCAAC GGCTGGCTGG TGCTGCTGCA TCTCGCCGGC CGCAACGCCG GCCCGGACGG GCTGCCGGTC TACATCTCCT ATCTCGAAGG CATTCTCGAC CAGCTCTCAC CCGAGACGCG CGCCATGGTC GAGCACTGGA ACCAACGCCG CGCCTACGAG CTGGGGCTGA CCGGCAAACT CAACGACACA CGGCTCCGGA GGGGTGCCCG CGCCACCGCG GCACGGCAGA ATGTGCATGT CATTCTGCAG TTTGAGCTGG ACCACCTGGA CCCGTCGACG CACCTGGTCT CCTGGTGGCG ACAATGGGAC GAGGAGTCGG CGGTTTTCGA ACGAGGTGCC CGATTTTTTC CCGTCCCGTT GGGCGACCTG GAGATGTTCA CCGAAAAAGT CGTGACCGAC ACCGAGGCTC TCCTCTCGGA GCGGGAGGAC CAGATAATTC TTGAATTCGT CCTCCCCATC GACCTGATGC ACCTTCCCGT CGAACGCTGG CCGAAGGAAA GCGGTTCGGT CCTGCCGAAG CAGCTCGGTT GCGACTATCC CGTCGTGGTG CGAAGCCTGG AGCGGGTACA CAACCCGCAC TGGCGGCGGG CGTGGCGCAT ACGGTGGCGT ATCCTGCACG AGAAGCAGGA TTCGGCGAGC ACGTGGCAGA TCAGGGACGA CGGAGACGGC TACATCCAAA GGCTCGATGC CGACCTTCTC TCCGACGAGA ACCTGGTGGC CGCGGTCCTC AGCGGCGCTC CCGCGCCGGC CCGCGCCACC GCGGCCGAAC TGGAGATCGT CCTGCGCTGT GGCCTGCCCG TGATTCTGTG GCACCGTGAC GGCACCGCGG CCCCGACGGT TTCCGACGCG GTGAAGGAGT TCGTCGATCT CGGCGGCCCG GCCGATCTCC GAGCGCGTAC CCACCGGCTC CGACTCGACG CCGCACGCTC GGAAGTTTCA AAAGGCGATC ATCTCGGGCA CAATCTCGTT ATTTTATGGG ATGACCCGAA CCGGCGGCCG GAGAGGGGTC CAGACGATTC CCTCTCCGTC GGAGGAGAGG TGCAGCAGTG A
|
Protein sequence | MTASASFDTA TTAAIAVIRS ELEDRIAGIL HGCRELHGDS SRRLLLSQVE RHSGVAIPIA EYPSSRQWFI GFVEACCGTP AGIRAIIAVA RVFGLGPAVT VPLTCLHDEW DAAPTAVDAG EDLWTRLRSE LTATTRAAAV AACRLSPQAG LSLPPAHCSN GWLVLLHLAG RNAGPDGLPV YISYLEGILD QLSPETRAMV EHWNQRRAYE LGLTGKLNDT RLRRGARATA ARQNVHVILQ FELDHLDPST HLVSWWRQWD EESAVFERGA RFFPVPLGDL EMFTEKVVTD TEALLSERED QIILEFVLPI DLMHLPVERW PKESGSVLPK QLGCDYPVVV RSLERVHNPH WRRAWRIRWR ILHEKQDSAS TWQIRDDGDG YIQRLDADLL SDENLVAAVL SGAPAPARAT AAELEIVLRC GLPVILWHRD GTAAPTVSDA VKEFVDLGGP ADLRARTHRL RLDAARSEVS KGDHLGHNLV ILWDDPNRRP ERGPDDSLSV GGEVQQ
|
| |