Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6445 |
Symbol | |
ID | 5674760 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 7836951 |
End bp | 7838834 |
Gene Length | 1884 bp |
Protein Length | 627 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641245293 |
Product | hypothetical protein |
Protein accession | YP_001510688 |
Protein GI | 158318180 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.131326 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0967044 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGGACAA CCGGCACCGC GGCGGATCCC GCTGACCCCG CCGAGCCGAG CGACCCCCTC CCCGCGGATC CCCTCCCCCC TGATCCCGTC CCCGCTGATC CCGTCCCCGC CGGTGCGCGG CGGGACGGGC CGCGGCCGAG TCACCGGCAC CGACGTCAGC GGCGCCGCCG GCGCCCGCTG GGCCGCGCGG GCCTGGTCCC GGCGGCGGTC CTCGCGGGGT ACCTCGCGCT CGCTGTCGCC GTCTTCCGTG CCGCGTGGGC CGATCCCGGC GGCGTGGTTT ACGGCTACAG CGACTCGGTG CTGTTCGCGT GGTACCTGGG CTGGGTCCCG CACGCCCTGT CCGCAGGCAT CGACCCGTTC GTCACCTCGT ACCTGAACGC CCCGACCGGC ACGAACATCC TGTGGAGCAC GCCCGTCCCG CTGCTCGGGC TGGTGACCGC GCCGGTGACC GCGCTGTTCG GCCCCGTCGT CTCGCTCACC CTGCTGCTGA CACTGGCGCC GGCGCTCTCC GCGTTCGCCC TGTTCTGGGT GCTGCGGCGC TGGGTGCCGG CGCCGCCGGC CGCCGTCGCC GGCCTGCTCT ACGGCTTCGG CCCGTACATG GTCGGCGAGT CGTACGGGCA TCTGCATCTC ACCTTCGCGG TCTTCCCGCC GCTGCTGCTG CTCCTGCTCG ACGACCTGAT CGTGCGCCGG CGCCCACCGG GGCGCACCGG CGTGCTGCTC GGCCTGGCGG TCGCTGCCCA GGCCATGATC AGCGAGGAGG TGCTGGCCAC CGCCGCGCTG CTCGGCGCGC TCGGGCTCGC CATCGCCGGG CTGGCCCACC GCGCCGCCGT CCGCGCGCGG GCGGGTGCGC TGCTGCGGGG CCTGGCGGCC TGCGGCACGA CGGCCGGGAC GCTGCTCGCC TGGCCGCTGA CCGCGCAGTT CCTCGGCGAC CAGCGGGTCC ACGGCAACAT CCAGCCGCAC AACGTCGCGG TGTCCGATCT GCTGACCTTC GTCACGCCGA CCCCAGCCCA GCGGATCGCA CCCGACGTGG CGCTGCGGCA CAGCCTGCGT TTCACCGGCA ACGCGGTCGA GGTGACCGGC TACCTCGGGC TTCCCCTGCT GCTCGGGGTG GCCGCGATCG CCGTCCGGTT CCGCCGCGAG CCGCTGGTGG CGGTGTTCGC CCCGCTCGGC GCGGTGACGG CGCTGCTCTC CCTGGGCGGC CACCTGCACG TGGACGGACG GGTCACCGGC ATCCGCCTGC CCTGGCTGCC GCTGGAGAAC CTCCCGGTGA TCAGCAGCGC GCTCCCGTCC CGGCTCGCGC TGTATCTGGC GATGTCCGTG GCGATCGTCC TCGCGGTGGG CCTGACCCGC GTCGCCGCGT CCGCCCGGTT CCCGCGGCCG GTCACCCGGG CCGGGCTCGT GCTGCTCACC GCGGTGATGC TGGCGCCGCT CGTGCCGCGC AGCCACGTGG CGACACCGGC CGCCACGCCC GCCTTCTTCA CCGGCGACGC CGTCCGCGCG GTTCCCGAGG GATCGACGGC GCTGGTGCTG CCCTACCCCT ATCCGGCCCG CACCGAGGCG ATGCTCTGGC AGGCCGAGGC GGGCTACCGG TTCCGGCTCC CGGGCTGCTA CTGCACCGTC CCGGGCCCGG ACGGGCGCGC CGTCTTCAAC GCGTGGACCG ACCCGCTCAA CGGCGCGCTG GTCGCGGTCG AGCAGGGCCG GTCGGACGCG GCGGCCGCGC TGGCCGATCC CGCCGTGCAG GCCGCCTTCG ACCGGCTCGC GCCCGCCGCG GTGATCCTCG GCCCGAGCGC GAACCGGGAC GAGCTCGCCC GGCTGGTGAC CGGCCTGGCC GGCGCCGGGC CGGCGGACGT CGACGGGGTC CAGCTGTGGC TGACAGCGCC CTGA
|
Protein sequence | MRTTGTAADP ADPAEPSDPL PADPLPPDPV PADPVPAGAR RDGPRPSHRH RRQRRRRRPL GRAGLVPAAV LAGYLALAVA VFRAAWADPG GVVYGYSDSV LFAWYLGWVP HALSAGIDPF VTSYLNAPTG TNILWSTPVP LLGLVTAPVT ALFGPVVSLT LLLTLAPALS AFALFWVLRR WVPAPPAAVA GLLYGFGPYM VGESYGHLHL TFAVFPPLLL LLLDDLIVRR RPPGRTGVLL GLAVAAQAMI SEEVLATAAL LGALGLAIAG LAHRAAVRAR AGALLRGLAA CGTTAGTLLA WPLTAQFLGD QRVHGNIQPH NVAVSDLLTF VTPTPAQRIA PDVALRHSLR FTGNAVEVTG YLGLPLLLGV AAIAVRFRRE PLVAVFAPLG AVTALLSLGG HLHVDGRVTG IRLPWLPLEN LPVISSALPS RLALYLAMSV AIVLAVGLTR VAASARFPRP VTRAGLVLLT AVMLAPLVPR SHVATPAATP AFFTGDAVRA VPEGSTALVL PYPYPARTEA MLWQAEAGYR FRLPGCYCTV PGPDGRAVFN AWTDPLNGAL VAVEQGRSDA AAALADPAVQ AAFDRLAPAA VILGPSANRD ELARLVTGLA GAGPADVDGV QLWLTAP
|
| |