Gene Information |
Locus tag | Franean1_5897 |
Symbol | |
ID | 5674218 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 7161416 |
End bp | 7163059 |
Gene Length | 1644 bp |
Protein Length | 547 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641244745 |
Product | hypothetical protein |
Protein accession | YP_001510147 |
Protein GI | 158317639 |
COG category | |
COG ID | |
TIGRFAM ID | |
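The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS whose length includes the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and contributes no residue.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(7161416, 7163059)
    print(length, protein_length(length))
```

For this record the sketch gives 1644 bp and 547 aa, which agrees with the Gene Length and Protein Length fields.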
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0016856 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.41818 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGAGCT CATCGGGCCC GCCGCCCGTC CTGGGCACGC CACCCTCCCA GGCCGGGCCG GCGGCGGGAA ACCTACCGCT ACAGCCCGAC CCTGCCCAGC CTGATCGTCA CAATTCGCGA CCTCATCGTG AGGATCGGTG GCGCGCGGAT CGGCCGTCCG CCGCGCCGCG CCGCTGGGGC TTCGTGCGGC GCCATGCCGC CTTCCTGATC CTGCTCACGC TGGCCGTCGG GGCCCGGGCC GCCGTCCTGC TGGCCTATCG GCCCGCGCTC TTCTACTACG GCGACTCGCC CGCCTACCTC GACCAGGCCA CCAACCGGCT GTGGGCGGGC GACTGGCGGC CGTCGGGCTA CCCGATGTTC CTGCGGGTCA TCGGCGCCCC CGACCACCTG ACCCGGCTGG TGGTCGTGCA GCACACGGCG GCCCTGCTGG CCGGGGTCGC CCTCTACGCG GCGTCGCGTC AGCTGTTCGA GCGGCACGGT CCCGTCGCGG CGGCCGGCCG CACCGGGGGC TGGCCGGCCG CGGTGGTCGC GGCGCCGGCG CTGCTGGCCC CCTGGGTGCT CGACCTCGGC CAGTTCGTCC TGGCCGACAG CCTGTTCGGG ACGCTCGTCC TCGGCGGGCT CGTGCTGCTG GCCTGGCCCG GGCGGCCGGC CGCGTGGCGC TTCGCCGTCG CCGGGCTGCT GCTCGGCGCC AGCCTCACCG TCCGCACGGT CGGCTACGGC CCGCTCGCGG TGGGCGCGGC CGGTGCCGTC GTCCTGGCTG TCACGCACTG GCGGCGGGCC CACGCGGCGG TCGTGGCGGC GGGGGCCGTG CTCGCGTTCG TCCTGGGAGC GGCCGTCCCG GTCGTCGCGT ACTCGGCGTG GAGCGCGGGG CAGGGGAAGG GCTTCACCGT CACCGCGCAC TCGGGGTTCT TCCTGTACGG CCGGGTCGCC CCGTTCGCCG ACTGCGCCCG CCTGCCCGAC GATCCCGACC TGCTGTCGCT GTGCGACCCG CGCCCGGTCG GCGAGCACGG CTCCCCGGTG ACCTACCTCT GGCCGGACGA CTCGCCGCTG CGCCAGGGCA ACGACCTGGT CCCACCGGGC CGCGAGGAGC TCGCCGGCGA GTTCGCCCGG CACGTGATCC GTGAGCAGCC CTGGACGATG GTCACCTCGA CCGCCCGCTA CCTGGCCGGG TACTTCTCGC CCGTCCCGTA CGAGAACAGG CTCACGAGCC GCGCCGACAC CTGGGAGCTG CCCCGGACGG GCACCAACCG CCTCGTCTCG GACGGCCCGC ACGCCGCCGA CGGGTACTTC TCCGTCGCGC GGCTGAACGA CCCCCCGGTC GAGCTGCTCG CCTTCTATTC CCGGCTCGGC TACGGGCCGA TGCCCCTGGT CGGCCTGGGT CTGCTCGCCG GGCTGCTGGC TCAGATCGTC GGGCGGGTGC GCGGGCGCGC CGGCCCCGGT CGGCTGTTCT GGCTGCTCGG GGGAGCAAGC CTGTCCACCC TGCTCCTGAG CTCCCTGACC TCGGCGTTCG ATTACCGCTA CCTGGGATCG GTCGTCGGCC TGCTCGCCCC GGCCGCGCTG CTCGGCGCGG CCGGGCTGGC ACGGGTGCTT CGACCGCGGC CGGTCGGCCG CGGCGGGAGC GTCAACGAGG GAGGTATCGC GTGA
|
Protein sequence | MASSSGPPPV LGTPPSQAGP AAGNLPLQPD PAQPDRHNSR PHREDRWRAD RPSAAPRRWG FVRRHAAFLI LLTLAVGARA AVLLAYRPAL FYYGDSPAYL DQATNRLWAG DWRPSGYPMF LRVIGAPDHL TRLVVVQHTA ALLAGVALYA ASRQLFERHG PVAAAGRTGG WPAAVVAAPA LLAPWVLDLG QFVLADSLFG TLVLGGLVLL AWPGRPAAWR FAVAGLLLGA SLTVRTVGYG PLAVGAAGAV VLAVTHWRRA HAAVVAAGAV LAFVLGAAVP VVAYSAWSAG QGKGFTVTAH SGFFLYGRVA PFADCARLPD DPDLLSLCDP RPVGEHGSPV TYLWPDDSPL RQGNDLVPPG REELAGEFAR HVIREQPWTM VTSTARYLAG YFSPVPYENR LTSRADTWEL PRTGTNRLVS DGPHAADGYF SVARLNDPPV ELLAFYSRLG YGPMPLVGLG LLAGLLAQIV GRVRGRAGPG RLFWLLGGAS LSTLLLSSLT SAFDYRYLGS VVGLLAPAAL LGAAGLARVL RPRPVGRGGS VNEGGIA
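The sequence fields are printed in space-separated blocks of ten characters, so they need light cleanup before use. The sketch below is a hedged illustration, not the database's own tooling: it strips the whitespace, computes a GC fraction, and translates the coding strand. It assumes the standard codon assignments, which translation table 11 shares for elongation codons. It does not model the alternative start codons that table 11 allows, which is harmless here because the gene begins with ATG. All names (`clean`, `gc_fraction`, `translate`) are hypothetical helpers.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First two blocks of the gene sequence from this record.
    print(translate("ATGGCGAGCT CATCGGGCCC"))
```

Run on the first two blocks of the gene sequence, the sketch prints MASSSG, the first six residues of the listed protein sequence. Passing the full Gene sequence field to `translate` and `gc_fraction` would allow a comparison against the Protein sequence and GC content fields.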