Gene Information |
Locus tag | Franean1_0380 |
Symbol | |
ID | 5668804 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 455358 |
End bp | 457001 |
Gene Length | 1644 bp |
Protein Length | 547 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641239312 |
Product | hypothetical protein |
Protein accession | YP_001504752 |
Protein GI | 158312244 |
COG category | [S] Function unknown |
COG ID | [COG5298] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
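The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. This assumes the start and end positions are 1-based and inclusive and that the gene length includes the stop codon (the record says the gene is not spliced). The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record for Franean1_0380.
length = gene_length_bp(455358, 457001)
residues = protein_length_aa(length)
print(length, residues)  # 1644 547
```

Both results match the Gene Length (1644 bp) and Protein Length (547 aa) fields. Strand does not affect the arithmetic, since the coordinates are given on the replicon regardless of orientation.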
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.396423 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.044592 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCTGCCCG CCGAGGGGCG CTTCGCCTCA CCGCCGGCTC CACCGGCGCC GCCGCGCACC CCGCCGCCAC GGCGGGCCGC GCGGCCCGTC CCGGGCACGA ACCAGCGCCC GCCCGCGGCA CCGGACGGCG CCACCGGGCC CGGGGCCGCG TCGGCGCCGT CCGGCCAGGC GACCGGGACG GGCACCCTGA TCCTGTACGA CACCACCGGC GCGTGGGGGT GGCTGGGCGA GCAGTACGCC ATGCAGGCGG CCAACCTGGC CTCCCGGTTC GGCACCTGGC AGGCCCGTCC GGTCAGTTCG TACACCGCGG GCCAGATGTC CGCGTACGCC GCGGTGGTGT ACGTCGGGTC GACCTACGAC GAGCAGGTCC CGACGGCGTT CCTCACCGAC GTGCTGGCGG GGAACCGGCC CGTGGTGTGG ATGTACAACA ACATCTGGCA GCTCACGTCG CAGGCGCCGA CCTTCCCGAC GACGTACGGG TGGAACTGGT CCGGCTTCGA CACGTCGGCG ATCGGGACCG TCAGCTACAA AGGGACCGAT CTGACCCGGT ACACGGCGAA CGCCGCCGGG ATCATGAACT ACGCCTCCGT GGACACCACC AGGGCCACGG TGCTCGCCGA GGCGGTGCGC GGTGACGGCA CCCGCTTCCC CTGGGCGCTG CGGTCCGGGA ACCTGACCTA CATCGGCGAG ATCCCGTTCG CCTACGCCGA CATGACCGAC CGCTACCTCG CGGTGGCCGA CATGCTCTTC GACGTCCTCG CGCCGCAGAC CGCCGCCCGG CACCGCGGGC TCGTCCGCAT CGAGGACGTC GGGCCGGACG CCGACCCCGC GGAGCTGCAC GCGATCGCCG ACTACCTGTC CTCTGCGCAG GTACCGTTCT CGGTCGCCGT CTACCCGCGG TACGTGGACG CGAACGGCAC GTACAACAAC GGCGTACCGC AGGACTACAC TCTCGCGTCC AAACCGGCGG TGGTCAGCGC GCTGAAGTAC ATGACCCAGC GCGGCGGCAC ACTGATCATG CACGGCTGGA CGCACCAGTT CTCGAACGTC GCCAACCCGT ACTCGGGCGC GAGCGCGGAC GACTTCGAGT TCTTCCGGGC GCATGTCGAC GCCCAGGACT ACGTGGTCTA CGACGGACCC GTGCCCGGCG ACAGCCAGGC GTGGGCGACC GACCGGATGA ACGGCTCGGC CGCCGCGTTC ACCGCCGCCG GGCTGCCGGT GCCGACGACC TTCGAGTTCC CGCACTACGC CGCGTCCGCC CCCGACTACG CCGCGGCCCG GGCGAAGTTC CCGCGCCGCT ACGACCGCGG GCTCTACTTC CGCAACCAGC TCGCCGGCGG CGCGGTGGAC CACACGAAGT ACGGCGGCCA GTTCTTCCCG TACCCGGTGA CGGACGTCCA CGGGTCCTTC GTCATTCCAG AGAACATCGG GAACATCGAG CCCGAGCCGT TCAACAACCA CCTGGCGCGG CTGCCCGCGG AGCTGATCGA CGCGGCCCGG CGCAATCTGG TCGTCCGGGA CGGTTTCGCC AGCATGTTCT ACCACCCGTA TCTGGGCGTT GACTACCTGC GCCAGACGGT GGAGGGCGTG CGCGCCCTCG GATACACCTT CGTCGCGGCC GGCTCCGTCG TCGCGGGCGG GTAG
|
Protein sequence | MLPAEGRFAS PPAPPAPPRT PPPRRAARPV PGTNQRPPAA PDGATGPGAA SAPSGQATGT GTLILYDTTG AWGWLGEQYA MQAANLASRF GTWQARPVSS YTAGQMSAYA AVVYVGSTYD EQVPTAFLTD VLAGNRPVVW MYNNIWQLTS QAPTFPTTYG WNWSGFDTSA IGTVSYKGTD LTRYTANAAG IMNYASVDTT RATVLAEAVR GDGTRFPWAL RSGNLTYIGE IPFAYADMTD RYLAVADMLF DVLAPQTAAR HRGLVRIEDV GPDADPAELH AIADYLSSAQ VPFSVAVYPR YVDANGTYNN GVPQDYTLAS KPAVVSALKY MTQRGGTLIM HGWTHQFSNV ANPYSGASAD DFEFFRAHVD AQDYVVYDGP VPGDSQAWAT DRMNGSAAAF TAAGLPVPTT FEFPHYAASA PDYAAARAKF PRRYDRGLYF RNQLAGGAVD HTKYGGQFFP YPVTDVHGSF VIPENIGNIE PEPFNNHLAR LPAELIDAAR RNLVVRDGFA SMFYHPYLGV DYLRQTVEGV RALGYTFVAA GSVVAGG
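The sequences above are shown in space-separated blocks of ten characters. The gene sequence starts with GTG while the protein starts with M, which is expected under translation table 11, where GTG is an accepted alternative start codon read as methionine at initiation. The following is a minimal sketch for cleaning the grouped text and running basic sanity checks. The helper names are hypothetical, and the start codon set lists only the most common bacterial starts rather than every start allowed by table 11.

```python
# Common start codons under translation table 11 (not exhaustive; an assumption
# made for this sketch) and the three standard stop codons.
COMMON_STARTS = {"ATG", "GTG", "TTG"}
STOPS = {"TAA", "TAG", "TGA"}


def clean_sequence(grouped: str) -> str:
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_percent(grouped: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(grouped)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def check_cds(grouped: str) -> dict:
    """Report whether a CDS has a plausible start, a stop, and a whole number of codons."""
    seq = clean_sequence(grouped)
    return {
        "starts_with_start_codon": seq[:3] in COMMON_STARTS,
        "ends_with_stop_codon": seq[-3:] in STOPS,
        "length_multiple_of_3": len(seq) % 3 == 0,
    }


# Toy input only: the first three codons of the gene sequence above plus its stop codon.
toy = "GTGCTGCCC TAG"
print(check_cds(toy))
```

To check the full record, paste the complete Gene sequence field into `check_cds` and `gc_percent` and compare the results with the Gene Length and GC content fields.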