Gene Information |
Locus tag | Franean1_5557 |
Symbol | |
ID | 5673887 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 6733734 |
End bp | 6734615 |
Gene Length | 882 bp |
Protein Length | 293 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641244413 |
Product | rare lipoprotein A |
Protein accession | YP_001509817 |
Protein GI | 158317309 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4305] Endoglucanase C-terminal domain/subunit and related proteins |
TIGRFAM ID | |
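
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, not part of the database record. The function names `gene_length` and `protein_length` are hypothetical, and the sketch assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS.

    Assumes the CDS ends with one stop codon, which is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length(6733734, 6734615)
print(length, protein_length(length))
```

Under these assumptions the results agree with the Gene Length (882 bp) and Protein Length (293 aa) rows.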
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCCCG CCCCAGGTTT CGTCTCGCCG GTTCGGCGTG CCCTGCTCGG CATGGCGGTG GCGGCCGCCA CCGCCGTCCT CGCCACCGGC TGCCAACCGG CGGTCGGCGG TGGTCCGCCG GAGCCCGACC CGGTCAGCAC CGCGACCGCG GCGTCTACGC CCGGTCCGTC GTCCACGCCC ACCGGTCCGT CGGCGCCGCC CGGCCGGCCT TCGGTGACAG GCGCGCCGGC GCCGGGCGCA CCGGCAGGGC CGACAACCGT GCCGGTCACA CCCGGCCGTA TCCAGCCGGG TGTGGTCCGC ACCGGTCCGG CGACGCACTA CGGCGCGGAC GGCGGCGGCA ACTGCATGTT CGACCGCCTG ACCGACCCGG CCATGCCGGT AGTGGCGATG AACGAGCTCG ACTACGAGAC GGCCCGCGCC TGCGGCGCGT ACATCGAGGT CACCGGACCC GGTGGCACCA CGGTCGTGAA GGTCACCGAC CGGTGCCCGG AGTGCGGTCC TGGTCATCTC GACCTGAGCC AGCAGGCCTT CGCCCGGATC GCCGGCGGCG TGCCCGGGCT GGTCGACGTC ACCTGGCGGC TGGTGAGCCC GGCCGACATC GGATCCGTCC AGTTCCGGGT CAAGGAGGGG TCGTCGGCCT ACTGGCTGGC AATTCAGGTC CGCAACCATC GCAATCCGGT CGTCTCACTC GAGGTTCGGG TGAACGGGGC CTGGACGGCG CTGCCGCGGG AGATGTGGAA CTACTTCGTG GCACCGCAGG GTCTCGGTCC GGGGCCGTTC ACCGTGCGGA TCACCGACGT TTACGGCGAG CAGCTTGTCG AGACCGTGAA CCTTGCACCC GCCTCGGTGC AGACGACGGG CAGCCAGTTC GCCCGGCACT GA
|
Protein sequence | MSPAPGFVSP VRRALLGMAV AAATAVLATG CQPAVGGGPP EPDPVSTATA ASTPGPSSTP TGPSAPPGRP SVTGAPAPGA PAGPTTVPVT PGRIQPGVVR TGPATHYGAD GGGNCMFDRL TDPAMPVVAM NELDYETARA CGAYIEVTGP GGTTVVKVTD RCPECGPGHL DLSQQAFARI AGGVPGLVDV TWRLVSPADI GSVQFRVKEG SSAYWLAIQV RNHRNPVVSL EVRVNGAWTA LPREMWNYFV APQGLGPGPF TVRITDVYGE QLVETVNLAP ASVQTTGSQF ARH
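
The sequence rows can be checked against the GC content and protein fields. The following is a minimal Python sketch, not part of the database record. The helper names `clean`, `gc_content`, and `translate` are hypothetical. It assumes the sequence is pasted with the 10-base spacing shown above, and it uses the standard codon assignments, which translation table 11 shares for internal codons.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
prefix = "ATGTCCCCCG CCCCAGGTTT CGTCTCGCCG"
print(translate(prefix))
```

Passing the full gene sequence to `gc_content` and `translate` gives values that can be compared with the GC content row and the protein sequence row.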