Gene Information |
Locus tag | Franean1_3871 |
Symbol | |
ID | 5672234 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4601008 |
End bp | 4602132 |
Gene Length | 1125 bp |
Protein Length | 374 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641242749 |
Product | putative sulfonate binding protein precursor |
Protein accession | YP_001508169 |
Protein GI | 158315661 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | [TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family |
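The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length; the function name cds_lengths is illustrative and is not part of any database API.

```python
from typing import Tuple


def cds_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the stop codon encodes no residue
    return gene_len, protein_len


# Start bp and End bp copied from the record above.
gene_len, protein_len = cds_lengths(4601008, 4602132)
print(gene_len, protein_len)  # 1125 374
```

Under these assumptions the result agrees with the Gene Length (1125 bp) and Protein Length (374 aa) fields.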
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.080113 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.159849 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGGGATG AGGGCAGACT GCCCGCAGGC CGTGACGCTA CGCCCGCGCC GTCCGTTCCG CTACGAGTCC TCATGATCGG CCGTCCGGCC GGACGCCGGT CCCGACGGCT GCTGGTGGTT GTTCCCCTCG CCCTGGTCGG ACTGCTCGCG GCGGCCTGCG GCGGGAGCGA CCCGGCGCCG GGTACGGCCG CGGCGGCGGC GAACGACCTC TCCGGCGTCA CACTGCGTTT CGGTGACCAG ATCAACGGGG TCCGCTCCGT GCTGGAGGCC TCCGGCGAGC TGGACGACGC ACCGTACAAG ATCGAGTGGA GCCAGTTCCA GGGCGGTGGC CCCACGGTGA TCGCGGCGCA GACCGGCGGG GACGTCGACC TCGGGACGAT GGGGGAGACA CCGGTGGTGT TCGCGCAGGC CGCGCACAGC CCGGTGACGG TCGTCGGGGC CGCCCGGATC GTCGACCCGG CGAAGTCGAA CTTCGCGCTG GTCGTGAAGA AGAACTCGCC GATCCGGTCC GTCGCGGACC TGCGCGGCGC GACGATCCTC AACAGCCAGG GCACGGTGTC GCAGTACCTG GTCGCGAAGG CGCTGGAGAA GGGCGGCCTC ACGACCGACG ACGTGAAGCT GGTGAACCTC CAGCAGGGTG CGCAGGCCGC CTACGACCGC GGCGACATCG ACGTCATCGC CAGCGGGGGA CCGCCCCTGG CGATGATGCT GGCGAAAGGC ACCGACCGGG TCCTCATGAC CGGCGCGGGT GTGCTGCCCG GAGTCAACTA CCTGGTCGCG CGCAACGGCG CTCTCTCCGA CGCCGGGCTC AGCGACGCCA TCGGCGACTT CCTCGGCCGG CTCGCGCAGG CGCAGGACTG GTACAACGCC CACCCGGACG CCGCCATCGC GATCGTGAAG CCCACCTACA AGGTGGACGA CACCGTCGCC CGCGCCATCA TCGACCTCGC GCCCCTGCAC TACGTGCCGA TCGACAGGAC GGTCACCGAC GCCCACCAGC GCGAGGCGGA CTTCTTCGCC GACCAGGGCG TGCTGAAGGC GAAGATCGAC ACGTCGACGG TCTTCGACGA CCGTTACAAC CCGATCATCA ACGCGGTGGC CCAGGGCCGG CCTGGTCCGT CATGA
|
Protein sequence | MGDEGRLPAG RDATPAPSVP LRVLMIGRPA GRRSRRLLVV VPLALVGLLA AACGGSDPAP GTAAAAANDL SGVTLRFGDQ INGVRSVLEA SGELDDAPYK IEWSQFQGGG PTVIAAQTGG DVDLGTMGET PVVFAQAAHS PVTVVGAARI VDPAKSNFAL VVKKNSPIRS VADLRGATIL NSQGTVSQYL VAKALEKGGL TTDDVKLVNL QQGAQAAYDR GDIDVIASGG PPLAMMLAKG TDRVLMTGAG VLPGVNYLVA RNGALSDAGL SDAIGDFLGR LAQAQDWYNA HPDAAIAIVK PTYKVDDTVA RAIIDLAPLH YVPIDRTVTD AHQREADFFA DQGVLKAKID TSTVFDDRYN PIINAVAQGR PGPS
|
| |
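The GC content and Protein sequence fields can in principle be recomputed from the Gene sequence field. The following is a minimal sketch rather than the method the database itself uses: it assumes the sequence contains only A, C, G and T, that translation table 11 assigns the same amino acids as the standard code, and that a recognized start codon (here GTG) is read as methionine. All function names are illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons, enumerated in TCAG order.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons assumed here for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq):
    """Drop the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate_cds(seq):
    """Translate a CDS from its first base up to the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]  # KeyError on ambiguous bases such as N
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the Gene sequence field above.
print(translate_cds("GTGGGGGATG AGGGCAGACT GCCCGCAGGC"))  # MGDEGRLPAG
```

The printed residues match the first ten residues of the Protein sequence field. Passing the full Gene sequence value to gc_percent and translate_cds allows the GC content and Protein sequence fields to be compared in the same way.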