Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0156 |
Symbol | |
ID | 5668581 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 186804 |
End bp | 187838 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641239085 |
Product | substrate-binding region of ABC-type glycine betaine transport system |
Protein accession | YP_001504529 |
Protein GI | 158312021 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1732] Periplasmic glycine betaine/choline-binding (lipo)protein of an ABC-type transport system (osmoprotectant binding protein) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0633468 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGTCGG CGAACGTGAG CGAACCAACG TCGGTGGCGG CAGAGGCTGG GCAGGCCGTG CACCGGCGCA CCCGGCGCAC CCGACGGATC CGCCCGTGCC CGCGCCGTGC CGGCCGGCGC CGGGTGGGCG CCGCGGCGGC GGCCATGCTA GCTCTCGCGC TGACCACTGC GGCCTGCGGC GACGGCTCCG GCGGCGCCAG CCTTGGTTCG CTCACCGTCG CCGACGCCGG TTTCACCGAG AGCAAGATCC TCGCCGACAT GTACGGCGAA CTGCTCACCA ACGCCGGCTA CAAGGTCGAG CGGACCTCGG TGCAGAGCAC CGAGATCGCC CAGTCCTCCC TGGAGAGCGG GCAGATCGAC GCCATGCCGC AGTATGTGGC GACCTACGCC GACCTGCTGA ACTCCCAGGT CAACGGCTCC GGAGCCACCT CGGTCTCGTC GTCGGACCTG AACGCGTCGC TGGCCGGACT GCGCCGCCTC GCGAAGGGGC TGGGCCTGAG CGTGCTCGAG CCGGCCGACG CGGTCGACCA GAATGCCTTC GCGGTGAGCA GGTCCTTTGC CGCGGAGCAC CACCTGACGA CCCTGACCGA CCTGGGTGCC AGTGGCCTGA CGGTGAAGCT CGCTGCGGGC GCCGAGTGCG CGACCCGCCC GTTCTGCCAG CCCGGGTTGG AGAAGACCTA CGGGATCAAG ATCTCCGAGA TCGTCGAAAC CGGTGTGGCG ACCGCCCAGA CCAAGGCCGC GGTACGGGAC AACACCGCGC AGCTCGGCCT GGTGCTCACC ACCGACGCCA CCGTCAACGG CTACAACCTG GTCGTGCTGA CCGACGACAA GAAGCTCCAG AACGCCGACA ATCTGGTGCC GATCGTGAAC ACCGACTCGC TCGCCCCGGA GATCACCTCG GCACTGAACG CGCTGGCGCC GGTGCTCACG ACGGCGGACC TGGCCGAGCT GAACAAGAGG GTGGACGCGG AGCGGGAGAA GTCCGAGGAG GTCGCGCACG ACTTCCTGGC CGAGAAGGGC CTGCTGGACA GCTGA
|
Protein sequence | MGSANVSEPT SVAAEAGQAV HRRTRRTRRI RPCPRRAGRR RVGAAAAAML ALALTTAACG DGSGGASLGS LTVADAGFTE SKILADMYGE LLTNAGYKVE RTSVQSTEIA QSSLESGQID AMPQYVATYA DLLNSQVNGS GATSVSSSDL NASLAGLRRL AKGLGLSVLE PADAVDQNAF AVSRSFAAEH HLTTLTDLGA SGLTVKLAAG AECATRPFCQ PGLEKTYGIK ISEIVETGVA TAQTKAAVRD NTAQLGLVLT TDATVNGYNL VVLTDDKKLQ NADNLVPIVN TDSLAPEITS ALNALAPVLT TADLAELNKR VDAEREKSEE VAHDFLAEKG LLDS
|
| |