Gene Information |
Locus tag | Franean1_2037 |
Symbol | |
ID | 5670438 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2449126 |
End bp | 2450337 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641240959 |
Product | extracellular ligand-binding receptor |
Protein accession | YP_001506380 |
Protein GI | 158313872 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
|
|
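The coordinate and length fields in the record above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming inclusive 1-based coordinates and that the gene length includes the terminal stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The stop codon occupies one codon but adds no residue to the protein.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record: Start bp, End bp.
length = gene_length_bp(2449126, 2450337)
print(length)                           # 1212
print(expected_protein_length(length))  # 403
```

Both results agree with the "Gene Length" and "Protein Length" fields listed in the record.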
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.80961 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGGCA GCCCACGCAC CCCTCGACCC AGGACCAGAC GGGCCGGGGT CCTGCGGCCG GTCGCCGGGG TGGTCGCGGG CCTGGCGATG GTCGGCGCGG CGTTGACCGC CTGCGGCGGT GGTGACTCGA ACGAGGCCGG TACCAAGGAG TTCCTCATCG GCTTCCAGGG ACCGCTGTCC GGGGAGAACC AGCAGCTCGG CATCAACGCC TACAACGGCG TGCTGACCGC CGTCGACCTG GCCAACCGCA CCGGCGGCCT GCCGTTCACC CTGCGGATCG TCGCCTCCGA CGACCAGGGC GTGCCCGAGC AGGGGCCGAC CGCCGCCCAG AAGCTCATCG ACAACCCGGA CGTCGTCGCG GTCGTCGGCC CGGTGTTCTC CGCGCCGACG AAGGCCAGCG AGCCGCTGTA CAGCCAGACA GGCCTGCTGT CGGTCAGCCC GTCGGCGACC GCCCCGGCGC TCACCGACCT CGGGTTCACC AGCTTCTACC GGGTGATCGC ACCGGACACC GTGCAGGGCG CCGGCGCCGC CGAGTACCTC GCCAAGGTCG TGAAGGCCAA GCGCGTCTAC TCGCTGGACG ACCGCAGCGA GTACGGCACC GGCCTCTCCG GCGCCGTCGA GCGCGGCCTG AAGCAGGCGG GCGTCGCCTA CACCCACGAC GGGATCAACC CGACGAAGGA CTACACCTCC CAGGCGACGA AGATCATGGC GGACGGCGCC GACGCCGTCT ACTACTCCGG GTACTACTCC GACTTCGCGC TCCTCACGAA GGCGCTGCGC AGCAAGGGGT ACGACGGGGC GATCCTCAGC GGCGACGGCT CGAACGACGA CCAGTACATC CGCCAGGCCG GCGCGGCGAA CGCCGAGGGC ACGCTCATCA CCTGCCCCTG CTCGGACGCC AACACCGACC CGGCCGCCAC CGGCTTCGTC AGCGAGTACA AGAAGGTCAA CAGCGGCCTG AAGCCGGGCA CGTACTCCGG CGAGGCCTAC GACGCCACCA ACGCCATCAT CTCGGTGCTG CGCAAGCTCG GGACCGGCGC CAGCCGCGAG TCGGTGCTCG CCGACTTCGG CTCGGTCGAC TTCCCCGGGG TGACCAAGCG GATCCGCTTC GAGCGCAACG GTGACGTCCA GGGCTCGACC GTCTACGTGT ACCAGGTGAA GGACGGCCAG CGCGTGGTGC TCGGGCCGGT CAGCTCGCTC GTCAAGCCGT GA
|
Protein sequence | MTGSPRTPRP RTRRAGVLRP VAGVVAGLAM VGAALTACGG GDSNEAGTKE FLIGFQGPLS GENQQLGINA YNGVLTAVDL ANRTGGLPFT LRIVASDDQG VPEQGPTAAQ KLIDNPDVVA VVGPVFSAPT KASEPLYSQT GLLSVSPSAT APALTDLGFT SFYRVIAPDT VQGAGAAEYL AKVVKAKRVY SLDDRSEYGT GLSGAVERGL KQAGVAYTHD GINPTKDYTS QATKIMADGA DAVYYSGYYS DFALLTKALR SKGYDGAILS GDGSNDDQYI RQAGAANAEG TLITCPCSDA NTDPAATGFV SEYKKVNSGL KPGTYSGEAY DATNAIISVL RKLGTGASRE SVLADFGSVD FPGVTKRIRF ERNGDVQGST VYVYQVKDGQ RVVLGPVSSL VKP
|
| |
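The sequence fields are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below, using only the Python standard library, shows one way to clean such a string, compute its GC fraction, and translate it. The helper names are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in the permitted start codons), so the standard codon assignments are used here.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; '*' marks a stop codon.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def clean_sequence(raw: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case."""
    return "".join(raw.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate a cleaned CDS, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
prefix = clean_sequence("ATGACGGGCA GCCCACGCAC CCCTCGACCC")
print(translate(prefix))  # MTGSPRTPRP
```

The translated prefix matches the first block of the listed protein sequence. Applying `gc_content` to the full cleaned gene sequence gives a value that can be compared with the "GC content" field.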