Gene Information |
Locus tag | Franean1_3597 |
Symbol | |
ID | 5671966 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4259439 |
End bp | 4260635 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641242483 |
Product | extracellular ligand-binding receptor |
Protein accession | YP_001507903 |
Protein GI | 158315395 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
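The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based and inclusive (consistent with the reported gene length) and that the CDS ends in a stop codon that is not counted in the protein length. The function name cds_lengths is hypothetical and not part of the database.

```python
from typing import Tuple


def cds_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based, inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive coordinates
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the final codon is the stop codon
    return gene_len, protein_len


# Values taken from the Start bp / End bp fields of this record.
print(cds_lengths(4259439, 4260635))  # (1197, 398)
```

The result agrees with the Gene Length (1197 bp) and Protein Length (398 aa) fields listed above.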
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGCGCG GCTCGATCCG CCTCCTCGTC CCCCTGCTGG CCGCTTTATC GGTCGCCATG ACGGCATGTG GCGGCTCCGA TGACGGAGCG AGCAGTGACA GCGGAACCAT CAAGATCGGC GCCTGGATCC CACTGACTGG CGCGCAGGCA TCCTCCGGCG TCCCTCAGGC GGAGGGCGCG AAGGCGTACT TCGCATGGCT CAACGACAAC GGCGGTGTGA ACGGCCACCA GATCGAGTGG ATCGTCAAGG ACAACGCCTA CGACCCGCAG CAGACCGTCC AAGCGGCCCG CGAGCTTGTC GCCCAGGACC ACGTGGTCGC CATCGTGAAC GCCAACGGGA CCGCGCCGTC CGAGGCGGCG TTCCCCTATG TCCTCAACCA GTCGAAGGTC CCGATCGTCG ACCACTACGG CGGGTCCGCC GCCTGGTACG ACCCGCCCCG GCCGCTGCTG TTCGGCACCC AGACCCTCTA CGAGGACCAG GCCGCGGCCA TGGCCACCTG GGCGGTCGAG TCCGGAGCGC GCAAGATCAT GGTCGTGCAC GACGATCCGC AGGCATTCGC TAACGTCGCA AAGCAGATCG AACCCGCCGC CAGGCAAGCC GACCCGAGCG TGTCGACCAC GATGCTTTCA GTAAAGCTCG GTACCACCGA CTTCGCTCCG GCGGTTAGCC AGGTGCGCAA CGAGGCACCC GACGCCGTCA TGCTCATCAT GCCCACGCAG GAGACAGCCG CTTACCTCAA GGAGGCGAAG CTGCAGGGCG TGCAGGTGCA GGCGTACGGA TACTCGCCGA CGGCGTCCGC GACCACGGTG ACGCTGGCCG GAGCCGCCGC CGAGGGCTTC CGTGCCGTGT CGGTGGTCGG CGTGCCGTCC CACACGAGCC CGCAGATGCA GCAGTTCCGC GAGGTCATGG CGAAGTATGC GCCCGACCAG CCAGCGGACT TCTCGACTCT GCTCGGCTAC GTTAACGCGG CCGTGTTCGC CGAGGTCGCC AAGACGATCG ACGGCCCGAT CACCTCGGAG TCCATCGCCA ATGCGTACGA GAACGCGCAG GGCATCTCGA CCGGCGTCGC ACCCGACATG AGCTACTCGG CCGACCAGCA TCTCGGCACC CGCCAGGTTC AGCGGACGTA TGTCAAGGAC GGGCAGTGGG TGGCCGAGGG CGGGTTCTTC ACCCCGCCGG AGCGAGCAGC GGCCTGA
|
Protein sequence | MRRGSIRLLV PLLAALSVAM TACGGSDDGA SSDSGTIKIG AWIPLTGAQA SSGVPQAEGA KAYFAWLNDN GGVNGHQIEW IVKDNAYDPQ QTVQAARELV AQDHVVAIVN ANGTAPSEAA FPYVLNQSKV PIVDHYGGSA AWYDPPRPLL FGTQTLYEDQ AAAMATWAVE SGARKIMVVH DDPQAFANVA KQIEPAARQA DPSVSTTMLS VKLGTTDFAP AVSQVRNEAP DAVMLIMPTQ ETAAYLKEAK LQGVQVQAYG YSPTASATTV TLAGAAAEGF RAVSVVGVPS HTSPQMQQFR EVMAKYAPDQ PADFSTLLGY VNAAVFAEVA KTIDGPITSE SIANAYENAQ GISTGVAPDM SYSADQHLGT RQVQRTYVKD GQWVAEGGFF TPPERAAA
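The GC content and protein sequence fields can in principle be recomputed from the gene sequence. The sketch below is one way to do that, assuming the sequence is pasted in the space-separated 10-base blocks shown above. The helper names (clean, gc_percent, translate) are hypothetical. The codon assignments are those of NCBI translation table 11, which are identical to the standard code; the table's alternative start codons are not handled, which does not matter here because this gene starts with ATG.

```python
BASES = "TCAG"
# Amino acids for translation table 11, in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
first_blocks = "ATGCGGCGCG GCTCGATCCG CCTCCTCGTC"
print(translate(first_blocks))             # MRRGSIRLLV
print(round(gc_percent(first_blocks), 1))  # 73.3
```

The translated prefix matches the first block of the protein sequence above. Passing the full gene sequence to gc_percent and translate would allow a comparison with the reported GC content and protein sequence.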
|
| |