Gene Information |
Locus tag | Franean1_4828 |
Symbol | |
ID | 5673169 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5767683 |
End bp | 5768669 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641243684 |
Product | binding-protein-dependent transport systems inner membrane component |
Protein accession | YP_001509100 |
Protein GI | 158316592 |
COG category | [E] Amino acid transport and metabolism; [P] Inorganic ion transport and metabolism |
COG ID | [COG0601] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components |
TIGRFAM ID | |
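
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that Gene Length counts the terminal stop codon. The record does not state either convention, but both are consistent with its values. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5767683
end_bp = 5768669
gene_length_bp = 987
protein_length_aa = 328


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one (assumed) terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks should hold if the assumptions above match the record.
coords_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
```

Under these assumptions, 5768669 - 5767683 + 1 gives 987 bp, and 987 / 3 - 1 gives 328 codons, matching the reported Gene Length and Protein Length.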
| |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00770957 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0433123 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCGCT ATCTGTTCGG GCGGTTCCTG CAGGGGGCCT TCGTTCTCTG GGCCGCGTTC ACGCTCTCGT TCGTCGTGCT GTACGCGCTG CCGAGCGATC CGGCGGCGAT CATGATCGGG CCGAGTAACT CCCTCACCCC GGCCGAGCTC GCGGCCCGCC GGCACGAGCT CGGCCTGGAC CGGTCCCTGT TCGCGCAGTA CTTCGGGCGG CTGGGCGACC TCCTCCACGG CGACCTGGGC CGGTCGGTGC AGTCCGGTCA GCCGGTGCGG GAGCTGATCG GGGACGCCCT GCCGCAGACC GCCGCCATCA CGGGGCTCGG CCTGGTGGTC GGGGTGGCGC TCGGCGTGGG TCTCGCGGTC GCGGCGACGC TGACCCACCG GCGCTGGCTG CGCCAGACGC TGCTCACCCT CCCCCCACTC GGCGTGGCCG TCCCGAGCTT CCTCGTCGGG CTCCTGCTGC TGCAGTGGTT CTCGTTTCGG TGGCAGCTGT TCCCGGCCAT CGGCAACAGC GGCTGGCGCA GCCTGGTGCT CCCCGCCGTC ACGATCTCGC TGCAGCCCGC CGCCCTGATC GCGCAGCTGC TGGCGCGGAG CCTGGACAAC GAGCTGCGGC AGAACTACGT CGACCTGGCC AGGGCGAAGG GCGCCGGACC GGCGCGGGTG AACGTCCGGC ACGCGCTGCG CAACGCCGCG CTGCCGGCGC TGACCATCGC CGGCATCCTG GTCGGCGGAC TGCTGGCCGG CGCGATCGTG GTGGAGGTCG TGTTCTCCCG CAACGGCCTC GGCCGGATCT CCCAGGCCGC GGTCGACAGC CAGGATCTGC CGGTGGTGCA GGGCGTGGTC CTGCTCGGCT CGGCGGTCTT CGTCGCCGTC AACCTTCTCG TCGATCTGCT CTATCCGCTG CTCGACCCCC GCATCGCCCG GGACGGGAGC GCGGCCCGCC GCGCTCCGGA GGCGGTCGCG GTCGGTCCGG CCCCGTCCGT AACGTGA
Protein sequence | MTRYLFGRFL QGAFVLWAAF TLSFVVLYAL PSDPAAIMIG PSNSLTPAEL AARRHELGLD RSLFAQYFGR LGDLLHGDLG RSVQSGQPVR ELIGDALPQT AAITGLGLVV GVALGVGLAV AATLTHRRWL RQTLLTLPPL GVAVPSFLVG LLLLQWFSFR WQLFPAIGNS GWRSLVLPAV TISLQPAALI AQLLARSLDN ELRQNYVDLA RAKGAGPARV NVRHALRNAA LPALTIAGIL VGGLLAGAIV VEVVFSRNGL GRISQAAVDS QDLPVVQGVV LLGSAVFVAV NLLVDLLYPL LDPRIARDGS AARRAPEAVA VGPAPSVT
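
The sequences in this record are printed in space-separated blocks of ten characters. A minimal sketch of how such a string could be cleaned and its GC fraction computed is shown below. The helper names are hypothetical, and only the first two blocks of the gene sequence are used as sample input. Applying the same functions to the full gene sequence would give a value to compare with the reported GC content of 73%.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace that separates the 10-character blocks."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence above, used only as sample input.
sample = clean_sequence("ATGACGCGCT ATCTGTTCGG")
sample_gc = gc_fraction(sample)
```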