Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6282 |
Symbol | |
ID | 5674601 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 7629914 |
End bp | 7631014 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641245134 |
Product | helix-turn-helix domain-containing protein |
Protein accession | YP_001510530 |
Protein GI | 158318022 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2856] Predicted Zn peptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGGCGG CAATGCATGA CGGCCAGTCG GCGTACGCGG GCGAGCGCAT CCGAGCGGCT CGTACGCTGC TCGGCCTCTC ACAGGGCGAT CTAGCCGAAG CCTCCGGGGT CGGCCAGACA ATGATCTCGA AGATCGAGAG TGGGGCGAAG TATCCATCGG ACGATCTTCT CGACACAATC GCCGCGGTCA CGGGCACGCC GCGCTCGTTC TTTGATGTTG TGCCCCTGGA TATCCCGCCG ATGACGCTGC GCTTCCGGAA GTCCTCTCAG GCGAAGCGGG GCGATACCAA GAGGGTTGAT CAACTGCTCG CGGAGGCTTA CCGCATCGTC TGGACCTTGG TGAAGCAGCA CGACCGCTAC CTCGCCCCGA CGATCCCCCT CGCGACCGGA GAAGAACTGT CCGGGGAGGA CATCGAGGAG CTGGCCGCGC GTACCCGGGA GGCGCTCGGC CTCGACCTGC ACAGCCCCGT CAGGCACGTC ACTCGCATCT GCGAGCGAGG CGGAATCCTC GTCGCCCCGA TCAGCCTGCC CGGCGACGGC GACGAGTCGG AGACCGTTGG CCATTTCGGG GCGTCCTGCT GGCCTGGCCC GCCTGAGCCG GCGCTCATCG GCTTCTTCCC CGACGGCCCC GGTGATCGGC AACGGTTCAC CCTGGCCCAC GAACTAGGAC ACTTGGTCCT CCACACCCGC CGCCACTACA TCCCCGACCC GGAAGGGGAA GCGAACCGCT TCGCGGGCGC CTTCCTAGTC CCCGCCGAAC CCCTGCGGGA AGCGATGGCT GACCAGGACT TCACCTTGCG GGACTTCGCG GGCCTCAAAG CGCGCTGGGG CGTCTCCATC CAGGCGCTGA TCATGCGAGG AAGTCATCTC GGGCTCATCG ACGCCCGACG CAAGGAGTCG CTGTTCAAGC AGATCTCGGC GCGAGGGTGG AGGAAGCAGG AGCCGGTCCA GGTCCACCAC GAGGAGCCGG CCCTGTTCTG GAAGCTCATG GCCACGGAGT TCGGCACCAG CAGGTCGGTC TACAACACGG CCGGCGATCG GCTCGGCCTT CACTCCTTCC TCCTAGGCCA GCTCGCTCCC CGCGCCACCC CGCGGAAGTA G
|
Protein sequence | MLAAMHDGQS AYAGERIRAA RTLLGLSQGD LAEASGVGQT MISKIESGAK YPSDDLLDTI AAVTGTPRSF FDVVPLDIPP MTLRFRKSSQ AKRGDTKRVD QLLAEAYRIV WTLVKQHDRY LAPTIPLATG EELSGEDIEE LAARTREALG LDLHSPVRHV TRICERGGIL VAPISLPGDG DESETVGHFG ASCWPGPPEP ALIGFFPDGP GDRQRFTLAH ELGHLVLHTR RHYIPDPEGE ANRFAGAFLV PAEPLREAMA DQDFTLRDFA GLKARWGVSI QALIMRGSHL GLIDARRKES LFKQISARGW RKQEPVQVHH EEPALFWKLM ATEFGTSRSV YNTAGDRLGL HSFLLGQLAP RATPRK
|
| |