Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2551 |
Symbol | |
ID | 5670945 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3034466 |
End bp | 3035461 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641241467 |
Product | NLP/P60 protein |
Protein accession | YP_001506887 |
Protein GI | 158314379 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0791] Cell wall-associated hydrolases (invasion-associated proteins) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTTCTGC TCATCGTGAT GATGGGTGCC GCCGCCGCGG GGGTGGCCGA GCGGGTCCTG TGCCTGCCGG TCGTCGGCTG GTTCCTCGGC TGCGACAGCG GCGGCAGCCC CAGCCAGACC GCCCTCGATG ACGTTCCCGC CGACTACCTG AACCTCTACA TGCGGGCCGC CGCGACATGC CCCGGCCTGT CCTGGACGAC GCTCGCGGCC ATCGGCAAAA TCGAGTCCGA TCACGGTCGT AGCCGGCTGC CGGGTGTGAT CAGCGGCACC AACAGCGCCG GGGCCGCAGG TCCCATGCAG TTCCTCGCCG GAACGTTCGC CGACGTCGTT GCTCACCACC AGCTCCCGGC GGGTGGCGCG AGCCCGCCGT CGCTCTACAA CCCCCAGGAC GCTGTGTACG CCGCTGCGTT CTACCTGTGC GACAACCACG TCGTCACCGA CCTGACCGGC GCCCTGTGGG CCTACAACCA CTCGGATGCC TACATCGCCC AGGTCGTCAC CCAGGCCACC CAGTACTCTG AACCGAACCC GAACGCTTCA GTAGCGTGCG CTACATTTCA GTCATCGTTT CAGTCCGGGA TCGATCATTT CAGTAGCGTC GCGCTGACTG CCGTTCAGTT CGCTTGCGGG CAGATCGGTA AGCCCTACGT CTGGGGTGGT AATGGCGAGC CCGGGTTCGA CTGCAGTGGT CTGACGGCTG CCGCCTATGC CGTCGCGGGT GCCCAGATCC CTCGTACGGC CCAGGGCCAG TACAACGCCG GCCCGCTTGT TTCAGCTGAT AGGCCCCTGC TGCCGGGGGA CCTGGTGTTC TTCGGTGGCG GACCAGCCAA GGTCACCCAT GTCGGCATCT TCCTTGGTAT CCAGAATGGC CAGGCGATGA TGGTGGACGC CCCTCACGAG GGGGCTTACG TCCGCGTCGA ACCGTTTCCC GCGACGGTGG GTGCCCGATG GGGATCAGAG CTCTACCTCG GAGCGAGTAC GCCAGCCGCT GGCTGA
|
Protein sequence | MVLLIVMMGA AAAGVAERVL CLPVVGWFLG CDSGGSPSQT ALDDVPADYL NLYMRAAATC PGLSWTTLAA IGKIESDHGR SRLPGVISGT NSAGAAGPMQ FLAGTFADVV AHHQLPAGGA SPPSLYNPQD AVYAAAFYLC DNHVVTDLTG ALWAYNHSDA YIAQVVTQAT QYSEPNPNAS VACATFQSSF QSGIDHFSSV ALTAVQFACG QIGKPYVWGG NGEPGFDCSG LTAAAYAVAG AQIPRTAQGQ YNAGPLVSAD RPLLPGDLVF FGGGPAKVTH VGIFLGIQNG QAMMVDAPHE GAYVRVEPFP ATVGARWGSE LYLGASTPAA G
|
| |