Gene Information |
Locus tag | Franean1_3809 |
Symbol | |
ID | 5672173 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4523468 |
End bp | 4524958 |
Gene Length | 1491 bp |
Protein Length | 496 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641242688 |
Product | NLP/P60 protein |
Protein accession | YP_001508108 |
Protein GI | 158315600 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0791] Cell wall-associated hydrolases (invasion-associated proteins) |
TIGRFAM ID | |
| |
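The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but the numbers are consistent with it) and that the last codon of the CDS is a stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4523468
end_bp = 4524958
gene_length_bp = 1491
protein_length_aa = 496

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the final codon is a stop codon and is not counted in the protein.
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)  # 1491 0 496
```

Under these assumptions the reported gene length (1491 bp) and protein length (496 aa) agree with the coordinates.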
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.100771 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.247532 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGACGT ACACCGCCGC TCAGATCTAC GCTTACGCCC GGGCCGCCGG CTTCACCTCG GACCAGGCCG TGACGATGAC CGCCGTCGCG CTGGCCGAGT CGGGCGGCAA CGGGATGGCG CACGCCACCC GGGGCGAGGA CTCGCGCGGG CTCTGGCAGA TCAACATGAA CGCCCACAAG CAATGGGCGC ACCTGGACGC CTCTGACCCC GCGGTCAACG CCCGGATGGC CTACGAGGTC TCCCGCCAGG GCCGGGACAT CTCCCCCTGG ACGGTCACCC ACGGCGGCGC GGATGCCCGC TACCTGGACT ACCGCGCGCA GGCGCAGCGG GCCGCGATCG AGGCCGGGGA CGCGGGCGCG CAGGGAAACT GGTCGGGGAC CGAGGGCTAC GGGCACGCGG TCGCCGCCGG CGGGGGTGGG AGTGGGGGCG ACCACACGGC CGGCTTCGCG GCGAGCGGGT ACCCGACGTA CTCGGCCGGC ACCGGCGGGA ACGGCCCGGG CGGCGCGCAG GTCGGCGAGA CCACCCGGCA TTTCCTCGAC GCCGCGCTCG CTCAGGCCGG CGACCGCTAC GTGTACGGCG CCGAGGCGCA GCTGAACGAC GCGGACCCCG ACGCCTTCGA CTGCTCGGAG CTGACCCAGT GGGCGGCGGC GCAGGCCGGC GTCGAGATCC CCGACGGCGC CGCGAACCAG TACGAGGCCC TGCGCACCGA GGGCCAGGAG GTCTCCGTCG ACGAGGCCCT GCGCACGCCC GGCGCGCTGC TGTTCCACGC CGACGCGAGC GGCTACGTCA GCCACGTCGC GATCAGCCTC GGTGACGGGC GGACGATCGA AGCCCGCAAC TCCCGCCTCG GCGTCGGGGT GTTCGACGGA CGGGAGAACT GGCTCAACCG GGCGGCCGTC CTCCCCGGAC TGTCCGGGGA GGTCGGCGCG GGCGCGTTCG CGGCCGGGCC CGGAACCGGC CTCGGGGCCG GGGGGGCCGC TGACGGTACC GACAGCGACG CGGACGGGCT CACCGACGCC CTGGAAGCGT CGCTCGGCAG CAACGCCCAT GAGGTGGACA GCGACAGCGA CGGCTTGTCC GACTCCTACG AACTGCTGCG CGTGCACTCG GACGTGATGT CGGCGGACAC CGACCACGAC GGTCTGGCGG ACGCGCTCGA TCTGGCCTCC GGCTTCGACC CGACGAACGC GGACAGCACC GGCACCGGTC GTCTGGACGG GGCGACCGGC GAGGCGGCCA GCCTCGACAC CGACGACGAC GGGCTCGCCG ACGCGCTCGA GCGGGTCCTC GGCACCGACT CGACTCTGGC CGACAGCGAT CACGACGGGG TGACCGACGG CGCGGAGCAC CTGAACCGCA TGGACCCGCT CGACCTGAAC GACGTGGGGT CGCTGGTGCA CAGCGCGGAC ACCGGCGCTG GGACGGGAAC CGACCTCGGT GCCACCCACC CCCCGGGCGG CACCGACCTC GACGGCACCA CCCACCTCTG A
|
Protein sequence | MATYTAAQIY AYARAAGFTS DQAVTMTAVA LAESGGNGMA HATRGEDSRG LWQINMNAHK QWAHLDASDP AVNARMAYEV SRQGRDISPW TVTHGGADAR YLDYRAQAQR AAIEAGDAGA QGNWSGTEGY GHAVAAGGGG SGGDHTAGFA ASGYPTYSAG TGGNGPGGAQ VGETTRHFLD AALAQAGDRY VYGAEAQLND ADPDAFDCSE LTQWAAAQAG VEIPDGAANQ YEALRTEGQE VSVDEALRTP GALLFHADAS GYVSHVAISL GDGRTIEARN SRLGVGVFDG RENWLNRAAV LPGLSGEVGA GAFAAGPGTG LGAGGAADGT DSDADGLTDA LEASLGSNAH EVDSDSDGLS DSYELLRVHS DVMSADTDHD GLADALDLAS GFDPTNADST GTGRLDGATG EAASLDTDDD GLADALERVL GTDSTLADSD HDGVTDGAEH LNRMDPLDLN DVGSLVHSAD TGAGTGTDLG ATHPPGGTDL DGTTHL
|
| |
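The sequence fields are printed in space-separated blocks of ten characters, so they need light cleaning before use. The sketch below is one possible way to strip the spacing, compute a GC percentage, and translate the gene sequence. It assumes the gene sequence is already given in coding orientation (it starts with ATG even though the strand is "-"). It builds the codon table from the standard genetic code, whose amino acid assignments are shared by translation table 11; table 11 differs in the set of permitted start codons, which this sketch does not model. The function names `clean`, `gc_percent`, and `translate` are hypothetical helpers, not part of any database API.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G at each position).
# Translation table 11 uses the same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the block spacing used in the record and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
sample = "ATGGCGACGT ACACCGCCGC TCAGATCTAC"
print(translate(sample))  # MATYTAAQIY
```

The translated sample matches the first block of the protein sequence field. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the Protein sequence and GC content fields.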