Gene Information |
Locus tag | Franean1_5739 |
Symbol | |
ID | 5674065 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6973619 |
End bp | 6974722 |
Gene Length | 1104 bp |
Protein Length | 367 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641244592 |
Product | NLP/P60 protein |
Protein accession | YP_001509995 |
Protein GI | 158317487 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0791] Cell wall-associated hydrolases (invasion-associated proteins) |
TIGRFAM ID | |
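The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only a consistency check, and it makes two assumptions that the record does not state: the Start bp and End bp values are 1-based and inclusive, and the final codon of the CDS is a stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields.
start_bp = 6973619
end_bp = 6974722
gene_length_bp = 1104
protein_length_aa = 367

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(span_bp, 3)

# Assumption: the last codon is a stop codon and contributes no residue.
coded_residues = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("residues match protein length:", coded_residues == protein_length_aa)
```

Under those assumptions all three checks hold for this record: 6974722 - 6973619 + 1 = 1104 bp, and 1104 / 3 - 1 = 367 aa.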
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCTGTCTG CGCAGAGTGC GCAGCTTGAT GCAGAAAGAC GCCGACAGGA AGGGCGCGGT CGCCACCGGG CTCCGTCCGT CCCCACAACG TCGAGCCGGG CCAGAGCGCG GGCCCGCGCG GTCGCAGCCG TGACCACTGG CACAGTCGTG GTCTCCGGTA TGGCTCTCGC CGGATGTGCC CCGGAGCCCA GCTCGGACGG CGCGTTGGAC GACAGCACAA GCACCACGTC ACTGACGCTT GCCACCCAGA TCGGGTCCCG GCCGGCGACC GACGGCGCCA TCCAGGCCGC CGCCGCGGCG GACGGCACGC CCGGTACCGT CCTTGACGCC ACGACGGACA TCTCTGCACC GACTCTGTCG TCCAAAATCG ACGTGGGTCT GCGCGTGACG AACCCCGACG TGACAGTCAA CGCGGACGAG CCTGTCAACA TCGGCTTCTC GCTCTTCAAC GAGGAGACCC ACGCCCCGCT GGCGGACCAG CTCATCAAGG TGCAGGTCAA ACTACCCACC GGCTGGGCGA CCTTCCTGCA CCTGACCACC GACGAGCATG GCATCGCCTC CTACACGGCG CGTGTCCTCA CCACCACGAA CGTCACGGTG ATCTTCGATG GAACGGACGC CCTGCAATCC GCCCGCTCCG AGAACGAAGC GACCCTGCGC GTGCGCCCAG CCGCGCCACC GGCGTCCATC AGGGCCTCCC GCGGCACAGT GAATGCTGAA ACTCCGACGA TCGGCATCGA TGTCCCGGCC AACACGCTCG GGGAAAAAGC CGTATACCTG GCGTCCCTGC AAGCCGGTAA GCCCTACGTT TACGGGGCCA CCGGACCGTA CAGCTTCGAC TGCTCCGGTT TGGTGCAATA CATCTACAAG CAGCTCGGCA AGACACTTCC CCGTACCACC GACCAGCAGT ACGCGGCGAC AACCAGGGTC GCCCAGGGCT CCGAACAGCC AGGCGATCTC ATCTTCTTCG GCCAGCCTGG CGCGATCTAT CATATGGGGA TCTACGCCGG TGGCGGGAAA ATGTGGGTCG CGCCGAAGAC CGGTGACGTG GTGAAGCTCC AGACCATCTG GGCAGACTCC TACTCGGTCG GTCGGGTGAC CTGA
Protein sequence | MLSAQSAQLD AERRRQEGRG RHRAPSVPTT SSRARARARA VAAVTTGTVV VSGMALAGCA PEPSSDGALD DSTSTTSLTL ATQIGSRPAT DGAIQAAAAA DGTPGTVLDA TTDISAPTLS SKIDVGLRVT NPDVTVNADE PVNIGFSLFN EETHAPLADQ LIKVQVKLPT GWATFLHLTT DEHGIASYTA RVLTTTNVTV IFDGTDALQS ARSENEATLR VRPAAPPASI RASRGTVNAE TPTIGIDVPA NTLGEKAVYL ASLQAGKPYV YGATGPYSFD CSGLVQYIYK QLGKTLPRTT DQQYAATTRV AQGSEQPGDL IFFGQPGAIY HMGIYAGGGK MWVAPKTGDV VKLQTIWADS YSVGRVT
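The gene and protein sequences are shown in space-separated blocks of 10 characters. As a minimal sketch of how the two relate, the Python below strips that spacing, computes a GC fraction, and translates codons into residues. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and are not part of any database tool. The codon table is the standard genetic code. NCBI translation table 11 uses the same codon-to-residue assignments and differs only in which codons may act as starts, which does not matter here because the sequence begins with ATG. Only the first three blocks of the gene sequence are used, to keep the example short.

```python
# Standard genetic code with bases ordered T, C, A, G.
# Table 11 shares these assignments; only its set of start codons differs.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError on ambiguous bases such as N (not handled in this sketch).
    """
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence and first block of the protein.
gene_prefix = "ATGCTGTCTG CGCAGAGTGC GCAGCTTGAT"
protein_prefix = "MLSAQSAQLD"

print(translate(gene_prefix))
print(translate(gene_prefix) == protein_prefix)
print(gc_fraction(gene_prefix))
```

Passing the full gene sequence to the same functions would give a value to compare with the GC content field and a translation to compare with the full protein sequence. The record lists the gene on the minus strand, and the displayed sequence already starts with ATG, so it appears to be given in coding orientation; that reading is an inference from the sequence, not something the record states.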