Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_1461 |
Symbol | |
ID | 3903098 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 1752952 |
End bp | 1754052 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637878799 |
Product | NLP/P60 |
Protein accession | YP_480567 |
Protein GI | 86740167 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0791] Cell wall-associated hydrolases (invasion-associated proteins) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.495869 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00837381 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | TTGCCCCGTC GTGGTATGGC CTGGCTTCCG GAGGACTGGG ACCAGTTCGT CCGGAATCGT CAGCGCCCGC CGGCGCCCCA CGGCTCCTCA GGCTCCGGCG CGGCTCGCCG CCGTACCACT CCTCCTCCGC GACGCGCGGT CACCATCGCC GCGTTCACCA CCGGGACGAT CGCGGCCTCG ACCGCCGCCT TCGCGGCCAC CGTCCCCGGG GGCGCCGACG GAACCTCGTC ACTGGAGTCG AACTCCCTCA CCCAGGAAGC CGCGCTCGTC CCGGGACACC ACGCCACCGG CGGCGATGCG GCCTCGGCCC GCAGGCTCGC CCCGTTGGCC ACCGCCATCG CGGACCACGA CCCCGTCTTC ACCAGCGTCT CGATCGCCGC GGACAAGACC TCGGTCGCGC CCAACACCCC GGTGGTGCTC ACGGTGCGGG CGTTGGAATC GGACAGCGGC ACCCCGCTCG CGAACCAGGA CGTCCGTATC GTCGTGGTGA ACGGTCCCCA GTGGCAGACC TCCACGAGGT TGCGGACCGA CGCGAACGGC GCCGCGCAGA TCACCGCGCG CCTGCTCTCC ACGACGACGA TCACCGCGGT CTTCGACGGA TCGAACGCCC TGCGTCCCTC CGTGGCCGGT GCCGCCACGG TGACGATCGC GAGTCCGACG GGTCCGGGAC GGTCGGGTTC GGGAGGGTCG GGTTCGGGAG GGTCGGGTTC CGTCATCGAC CAGGCGATCC CAAAGGTCAT CCCGGGTAGC TCGATCGGGG AGAAGGCCGT CTACCTCGCG TCGTTGAACA AGGGTAAGCC GTATGTCTGG GGCGCGGAGG GTCCGTACTC GTTCGACTGC TCCGGACTCG TCCAGTACGT CTTCAAACAA CTCGGCCGCT CGCTGCCGCG CGTGGCCGAG GACCAGTACC GGGTCTCGAT GAAGGTGCCC CAGTCCGGCA AGCAGCCCGG CGACTTGATC TTTTACGGTA CTCCTGGGAA TATCTACCAT GTCGGCATCT ATGCCGGGAA TGGGTATATG TGGGCTGCCC CGCAGACCGG CGGTGTCGTG TCGCTGCGGC CCATCTACAG CTCCACCTAC AAGGTCGGCC GCATCCTCTG A
|
Protein sequence | MPRRGMAWLP EDWDQFVRNR QRPPAPHGSS GSGAARRRTT PPPRRAVTIA AFTTGTIAAS TAAFAATVPG GADGTSSLES NSLTQEAALV PGHHATGGDA ASARRLAPLA TAIADHDPVF TSVSIAADKT SVAPNTPVVL TVRALESDSG TPLANQDVRI VVVNGPQWQT STRLRTDANG AAQITARLLS TTTITAVFDG SNALRPSVAG AATVTIASPT GPGRSGSGGS GSGGSGSVID QAIPKVIPGS SIGEKAVYLA SLNKGKPYVW GAEGPYSFDC SGLVQYVFKQ LGRSLPRVAE DQYRVSMKVP QSGKQPGDLI FYGTPGNIYH VGIYAGNGYM WAAPQTGGVV SLRPIYSSTY KVGRIL
|
| |