Gene Information |
Locus tag | Franean1_0913 |
Symbol | |
ID | 5669327 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 1062862 |
End bp | 1063959 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641239840 |
Product | hypothetical protein |
Protein accession | YP_001505275 |
Protein GI | 158312767 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
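The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS length includes the stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1062862, 1063959)
print(length)                  # 1098
print(protein_length(length))  # 365
```

Both results agree with the Gene Length (1098 bp) and Protein Length (365 aa) fields.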
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.471869 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.209889 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCTGC TGGCTGTTCT CGCCGGCGTG CTGCTGCTCG CCGGGAACGC GTTCTTCGTC GGCGCCGAGT TCGCGCTGAT CTCGGCCCGC CGCGACCGGG TCGAGCCGAT GGCGGAGGAC GGCGACAAGC GGGCGGCGGC CGTCCTGTCG CACATGGAGC ACCTCTCCCC CATGCTCGCC GCCACCCAGC TCGGCATCAC CGTGTGCTCG CTGGCCTTGG GCGCGGTCGC CGAGCCGGCG GTCGCCCACC TCCTCGAGGC CGGGCTCGAG GCGGCGAACG TCCCCGTCGG CGCCCAGCAC GCCATCGCCT TCGTGATCGC GCTGTGCATC GTGGTGTCGC TGCACATGGT GCTGGGCGAG ATGGTCCCGA AGAACCTCTC GATCGCCGGC CCGGAGCGCG CGGCGCTCTG GCTGGGCCCG CCGCTGTTCG CGTTCGCCCG GTTCACCCGG CCGTTCATCG CCTTCCTGAA CCACTTCGCG AACGCGGTGC TGCGGCTGCT GCGGGTCACC CCGTCGGACG AGCTGACCTC CGCCTACACC CCGGAGGAAC TGGGGGCGCT GATCGGGCAG TCCCGGCAGG AGGGCCTGCT GCCCGCCGGT GAGCACGAGC TGCTCACGCA CGCGCTCGAA CTGTCCGGAC GGACCGTCCG GACAGTGATG ATCCCGTTGT CGGAGATCGT CACCGTGCCG TGGACGGTCA CCGCCGCCCA GCTGGAGGAG GCCGTGGCCG AAACGGGCTA CTCCCGGTTC CCCGTCCGTG CCCCGGGCCA GGACGCCGGT CGCGAGCCAG GCGGTGGCGT GGGGCCCGTG GCGGAGCCGG CCGGCTTCCT GCACGCCAAG GACGTCCTCG GTGTTCCCGA GCAGGAGCGC GACGAGCCGC TGCCGCCCCG CCGGCTGCGC CGGATGGCCG AGATCGGGGT CGACCTGCAC CTGGACGAGG CGCTCCGCCT CATGCAGCGC ACCAACAGCC ACCTGGGCCG GGCGGTGGAC GCGGCCGGCA CCACCCTGGG CGTCGTCGCC ATGGAGGACG TCGTCGAGGA GTTCGTCGGC GAGGTGGAGG ACGCGAGCCA TCGCGAGACG GCCGACCCCC GACCGTGA
|
Protein sequence | MNLLAVLAGV LLLAGNAFFV GAEFALISAR RDRVEPMAED GDKRAAAVLS HMEHLSPMLA ATQLGITVCS LALGAVAEPA VAHLLEAGLE AANVPVGAQH AIAFVIALCI VVSLHMVLGE MVPKNLSIAG PERAALWLGP PLFAFARFTR PFIAFLNHFA NAVLRLLRVT PSDELTSAYT PEELGALIGQ SRQEGLLPAG EHELLTHALE LSGRTVRTVM IPLSEIVTVP WTVTAAQLEE AVAETGYSRF PVRAPGQDAG REPGGGVGPV AEPAGFLHAK DVLGVPEQER DEPLPPRRLR RMAEIGVDLH LDEALRLMQR TNSHLGRAVD AAGTTLGVVA MEDVVEEFVG EVEDASHRET ADPRP
|
| |
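The gene and protein sequences above are printed in space-separated blocks of ten. A minimal sketch of how such a listing could be cleaned, checked for GC content, and translated is given below. It assumes the gene sequence is already shown in coding orientation (5' to 3' as transcribed, even though the gene lies on the minus strand) and uses the standard amino-acid assignments, which translation table 11 shares with the standard code. The helper names are hypothetical and this is not the database's own pipeline.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the final 18 bases of the gene sequence in this record.
print(translate("ATGAACCTGC TGGCTGTTCT CGCCGGCGTG"))  # MNLLAVLAGV
print(translate("GCCGACCCCC GACCGTGA"))               # ADPRP
```

The two outputs match the first block and the last five residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would give the value to compare against the GC content field.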