Gene Information |
Locus tag | Franean1_2114 |
Symbol | |
ID | 5670514 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2540300 |
End bp | 2541754 |
Gene Length | 1455 bp |
Protein Length | 484 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641241035 |
Product | CBS domain-containing protein |
Protein accession | YP_001506456 |
Protein GI | 158313948 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
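The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration, not part of the record: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single untranslated stop codon. The variable names are hypothetical.

```python
# Values copied from the Gene Information table.
start_bp = 2540300
end_bp = 2541754
gene_length_bp = 1455
protein_length_aa = 484

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
computed_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and ends in one stop codon
# that does not contribute an amino acid.
computed_protein_length = computed_length // 3 - 1

# Consistency flags for the reported fields.
in_frame = computed_length % 3 == 0
length_matches = computed_length == gene_length_bp
protein_matches = computed_protein_length == protein_length_aa

print(computed_length, computed_protein_length, in_frame)
```

Under these assumptions the reported gene length and protein length agree with the coordinates.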
| |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACTCCG GTGATCTTCT CCTCGTCTTC ATCGCCGCGG TGACCGCGCT CGCCGCGGCC GGCCTGGGCG CGGTGGACGC GGCGCTGACC AGGGTCTCCC GGGTCAGCGT CGACGAGTTC GCCCGGCAGG GCAAGACGGG GGCGGCGAAC CTCGCCCGGG TCGTCGCCGA CCCCGGGCGC TACCTGGCGT TGTTGTTGCT GCTGCGGATC GCCGGCGAGA TGGCCGCGGC GGCCTGCATC ACCGTCCTGG CCGTGCACGC CTACGGCTCC GGGTTCGCCG CCGTCGGGCT GGGGGCCCTG GTGTCGACGC TCGTCGCCTA CATCCTCGTC GGCGTGATGT TCCGGACGCT CGGCCGCCAG CACGCCCCGG CGGTGTCGCT GGCCACCGCC GGGCTGACCA TCCGGCTCGC GAAGGTCTTC GGCCCGCTGC CCCGCCTGCT GATCGCCTTC GGCAACGCGG TGACGCCCGG CCCCGGCTAC CGGGACGGCC CGTTCGCCTC GGAGGCGGAG CTGCGCGACC TCGTCGACCT CGCCGAGGAG AACGAGGTCA TCGAGCCCGA GGAGCGCGAC ATGATCGCCT CGGTGTTCGA GCTGGGTGAC ACGCTCGTCC GCGAGGTGAT GGTGCCGCGG CCCGACATGG TCTTCATCGA GTCCGACAAG ACCGTCCGGC AGGCGATCTC GCTGGCGCTG CGCAGCGGCT TCTCCCGCAT CCCGGTGATC GGCGAGAGCA TCGACGACGT CGTCGGCATC GGCTTTCTCA AGGACATGGT CGGCTGGGAG CGGGAGGGCC GGGAGAGCAG CCGGGTCGCG GAGGTCATGC GCCCACCCGT GCTCGTCCCG GAGAGCAAGC CCGCCGACGA CCTGCTGCGC GAGATGCAGG CGTCGCGCAC CCACATGGCC ATCGTCATCG ACGAGTACGG CGGCACGGCC GGCCTGGTGA CCATCGAGGA CGTCCTCGAG GAGATCGTCG GTGAGATCAC GGACGAGTAC GACAGCGCCA CGCCGCCGGT CGAGTGGCTC GACGACGACA CCGCCCGGGT GACGGCGCGG CTCGACGTCG ACGACCTCGC GGACCTGTTC GGCGTCGAGG AGCTGCCCGG AGCGCAGGAC GTCGAGACTG TCGGCGGGCT GCTCGCGAAC GCGCTGGGCC GGGTGCCCAT CCCCGGCGCG ACGGCGGACG TCGCCGGCCT GCGGCTGTCG GCAGAGCGCG CGGCCGGGCG GCGCAACCAG ATCGGCACCG TCGTCGTCCA CCGGCTGTCC CCGGCACCGG GCAACGGCGG GAACGGAGCC GGCAGGAGCG GAGCCGGCAA GGGCGCCACC GGCAAGGGCG CCACCGGGAG TGACAGCAAG GGCAGCAACG GCAAGGGCAC CAACAGGAAA ACCGACGGCA AGAAGACCGA CGGCGAGTCG GAGGGCCACC CGCCGGGCCC AGCGAGCAGA AAGGTGACAT CGTGA
Protein sequence | MDSGDLLLVF IAAVTALAAA GLGAVDAALT RVSRVSVDEF ARQGKTGAAN LARVVADPGR YLALLLLLRI AGEMAAAACI TVLAVHAYGS GFAAVGLGAL VSTLVAYILV GVMFRTLGRQ HAPAVSLATA GLTIRLAKVF GPLPRLLIAF GNAVTPGPGY RDGPFASEAE LRDLVDLAEE NEVIEPEERD MIASVFELGD TLVREVMVPR PDMVFIESDK TVRQAISLAL RSGFSRIPVI GESIDDVVGI GFLKDMVGWE REGRESSRVA EVMRPPVLVP ESKPADDLLR EMQASRTHMA IVIDEYGGTA GLVTIEDVLE EIVGEITDEY DSATPPVEWL DDDTARVTAR LDVDDLADLF GVEELPGAQD VETVGGLLAN ALGRVPIPGA TADVAGLRLS AERAAGRRNQ IGTVVVHRLS PAPGNGGNGA GRSGAGKGAT GKGATGSDSK GSNGKGTNRK TDGKKTDGES EGHPPGPASR KVTS