Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_3822 |
Symbol | |
ID | 3905570 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 4582347 |
End bp | 4583429 |
Gene Length | 1083 bp |
Protein Length | 360 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 637881148 |
Product | hypothetical protein |
Protein accession | YP_482901 |
Protein GI | 86742501 |
COG category | [I] Lipid transport and metabolism [R] General function prediction only |
COG ID | [COG1597] Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase |
TIGRFAM ID | [TIGR00147] lipid kinase, YegS/Rv2252/BmrU family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0763012 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0331806 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCACGC ATCCCCGGCT CCGCATCCCC GGCTCCGCAT CCCCGGCTCC GCATCCCCGG CCGCTGGGGG ACCTGCCGCA CCCGCCCGTA AGCTCGCTCA CCATGACGGA CGCCGCTGCC CCCCGCACAC CCTCCCGGGG AAGCCTCGGT GCGGTGGCGC CGCAGCCGCT GCGGGTCGAA CCCGCCGAGG TCGACCGCAA CCGGCTGGCC GTGATCGTCA ACCCGAGCGC GGGTCACGGG CGGGCGATGC GGATGCTCGA CGGCGTCCGC GTCGAGCTCG CGCGCTGGGC GCGGGACGTC CGGGTCACCC CGACCCGCGA CCTCGCCCAC GCGGACGATC TCGCGGCGGC GGCCACCGCC CAGGGCCGGG TCGTGGTCGC GCTCGGGGGC GACGGCCTAG CTGGCTCGGT GGCAGGGGGG GTGGCCCGCT GCGGCGGCGT GCTCGCGGTG CTCCCCGGCG GGCGCGGTAA CGACTTCGTG CGTGGTCTCG GCCTGCCCCG CGACCCGTGC CGCGTCGCGG CCGGGCTTGC GCACGCCCGG GAACGCCGGG TCGACCTGCC CGAGGTCGGC GGCCGGCCGT TTCTCGGGAT CGCGAGCGTT GGCTACGACT CCGACGTCCA GGTGATCGCC AACCGGACCC GGTTCCTGCG CGGCCAGCAG GTCTACACCT ACGCGGCGCT GCGGGCGCTG GCCGCCTGGC GCCCGGCGCG CTTCACGGTG ACGGTGGACG ACCTCGCGCC TCGGGACCTG GTCGGGTGGA CGGTTGCGGC GGCGAACTCG GCGTACTACG GCGGCGGGAT GCGGTTCGCC CCCGGGGCGG ACATCGCCGA CGGGCTGCTG GACGTCCTGC TGATCTCGCG CACCTCCCGC CTGACGTTCC TGGCGCTGTT CCCGCGGGTG TTCTCCGGGC GTCACGTCGA CACCCGGCAC GTGCGGGTGC TGCGGGCCCG GCGGGTGCGC ATCGAGGCCG ACCGGCCCTT CGCCGTCTAC GCCGACGGCG ATCCGCTGGC GTCGTTGCCG GCGGAGATCG TCGTGCGCCC CGGTGCCCTG CGGCTGCTCG TGCCGGTTAT TCCGGCGTCC TGA
|
Protein sequence | MVTHPRLRIP GSASPAPHPR PLGDLPHPPV SSLTMTDAAA PRTPSRGSLG AVAPQPLRVE PAEVDRNRLA VIVNPSAGHG RAMRMLDGVR VELARWARDV RVTPTRDLAH ADDLAAAATA QGRVVVALGG DGLAGSVAGG VARCGGVLAV LPGGRGNDFV RGLGLPRDPC RVAAGLAHAR ERRVDLPEVG GRPFLGIASV GYDSDVQVIA NRTRFLRGQQ VYTYAALRAL AAWRPARFTV TVDDLAPRDL VGWTVAAANS AYYGGGMRFA PGADIADGLL DVLLISRTSR LTFLALFPRV FSGRHVDTRH VRVLRARRVR IEADRPFAVY ADGDPLASLP AEIVVRPGAL RLLVPVIPAS
|
| |