Gene Francci3_3822 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3822 
Symbol 
ID3905570 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4582347 
End bp4583429 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content76% 
IMG OID637881148 
Producthypothetical protein 
Protein accessionYP_482901 
Protein GI86742501 
COG category[I] Lipid transport and metabolism
[R] General function prediction only 
COG ID[COG1597] Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase 
TIGRFAM ID[TIGR00147] lipid kinase, YegS/Rv2252/BmrU family 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0763012 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0331806 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCACGC ATCCCCGGCT CCGCATCCCC GGCTCCGCAT CCCCGGCTCC GCATCCCCGG 
CCGCTGGGGG ACCTGCCGCA CCCGCCCGTA AGCTCGCTCA CCATGACGGA CGCCGCTGCC
CCCCGCACAC CCTCCCGGGG AAGCCTCGGT GCGGTGGCGC CGCAGCCGCT GCGGGTCGAA
CCCGCCGAGG TCGACCGCAA CCGGCTGGCC GTGATCGTCA ACCCGAGCGC GGGTCACGGG
CGGGCGATGC GGATGCTCGA CGGCGTCCGC GTCGAGCTCG CGCGCTGGGC GCGGGACGTC
CGGGTCACCC CGACCCGCGA CCTCGCCCAC GCGGACGATC TCGCGGCGGC GGCCACCGCC
CAGGGCCGGG TCGTGGTCGC GCTCGGGGGC GACGGCCTAG CTGGCTCGGT GGCAGGGGGG
GTGGCCCGCT GCGGCGGCGT GCTCGCGGTG CTCCCCGGCG GGCGCGGTAA CGACTTCGTG
CGTGGTCTCG GCCTGCCCCG CGACCCGTGC CGCGTCGCGG CCGGGCTTGC GCACGCCCGG
GAACGCCGGG TCGACCTGCC CGAGGTCGGC GGCCGGCCGT TTCTCGGGAT CGCGAGCGTT
GGCTACGACT CCGACGTCCA GGTGATCGCC AACCGGACCC GGTTCCTGCG CGGCCAGCAG
GTCTACACCT ACGCGGCGCT GCGGGCGCTG GCCGCCTGGC GCCCGGCGCG CTTCACGGTG
ACGGTGGACG ACCTCGCGCC TCGGGACCTG GTCGGGTGGA CGGTTGCGGC GGCGAACTCG
GCGTACTACG GCGGCGGGAT GCGGTTCGCC CCCGGGGCGG ACATCGCCGA CGGGCTGCTG
GACGTCCTGC TGATCTCGCG CACCTCCCGC CTGACGTTCC TGGCGCTGTT CCCGCGGGTG
TTCTCCGGGC GTCACGTCGA CACCCGGCAC GTGCGGGTGC TGCGGGCCCG GCGGGTGCGC
ATCGAGGCCG ACCGGCCCTT CGCCGTCTAC GCCGACGGCG ATCCGCTGGC GTCGTTGCCG
GCGGAGATCG TCGTGCGCCC CGGTGCCCTG CGGCTGCTCG TGCCGGTTAT TCCGGCGTCC
TGA
 
Protein sequence
MVTHPRLRIP GSASPAPHPR PLGDLPHPPV SSLTMTDAAA PRTPSRGSLG AVAPQPLRVE 
PAEVDRNRLA VIVNPSAGHG RAMRMLDGVR VELARWARDV RVTPTRDLAH ADDLAAAATA
QGRVVVALGG DGLAGSVAGG VARCGGVLAV LPGGRGNDFV RGLGLPRDPC RVAAGLAHAR
ERRVDLPEVG GRPFLGIASV GYDSDVQVIA NRTRFLRGQQ VYTYAALRAL AAWRPARFTV
TVDDLAPRDL VGWTVAAANS AYYGGGMRFA PGADIADGLL DVLLISRTSR LTFLALFPRV
FSGRHVDTRH VRVLRARRVR IEADRPFAVY ADGDPLASLP AEIVVRPGAL RLLVPVIPAS