Gene Francci3_2484 details


Gene Information

Locus tag: Francci3_2484
Symbol:
ID: 3904862
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 2929339
End bp: 2930742
Gene Length: 1404 bp
Protein Length: 467 aa
Translation table: 11
GC content: 72%
IMG OID: 637879814
Product: diacylglycerol O-acyltransferase
Protein accession: YP_481580
Protein GI: 86741180
COG category: [R] General function prediction only
COG ID: [COG4908] Uncharacterized protein containing a NRPS condensation (elongation) domain
TIGRFAM ID:
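
The coordinates and lengths listed above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends with a single stop codon; the function names `span_length` and `coding_codons` are illustrative and not part of the database.

```python
# Values copied from the Gene Information fields above.
START_BP = 2929339
END_BP = 2930742
GENE_LENGTH_BP = 1404
PROTEIN_LENGTH_AA = 467


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks print True when the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 2930742 - 2929339 + 1 gives 1404 bp, and 1404 / 3 - 1 gives 467 amino acids, matching the listed values.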


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.26387
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCCTGC TCGACGTGGT CACGAAGGCC GGTCCCGAGC GGACATACGC TCGTCCACTG 
ACCTCGGGTG ACCGGGCCTA CCTCGCCTTC AGCCGGCTCA ATCCCGGGGA GTTCCAGGAC
GTCGGTGCGC TGTTGTTCGT CGACGGCCCT CCGCTGGAGC TGGCGGACCT GCGTGCGCAC
GTCGCCGAGC GGCTGCGCGA CCCCCGGGCA CGGATGCTTA CCGACCGTCT CGAGACTGTC
GCGGTCCGGT TCTCCGGTCG CCGCTCGACG GCGAACGAGA CCCTCTGGGT CCGCGGTCCG
GGCCTGAACG TCGACGACCA CGTTGTCGCC ATGGACCTAC CGTCCGCCGA CGGTGTTGGC
GGTGTTGGCG GTGTGGGCGG GGGCGACGGT GGTGCCGGCG ATGCGCGGCT GCGTGCCGCC
GTCGACCGCA TTGCCGCCCA GCCGATCGAC CTCAGCCGGT CGCCGTGGAT GCTCTACCTG
CTGCGCGCCC CGGGGTCGTC CGGCACGGTG CTGGTGTATC GGACCAGCCA CATCCAGCAG
GACGGCTTCG CGCTCTACCA GGCGCTGCAC CTGCTCTTCG GGGGGGAGGA CGAACTCGAC
CTCGGTCTGC CTCCCGCTGT CCGCCGGCCG CGGCCCTCGG ACTATGCGGG TTTCGTCGGC
CGGGGGCTGC GTTGCCTGGC GCCCACCCGC GCCCTCGACG CCTGGGGCGG GCCGCCGGCC
GGGCCGGCGC GACATTCCTG GGTCACCACC GATCTGGGCA CGCTGCGAGC GGTGGCCCGC
CACCACGATG TCACGGTCAA CGACGTCTAT CTCGCCGCGC TCGCCGGGGC GCTGCGGACG
TGGTCCCTTC CCGAATGGCG CCGGGACCGT CAGCCCATCC ACGCGCTGAT GCCCATCAGT
TTCCGGACCG CCGCCGAACG GAACGTGCTG TCGAACTATT CCTCGGGCGT GCGTGTCCCG
CTTCCCTGCG GGGAGCCGGA CCCCGGCCGG CGACTGGCGT GGATCGCCGC GGAGACCCGT
CGCATGAAGA AGGGCGGGCT GGGGGTGGTG GAACGTCACC ACTTCTCCAC CCTGGCGACG
ATCGCCTCGC CCCGCATGCT CGCGGGCGCC GCCGCTTACC CGGCCCAGAT CCGGAAGATG
GCCCTGGTGG CGAGCAACGT CCGGACGATG CGCGGGCCGC TGGCTGTCGC GGGACGCGCG
GTGACCGGGC TGGTCGGCAC GGGCCAACTA CTCGTCGGAC GTCAGCATCT GGCCGTCGCC
ATGCTTGGCA TCGACGAGCG GGTCGGTGTC ACGTTCGTGG CCAGTGCCAG CGTGCCGAAC
CACGCCCGGT TGGGGCAGCT GTGGCTGGCG GAACTCGACG TGCTCAGCCG GCTGGTGCCC
TCACGCCGGC GTGAGCGTGC CTGA
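
The gene sequence above is displayed in space-separated blocks of ten bases, so it has to be cleaned before its length or GC content can be compared with the listed values. A minimal sketch, assuming the sequence has been pasted into a string; the helper names `clean_sequence` and `gc_fraction` are hypothetical:

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in a cleaned sequence that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # Only the first displayed line is used here for brevity; paste the
    # full gene sequence to compare against the listed length and GC content.
    first_line = (
        "ATGGCCCTGC TCGACGTGGT CACGAAGGCC "
        "GGTCCCGAGC GGACATACGC TCGTCCACTG"
    )
    seq = clean_sequence(first_line)
    print(len(seq), round(gc_fraction(seq) * 100))
```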
 
Protein sequence
MALLDVVTKA GPERTYARPL TSGDRAYLAF SRLNPGEFQD VGALLFVDGP PLELADLRAH 
VAERLRDPRA RMLTDRLETV AVRFSGRRST ANETLWVRGP GLNVDDHVVA MDLPSADGVG
GVGGVGGGDG GAGDARLRAA VDRIAAQPID LSRSPWMLYL LRAPGSSGTV LVYRTSHIQQ
DGFALYQALH LLFGGEDELD LGLPPAVRRP RPSDYAGFVG RGLRCLAPTR ALDAWGGPPA
GPARHSWVTT DLGTLRAVAR HHDVTVNDVY LAALAGALRT WSLPEWRRDR QPIHALMPIS
FRTAAERNVL SNYSSGVRVP LPCGEPDPGR RLAWIAAETR RMKKGGLGVV ERHHFSTLAT
IASPRMLAGA AAYPAQIRKM ALVASNVRTM RGPLAVAGRA VTGLVGTGQL LVGRQHLAVA
MLGIDERVGV TFVASASVPN HARLGQLWLA ELDVLSRLVP SRRRERA