Gene Franean1_1768 details


Gene Information

Locus tag: Franean1_1768
Symbol:
ID: 5670170
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 2123879
End bp: 2125657
Gene Length: 1779 bp
Protein Length: 592 aa
Translation table: 11
GC content: 67%
IMG OID: 641240689
Product: phospholipase D/transphosphatidylase
Protein accession: YP_001506112
Protein GI: 158313604
COG category: [I] Lipid transport and metabolism
COG ID: [COG1502] Phosphatidylserine/phosphatidylglycerophosphate/cardiolipin synthases and related enzymes
TIGRFAM ID:
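The start, end, gene length and protein length fields in this record are mutually consistent, which the short sketch below checks. It assumes the coordinates are 1-based and inclusive and that the gene length includes the stop codon; the record does not state either convention explicitly, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2123879
end_bp = 2125657

# Assumption: 1-based, inclusive coordinates, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS length includes one terminal stop codon,
# which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1779 bp
print(protein_length, "aa")  # 592 aa
```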


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.346269
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0112079
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCGCACG CGCCGCACAA TATCGACACG CTGACAGCCG CCTACTTTAC CCGCCCCACC 
GACCTTCCGT TCTCCCCGCC TCCTGGAGAA ACCGCGCCGG AGCAGTGGGC GCCACGGGAT
CTGGGGCCGA ACGACGTCGA GGCATTCATC GATGGTCTAG CCTATTTCAC CGCCGTCGAG
AACGAGATCG ACCAGCTCAT CGCCGGTCCG GTCGGCGCGG GGTTCTTCTA CTGCACGGCG
TGGTGGCTCG GCCTGGTCTC GACGCCGGCG ACGGTGCGGA TCCGCTCGTT CGACAAGTCG
ATCGGAAAGA TCGCCGCGAC AGTGGGGATC GACCTCCAGG ACGAATTTAC CCGAAATCTG
GGCCAGGACG AGTTCAAACT ACCCAGCGGC ACCCCGCTGC GATCGAAGCT CGCCCAGCTC
ATCGGTCGGA AGGTGGATGT CCGGATCCTG GCCTGGACGT CGCCCTTCGC GCCGAAGTAC
AAGCCGGTGG CCGACCTGCT GGGAGGCCTG ACCGACCTCA ATCTCCACAC CATCCTCAGC
GTTGACTCGC TGCGGCGGAT CTACGGGCCC GCCGGCCAGA ACAAGATCCT GCTCAACACG
TGCGCGCATC CGCTGGGGGC CGCCCACGCC AAGATGATCG TGTGCGGCAG CGCGAATCGG
CTCGCCGCTT TCACCGGCGG GCTCGACGCG GCACCCGGCA GAATGATGCC GGTGCCCTGG
GCGCTGGCGA GTCCGACCCG TGGCTGGCAC GACGTGGCGG TCCGGGTCCA GGGCCCGGCG
GCCGGGGCGA TGCACAACTT CTTCCGCAGT CTGTGGAATG AGCAGCTCAA ACAGCCGATC
GACACGTTCG TCCTCGACGA CCAGCAGATT CCCACGCATT TCGCGGACTC GCCCCCGGTT
CCGGCCGCCC CGGCGCCGCC GGCCGTCGTC GGACCGCGCC AGCGGGTTCA GGTCCTGCGG
ACGGTGCCGC AGATGAAGTT CTCGGCGGCC GGCCCGGAAA GGCTTGGCCC GTCCGCCGCG
CAGCGATGGC TTCTCACCAC CGCGGCCGGG TTCAGGCGAG CGCCCATCAC CTTCGCCCCG
GATGGAATCT TCGAGTTTCA CGTAGCGCTG CGAAAGGCGA TTTCACAGGC CACCCGGTAC
ATTTTCATCG CCGACCAGGC ATTCTCCTCG CAGGAGGTGA TGGGCTGGCT CAACGCCAGA
CTCATCCAGC AGCCGGGCCT GAAAGTGATC CTGGTGCACG GCGCCGACCC CGCCGACCCG
CCGTCCGGGC TGCTGAACCA CGCCGTCAAC AACTTCCTGC TGCCCCACCT GCCGGGAGTC
GGTGGAAACC CGAACAATGT GATGGTCTGT GGCTGGGCTG GCGTCACGGT CCACAGCAAG
GTTGTCATCA TCGATGACCA GTGGTGTGCG GTCGGCTCGG CGAACTGTAT GCGCCGCAGC
CATTTCACGG ATATCGAGCT GTCGATCGCC GTGCTTGACG AGGACGAGCC TGGTTTCGCT
CAGCGGCTTC GCCGGGACCT CTGGGCGCGG TACTGCGGCG TGCCGCTGCC GGGGGAGTCG
ATCGTCTTCT CCTACGACGA CGAACTCACC GCCCTGCTCG ACCTCAACCG GGCGCTGGGG
ATCTGGAAGC CGGCGTGGGG GTCCGGGGCG GCGCTGCCGA TGGCCGCGCA CCTGCTCGGC
TCGGTCCAGC CGTACCCGCT GCCCATGCCC GCGCCCACGA TCGCCTACAG CGAGGCGGCG
TACGACGCGC AGGACATCGA CTCGCGCAAG GTTGTCTGA
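
A GC content figure like the one reported for this gene can be computed from a sequence in the layout shown above (blocks of ten bases separated by spaces). The following is a minimal sketch, not the database's own method; the function name gc_percent is hypothetical. It is demonstrated only on the first ten-base block of the gene sequence.

```python
def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (spaces, newlines) between sequence blocks is ignored.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First ten-base block of the gene sequence in this record.
first_block = "GTGTCGCACG"
print(f"{gc_percent(first_block):.0f}%")  # 70%
```

Passing the full gene sequence, pasted as a multi-line string, would give the gene-wide value for comparison with the GC content field.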
 
Protein sequence
MSHAPHNIDT LTAAYFTRPT DLPFSPPPGE TAPEQWAPRD LGPNDVEAFI DGLAYFTAVE 
NEIDQLIAGP VGAGFFYCTA WWLGLVSTPA TVRIRSFDKS IGKIAATVGI DLQDEFTRNL
GQDEFKLPSG TPLRSKLAQL IGRKVDVRIL AWTSPFAPKY KPVADLLGGL TDLNLHTILS
VDSLRRIYGP AGQNKILLNT CAHPLGAAHA KMIVCGSANR LAAFTGGLDA APGRMMPVPW
ALASPTRGWH DVAVRVQGPA AGAMHNFFRS LWNEQLKQPI DTFVLDDQQI PTHFADSPPV
PAAPAPPAVV GPRQRVQVLR TVPQMKFSAA GPERLGPSAA QRWLLTTAAG FRRAPITFAP
DGIFEFHVAL RKAISQATRY IFIADQAFSS QEVMGWLNAR LIQQPGLKVI LVHGADPADP
PSGLLNHAVN NFLLPHLPGV GGNPNNVMVC GWAGVTVHSK VVIIDDQWCA VGSANCMRRS
HFTDIELSIA VLDEDEPGFA QRLRRDLWAR YCGVPLPGES IVFSYDDELT ALLDLNRALG
IWKPAWGSGA ALPMAAHLLG SVQPYPLPMP APTIAYSEAA YDAQDIDSRK VV
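
The protein sequence follows from the gene sequence under translation table 11 (the bacterial, archaeal and plant plastid code). The sketch below illustrates this on the first and last codons of the gene. It is a simplified, hand-written translator rather than the tool used by the database: the names translate, CODON_TABLE and START_CODONS are hypothetical, and only three of the initiator codons allowed by table 11 are handled. Note that the gene begins with GTG, which is read as methionine when it serves as the start codon.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering. The amino acid
# assignments of table 11 match the standard code; the tables differ in
# which codons may act as initiators.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Simplification: a subset of the start codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An alternative start codon at position 0 is read as methionine.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("GTGTCGCACG CGCCGCACAA TATCGACACG"))  # MSHAPHNIDT
# Last 12 bases of the gene sequence, ending in the TGA stop codon.
print(translate("AAGGTTGTCTGA"))  # KVV
```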