Gene Francci3_1054 details

Gene Information

Locus tag: Francci3_1054
Symbol:
ID: 3905300
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 1252842
End bp: 1254857
Gene Length: 2016 bp
Protein Length: 671 aa
Translation table: 11
GC content: 71%
IMG OID: 637878388
Product: acyltransferase 3
Protein accession: YP_480165
Protein GI: 86739765
COG category: [I] Lipid transport and metabolism
COG ID: [COG1835] Predicted acyltransferases
TIGRFAM ID:
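
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption that the record does not state, although it is consistent with the listed gene length) and that the gene length includes the terminal stop codon. The function names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1252842
end_bp = 1254857
gene_length_bp = 2016
protein_length_aa = 671


def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one for the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions both comparisons hold: 1254857 - 1252842 + 1 = 2016 bp, and 2016 / 3 - 1 = 671 aa.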


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCTCGCGC CGCTCGGGCA CTCCCCCGCG CTCGACGGCC TGCGCGCGCT CGCGGTCACG 
GCCGTCATCG CATATCACGC CGGCGTCTCG TGGATGCCCG GCGGCCTGCT CGGCGTCGAC
ACCTTCTTCG TCCTCTCCGG GTTCCTCATC ACCGGCCTGC TGATCGCCGA GTACCGCTAC
AACCGGCGCA TCGACCTTCG CGCCTTCTGG ATCCGCCGGT CCCGCCGCCT GATGCCGGCG
CTGCTGCTGC TGCTGCTCGG GGTGGCGGCC TACGCCCGCT GGATCGCCTC CCCCGGCGAC
GTGGGAACCC TGCGGCTGGA CGCGCTCTCC ACGCTGCTCT ATGTCGCAAA CTGGCGTTTC
GCGCTGTCCG ACCAGAGCTA CTTCGACCAC TTCTCGGCTC CGTCACCTCT GCTGCACACC
TGGTCACTGT CCGTTGAGGA GCAGTTCTAC GTCCTGTGGC CGCTGATCGT CTACCTGCTG
ATGCGGCACA GCGCGAAGAC CGTGGACCGC TGGCGCCGGC AGCGCCAGAA GGCCCAGTCG
CTCGCGCTCA CGGTCGCGGT CCTCGGCGCC GAGGCGTCCG CGTTGGTGGG CCTCGCCCTG
CTCGTGGTCG GGACGAACCC CTCCCGGATC TACTACGGCA CCGACAGCCG CGCCCAGGCC
CTGCTGGTCG GGGCCGCCCT CGCCGTCTGG CGGGCCCAGC GGACGACCCC GATCTCCGCC
CGGGCCCGGT CGGTCATGTC GGTCGTCGGC TGTGTCGCCA CGGCTGCGAT GATCCTCCTC
TGGACAACGG TGGGTGGGGA GAGCCGCGGA CTCTACGCCG GCGGCTTCCT CGGGGTTGCG
ATCATCGTGA TGCTGCTGGT CGCCTCGCTC GTGGAAGCGC CCCGAGGCCC GGTGGCGAGG
GTGCTGGCCA CCGCCCCGCT GACCTTCGTC GGCCGCATCT CGTACGGCCT GTACCTGTGG
CACTGGCCGA TCTTCCTCAC GCTGACGGCG ACCCGCACCG GGACGAACGG CGTGGTCCTG
CTGGGGCTCC GCCTCGCCGC CACGCTGGCG ATCACACTGG CCTCGTTCCA CCTCGTGGAG
AACCCGATCC GGCGGGGCAA AGTACGCTTC CCCGTCCCGC GGGTGACGGT GCCCGCCGCC
CTGGGCGGGG TGCTCGCGGT GATCCTGCTG GCGACCGCCG GGACGTCGGC GTCGGTGCGC
ACTCCCGCGG ACCTGGAGGC GTTGGCGCGG CGGGCGGCGA CCTCCCCACA ACCGGCCGTC
GCCGTCAAGG CCGGCGGGAA CCCCCCGATC AAGGTCCTGC TCGGCGGCGA CTCGGTGGCT
CTCACCCTGG GGTTCAGCGA CTTCTCCTCG ATGGCCGCCC AGCAGGGTAT GGAGATCCAC
GACTTCTCCA AGCTGGGGTG TGGCGTCGCA CGGGGGATGC AGCGGCGCAT CCTCGGCAGC
GCCGGTCCCA CCACCGACGG ATGCGACCAG TGGCCGCAAC GGTTCGCCCA ACGGGTGAAC
GAGGTCAACC CCGCGCTCGC GATCCTGCTG GTCGGTCGCT GGGAGGTCAC CGACCAGATG
CGGGACGGCC GGTGGACGCA CATCGGCGAT CCGGGGTTCG ACGCCTACCT CGGCCGGGAA
CTCGACCTCG CCATCGACAC GCTCGGCGCC AAGGGTGCCA AGGTGGTGCT CCTGACGACC
CCGGTGTTCA AGCCGACCGA GGCGCCGAAC GGCGGCATCT ACCCGGAGAC GAAGGCCGAA
CGGGTGGTGC GGTTCAACGC GCTGCTGCGG GCCGCGGCGG CTCGCCATCC CGGCGTCACC
GTGATCGACA TCACGTCAAT TCTCACTCCG GGTGACAAGT ACGTCGACGA ACTGGACGGT
GTCCGGCTAC GCGACGACGA CGGTGTGCAC ATCTCCAATG GCGGTGCCCG ACGGATCGGT
GTCGCGATCA TTCCCCAGCT TCTCCGGATC GCCCGGGGAG CAGCGGCCGC CGCACCCGCG
CCGACCACGC CGACCACGGG AGCGGGACCC GGCTGA
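
The GC content reported for this gene could be recomputed from the sequence as the share of G and C bases. The sketch below is one minimal way to do that. It assumes the sequence is pasted in as text with the 10-base blocks separated by whitespace, as shown above, and the function name `gc_percent` is hypothetical.

```python
def gc_percent(seq):
    """Return the percentage of G and C bases, ignoring whitespace and case."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# Paste the full gene sequence here to compare against the reported value.
# Only the first line of the record is used in this illustration.
first_line = "GTGCTCGCGC CGCTCGGGCA CTCCCCCGCG CTCGACGGCC TGCGCGCGCT CGCGGTCACG"
print(round(gc_percent(first_line), 1))
```

A single line is not representative of the whole gene, so the reported figure should only be compared against the result for the complete 2016 bp sequence.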
 
Protein sequence
MLAPLGHSPA LDGLRALAVT AVIAYHAGVS WMPGGLLGVD TFFVLSGFLI TGLLIAEYRY 
NRRIDLRAFW IRRSRRLMPA LLLLLLGVAA YARWIASPGD VGTLRLDALS TLLYVANWRF
ALSDQSYFDH FSAPSPLLHT WSLSVEEQFY VLWPLIVYLL MRHSAKTVDR WRRQRQKAQS
LALTVAVLGA EASALVGLAL LVVGTNPSRI YYGTDSRAQA LLVGAALAVW RAQRTTPISA
RARSVMSVVG CVATAAMILL WTTVGGESRG LYAGGFLGVA IIVMLLVASL VEAPRGPVAR
VLATAPLTFV GRISYGLYLW HWPIFLTLTA TRTGTNGVVL LGLRLAATLA ITLASFHLVE
NPIRRGKVRF PVPRVTVPAA LGGVLAVILL ATAGTSASVR TPADLEALAR RAATSPQPAV
AVKAGGNPPI KVLLGGDSVA LTLGFSDFSS MAAQQGMEIH DFSKLGCGVA RGMQRRILGS
AGPTTDGCDQ WPQRFAQRVN EVNPALAILL VGRWEVTDQM RDGRWTHIGD PGFDAYLGRE
LDLAIDTLGA KGAKVVLLTT PVFKPTEAPN GGIYPETKAE RVVRFNALLR AAAARHPGVT
VIDITSILTP GDKYVDELDG VRLRDDDGVH ISNGGARRIG VAIIPQLLRI ARGAAAAAPA
PTTPTTGAGP G
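
The protein sequence can be related to the gene sequence through translation table 11, the one listed in the record. The sketch below is a minimal, self-contained illustration and not the database's own pipeline. It assumes the codon assignments of the standard genetic code, which table 11 shares. It also assumes that an alternative initiation codon such as GTG is read as methionine when it is the first codon. The `START_CODONS` set and the function name `translate_cds` are assumptions made for this example.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumed initiation codons for table 11; each is read as M in first position.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(seq):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for index in range(0, len(dna), 3):
        codon = dna[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence in this record.
first_line = "GTGCTCGCGC CGCTCGGGCA CTCCCCCGCG CTCGACGGCC TGCGCGCGCT CGCGGTCACG"
print(translate_cds(first_line))  # MLAPLGHSPALDGLRALAVT
```

The output matches the first 20 residues of the protein sequence above. The leading GTG codon is read as M rather than V because it is the initiation codon.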