Gene Francci3_0052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0052 
Symbol 
ID3903531 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp64448 
End bp67876 
Gene Length3429 bp 
Protein Length1142 aa 
Translation table11 
GC content73% 
IMG OID637877382 
Producthypothetical protein 
Protein accessionYP_479175 
Protein GI86738775 
COG category 
COG ID 
TIGRFAM ID[TIGR03607] patatin-related protein 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGACC CGGCCACCCA GGAGGTACGG CTCGCGGTCG TCATGACCGG TGGCGCGAGC 
CTCGCCGTCT GGATGGGCGG GGTCGCCACC GAGATCAACC TTGCGACCAG CCCGGACCGT
GACCGGGCCG ACGACGCGGA CGCCGCCGTC GCTGCCCGTT ACGCCCGGCT GGCGACCATC
CTCGACGTCG AGGTCAGTGT GGACGTACTC GCCGGCACCT CGGCCGGTGG CATCAACGCC
GCCATGCTGG GCTACGCGAA CACCCACCAC GCCGATCTCA CCCCGCTGCG GGACCTGTGG
CTGTCCCTCG GCTCGTTTGA CGCGCTCATG CGCACCCCCC ACGAGAAGAC GTATCCCTCG
CTGCTGCACG GGGACACCGC CGTGCTGCCC GCGCTGCACA CCGCCTTGAC AGCGGTGGGG
GAGACGGCCC GAGCCGGCAT GGACCGAGCC GGCATGGACC GAGCCGGCAT GGACCGAGCC
GGCATGGACC GAGCCGGCAT GGACCGAGCC GGCATGGACC GAGCCGGCAT GGACCGCCCG
ACATCGGTCT TCATCACTAC GACGATCCTG CGCGGCGAGG TCGCCGAGCA CGCCGACTCA
CTCGGCGGCA CCATGTACGA CGTCGACCAC CGGGGCCTGT TCGCCTTCGG CACGCAGGAC
CTGACCGACC CTGGCGCGGT CCCCCGTCTC GCCCTGGCCG CACGGGCCAG TTCCGCCTTT
CCGGGCGCCT TCGAGCCGGC CTACGTGCCG GTGGGGCACG CCGTCGACGC CGCGCACCCG
GACATGGCCC GCTACGTCAA CGCCTCCCAC GGATTCCACG CATCGGACGG CGGGATCCTG
GTCAACCGCC CGATCGGACC GGCGCTGGCG GCCATCTTCG ACCGGCCCGC CGAACGCCAA
GTCCGTCGCG TGCTCGCCTA CGTCGTGCCC TCGCCGCGGA TGCCGGTACC GACGAATCCG
GCGGCACCAA CGAATCCGGC GGCACCAACC AGGACGTCCA CACCGCCCGC CCTGCCTGTG
CCACCCACGG TGATCCCCAC CCTTCTCCAG GTGCTCGGCG CGGCGCTGAA CCAGTCGATC
GGTACAGATC TGGCGACGCT GCGCGACCAC AACAGCGCCG TACGCGGTAC GCGGGCGAAC
CGGCGACGGC TGCTGCGCCT CGCCCCGGCC GGTGGTCCCC GATTAGCGGA CGAGAGCGTC
TATGACGCCT ACCGCCGCAG CCTGGCAGAG GAGATCGCCC CACCGGTGAT CGAGGCGCTG
CTGCGCGTCC TCGGTGGTCG GTTCGATCTG CCCCGTCCCC CGGACACCGA AGCGCTGCGC
ACCGACGCGG CCGGACCCGC CCGGATGACG AACGCCGCGG TCAACGCCGT GATCCAGATG
CTTCCCGACC GGCTGCCGAC CACGGCCGAC CTCGCCGACC TGTGGCGGCT GGGCCGACCC
GCGCTCGACG CCGCCAAGGG TCTACTGATC AACATGATCA ACGAAGGGTA CGTCCTCTCC
CCCGAGCCGG CGGACCGGAT CCGACTTGCC CGGCTCGCGG CGGCCGTGCA CGGCGCATCG
CACGCCACCA TCCGGAACAC CGACACCACC GACACCACAG CCACCACAGC CACCGCGGCC
CTGCGGCCCG GCGTCTTCAC GACGGTCAGC GAGACCCTCG GTGCGATGGC CGGGGCACCG
CTGCTCGACG TCATCGCCGA GGCTGCCCGC CGCTGGTTGC GGACCGACCC GGAGGGCGAC
GCGGGGCAGG ACGAGCTCAC CCGGGCCTGG CGCTCCCTGG AGCGGATGAT CGAGACCCTG
CGGACCGAGC TCACCGATCT CGTGGACCGG CGGTCCCCCG CGGCCCGGCC TCCGGCGGGA
CCAGAGACCA GGCCCGAGGG TCTCAGCGTC GGCCAGCGCC GGGCGGTGGC GGCCCGGACC
CTCGCGGACT TCGTCGGGTA CCTCCCGTCG ACTCCCGCGG ACGCCCTGAT CGCCGTGCTC
GACGTGCACC TGGTCGAACG AAGCACCGGC GCCGCGGTCC TCGACCAACC GGTTGAGCTC
GTCCAGATCA GCGCGGATCT ACCGAACCGC CTCGATCCCG CCCGCGCACT GGCCGAGGAG
AAGGTCACAG GCCTGCAGCT GGGAAACTTC GGCGCCTTCG CGAAGTCCTC GTGGCGGGCC
AACGACTGGA TGTGGGGACG GCTGGACGGT GCGGGCTGGC TGGTCCGGAT CATGCTGGAT
CCACGCCGGC TCGTGATCCG CCGGGACACC GCCGTCCCCG CCGGTCACGC CCACGCCGCG
GCGCTGGCCC GCCGGGGCTG GCTGGTGGAT CTGGTCGACG ATCTCACCGA GGTGGCCGGC
ATCCCGGCGC CGCGGGAGGT ACTCGACGAG CTTGGCTTTC TGACCGATCC GGACGCCCCC
GTGCCGCCGA ACCTTCCGGT CACGGCGACC TGGGTCGCCG CCGGCATCCA ACGGGACATC
GCGGCCAAGG AGCTGGTGGG GGTCGCCGAG GCGGTGCGGC GCGACAACAA GGCCGGGGTC
GATCCGCGGC CCACCGCCGA CTTCCTCGCC GCCGTCGACC GGGCCCTGAC CGTGGAACCG
AGCCGGCCCG GGACGACGAC CGGACCGGGG GCCTCGGCGG TACCGGAAGG GACAGGCTCC
CCGAACCTGG ACGAGCCGCG CGGCGCCGAG GTCATCTCGC TGGCCGCGGC CGTGGCCGCC
AGGATCGCCG CGGGACAGAC CGGTGCCCAC CGGCCTGCCG GCGTCGACAG TTCGGCTCTG
GCGCGCCGAC CGCCCCGGCC ACCCGGGCTC AGCGGCGTCC CCGTCACGCC CGGCGCGCTG
CCCCCGCGAG CGGTGGACAA GGTACTCGAG GCCTGCCGCG TCTCGGACGA GCGCATCACC
GACGCGGCGC AGGGCCCGGT TCTGGTGACC GCCCTGGCCC AGATCGTGGC CGTGGTCGTC
GCCTGGGCCA CATCGACCCG CCGGCTGCCA CGGCCCCTGC GGCCGGTCGC GTTCCTCGCC
CGGACCGTCA CCCGTCTCGC CTTCGAGATG ATCCGGGATG TCACGCACGG CCGGCGGCGG
ATGACGATCG CGGTGGGAAC AGCGCTGGTC GGCCTCGGCA CGGCCGGCGG GCTGACCGGT
TCGGGCATCG TCGGCGGCCT GGGGATCGTC GTCGGCCTGA TCGGCCTGCT GATGATCGGA
CTTACCGGCT GGCGGCACCT GCCCGGAGGA CTGGCGGTCG TGAGAGCGGG GCTCATCGCG
GTGTTCGCGG CGGCCGGTGT CGTTCCGGTG ATCCATGACC GGCTCTTCCC CTGGCTACAC
GACGATGTCG TGCCCTACCT GGCCGATCAT CCGTGGGCAT GGGCGACCGT CTTCGGAGCC
CTGGTGCTGC CGGCTCTCTG GTCCGTCGCC GAGGCCCTCA CCACCCGCCG GGCCCGCCGC
AACGGCTGA
 
Protein sequence
MRDPATQEVR LAVVMTGGAS LAVWMGGVAT EINLATSPDR DRADDADAAV AARYARLATI 
LDVEVSVDVL AGTSAGGINA AMLGYANTHH ADLTPLRDLW LSLGSFDALM RTPHEKTYPS
LLHGDTAVLP ALHTALTAVG ETARAGMDRA GMDRAGMDRA GMDRAGMDRA GMDRAGMDRP
TSVFITTTIL RGEVAEHADS LGGTMYDVDH RGLFAFGTQD LTDPGAVPRL ALAARASSAF
PGAFEPAYVP VGHAVDAAHP DMARYVNASH GFHASDGGIL VNRPIGPALA AIFDRPAERQ
VRRVLAYVVP SPRMPVPTNP AAPTNPAAPT RTSTPPALPV PPTVIPTLLQ VLGAALNQSI
GTDLATLRDH NSAVRGTRAN RRRLLRLAPA GGPRLADESV YDAYRRSLAE EIAPPVIEAL
LRVLGGRFDL PRPPDTEALR TDAAGPARMT NAAVNAVIQM LPDRLPTTAD LADLWRLGRP
ALDAAKGLLI NMINEGYVLS PEPADRIRLA RLAAAVHGAS HATIRNTDTT DTTATTATAA
LRPGVFTTVS ETLGAMAGAP LLDVIAEAAR RWLRTDPEGD AGQDELTRAW RSLERMIETL
RTELTDLVDR RSPAARPPAG PETRPEGLSV GQRRAVAART LADFVGYLPS TPADALIAVL
DVHLVERSTG AAVLDQPVEL VQISADLPNR LDPARALAEE KVTGLQLGNF GAFAKSSWRA
NDWMWGRLDG AGWLVRIMLD PRRLVIRRDT AVPAGHAHAA ALARRGWLVD LVDDLTEVAG
IPAPREVLDE LGFLTDPDAP VPPNLPVTAT WVAAGIQRDI AAKELVGVAE AVRRDNKAGV
DPRPTADFLA AVDRALTVEP SRPGTTTGPG ASAVPEGTGS PNLDEPRGAE VISLAAAVAA
RIAAGQTGAH RPAGVDSSAL ARRPPRPPGL SGVPVTPGAL PPRAVDKVLE ACRVSDERIT
DAAQGPVLVT ALAQIVAVVV AWATSTRRLP RPLRPVAFLA RTVTRLAFEM IRDVTHGRRR
MTIAVGTALV GLGTAGGLTG SGIVGGLGIV VGLIGLLMIG LTGWRHLPGG LAVVRAGLIA
VFAAAGVVPV IHDRLFPWLH DDVVPYLADH PWAWATVFGA LVLPALWSVA EALTTRRARR
NG