Gene Franean1_5555 details


Gene Information

Locus tag: Franean1_5555
Symbol:
ID: 5673885
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 6727674
End bp: 6730538
Gene Length: 2865 bp
Protein Length: 954 aa
Translation table: 11
GC content: 71%
IMG OID: 641244411
Product: patatin
Protein accession: YP_001509815
Protein GI: 158317307
COG category:
COG ID:
TIGRFAM ID: [TIGR03607] patatin-related protein
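
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(6727674, 6730538)
print(length)                     # 2865
print(protein_length_aa(length))  # 954
```

Both results agree with the Gene Length and Protein Length fields listed in the record.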


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.744518
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCGCA GCGGCAACCA GCTCAATGTC GACCATGACG ACCTGGAGGA CATTCGAATC 
GCGGTCGTTC TCAATGGCGG GATAAGCCTC GCGGTGTGGA TGAGCGGGGT GGTCAACGAG
ATCAACAGCC TTACCCAGCG CCGTCCGAGT GATCCGGCCC CGGGTAAGGC GGATCTGCGC
TTTTCGGAAG CGGCGGCGGT CTACGGCGGA CTTCTTGATC TCGTCCACGG GAGGGCGCGG
GCAGATGTGA TCGCGGGGAC GTCGGCCGGT GGCATCAACG GGGCGTTGCT GGCGTATGCC
CAGGCCTACG GGGCCGATCT GCGCCCGCTC GGCGAGTTGT GGGCCGAGCT GGGCTCGTTC
GACGCGCTGC TGCGCGACCC GCGTGAGACG CACCCGGCGT CCCTGCTGCG CGGCGATGAC
TATTTCCTGC CCGAGCTGGT CAGTGCCTTC GAACGCATCG TGCCCGCGGG CGAACGATCC
CAGCGGTATG TGCCGGCGAG CGAGCGGCCG ATCGACCTGA TCATCAACAC GACGCTGATG
CGCGGCCAGC CGAAACAGCG CGTCGACGAC TTCGGGACGG AGATCATCGA ATCGGCCCAT
ACCGGAGCGC TGCGATTCAC CCGCGCTGCG GACGCGTCAC CCGGACTTGA CCCTTTCTGG
GACGCGAGGA TTACCCACCG GCTGGCGCTG GCGAGCCGGA GCACCGCGTC GTTTCCGGTC
GCCTTCGAAC CGAGTTTCAT CCCGGTGGGG GAAGCGGGGA GGGATTCCTA CCATCCGGAC
ATGGGCGCCG GCGCGGGCCT GCCGGCGGTG GCGCAGTTCG ACCGGAGCCG GTACGTCATG
GACGGCGGAG TGCTGCTGAA CCAGCCGGTG AAACCGGCGC TCGCCGCGAT CTACGCCCAG
CCGGCGGAGC AACAGGTACG GCGGCTCCTG GTGCATGTGA ACCCCGACCC GAGCAGTCCC
GCCCCGGCCG AGGTCGCCGG CCTCGGCGAC GTCTACAGCC TCGGCAGCGA TACCGACGAC
CGGCCGCCGA CGCCGGCCGC GGTGCTGCGG ACGCTGGCCA CGTTGCCCTA CGCGCAGTCG
GTCGACGCCG AGCTCACCGA GATCCAGACG ACGAACGACC GGGTCCGCCG CTACCGCCCT
GACCGCGCGA GGATCGTCCA CCACCTCAAC GAGAAGCTCG CCGAGAAGCT GATGGACGGC
TACCGGAGCA CGCGCTCGTA CCAGGAGGCC GATCGCATCG GTGCGCTGCT GGCGGCGGCT
GCCCCGCGGC CCCGGATCTG GTGGAGCCGT CCCGAGCTCG CAGCGGTGCT ACGGGGCGTC
GGCGCGCGCG CCGACGGCGT CTCCTACATA CCGCCGGACG ACCATCTGCC AGCCGCTGAC
GCCGACCCCG CGCGCTGGGC CTGGGGGGTC GAGCCGGTTG AGCGCATCGG CGCCCTAGCG
GTCGACGTCT TCAAGCGCGC CATGTGGCTC GCCGATCCGA CCGACCCCGC CGAAGTGACT
ACCCGCCGGC GGCTGCGCGG GCACCGCCGG CGCCTCCACG AAGCGCTGGC CGAGCTTCGT
GAATGCAGCC GTGACGACGA GGTGTTCTGG CGGAGCTGGG CCACCCGGCC GCCGGCTCCC
CCGGCGCGGG GCTCGGACGC GACAGAGCGT CAGGAGTGCC TTCGCTCCTG GGTGCAGGCC
GCGCTGTCAC GCTGGCCGCT GCCGCCGGGG GCCAAGGAGG CCCCGGCCGC CTCCCTACCT
CACCGGCTGG GCGCCGTCGC GGAGCGGATC GCGCGCGTCC TGGCCGAGGC GGGCGGCGAC
CTGCGGCATG TCGCCGCGCG CGCCCGGCCA GGCGGTCCGC CGGCTGGCCC GCCCGGCGTT
CGGCACGCGG CCGGTCCTCC CGACGCGGCT GGCGACTCCG ATGCGGCGGA CGAGGCCGAG
TTGCTGCACA ACCTGGTCGA GGCGTTGTTT CCCGACGGCG CGCAGCCGTC CCCGGCGTCG
TTGCTGCGCA GGCTGCTGGC GGTAGAGGTG TGCCAGATCG CGATCTCCGG CGCGCCGCCC
GAGGTGGAGC AGGAGGTGCT GCTCCAGCAG ATCAGCGGCT TCACCCCCAA CTCGTTCGGC
GGGCCCACCA CCCCGGCCAA GATCGCCGGA ACGGAACTGT TCTGGTTCGG CGGGTTCGCC
AGCAGGTCCT GGCGGGTCAA CGACTGGATC TGGGGCCGGC TGGACGGCGC CACGCGCATG
GTCCAGACCG TCCTCGACCC GGGCCGGCTG CGCCAGCTCG GCCTGTCGGC CAAGGACACT
CACGACCAGC TGCGCCGGCT CGCCGTCGGG GGCAGCTATC AGACCCGGTT GGGCGAGCAC
TTTGACGCGC AGAGCGAGGC GATCCTCGCG GAGCTGTCCT TCCTCGACGA TCCGGAGGCA
AGAGCAGAGC CCCTCGTGGC CACTGCGCTC GCGGTCGCCC GGCGGCTGCA CGCCGACATC
CTCACCGAGG AGCTGCCCCG CCTCGCCGTC GCGATCGAGG AGGATCGGAA GGAAGGCGGC
CTTCCCACCG CGGGGGCACG TTTCCTGCGG GTGTGGACGG CCACACCCGA CCCCGGCCTG
GACGAGCTCT TCGCCATGTT CGCCGACGCC GAGATCGCCA CGGAGTCATT CGCCGAGGCG
GTCTCAGCGG GGCTCCTCTG GAGATCGGCT GCCGTCGCCA CACGGTTTGC CGCGAGCCTG
TTCGCCCCGG TCCCGCTGGC CTACACCCCG ATGCTGCAGA CACTGGAACT GGCCTTCGAT
GTCGTCCGGA AGGAGGCCGG TAGCGTGCTG ATGCTGCCCG CCGCGGTGAT GAGATGGGCC
GGCCGGCAAA CCACCGCGAG GATCCGGGAA CACATCGGCG GATGA
 
Protein sequence
MSRSGNQLNV DHDDLEDIRI AVVLNGGISL AVWMSGVVNE INSLTQRRPS DPAPGKADLR 
FSEAAAVYGG LLDLVHGRAR ADVIAGTSAG GINGALLAYA QAYGADLRPL GELWAELGSF
DALLRDPRET HPASLLRGDD YFLPELVSAF ERIVPAGERS QRYVPASERP IDLIINTTLM
RGQPKQRVDD FGTEIIESAH TGALRFTRAA DASPGLDPFW DARITHRLAL ASRSTASFPV
AFEPSFIPVG EAGRDSYHPD MGAGAGLPAV AQFDRSRYVM DGGVLLNQPV KPALAAIYAQ
PAEQQVRRLL VHVNPDPSSP APAEVAGLGD VYSLGSDTDD RPPTPAAVLR TLATLPYAQS
VDAELTEIQT TNDRVRRYRP DRARIVHHLN EKLAEKLMDG YRSTRSYQEA DRIGALLAAA
APRPRIWWSR PELAAVLRGV GARADGVSYI PPDDHLPAAD ADPARWAWGV EPVERIGALA
VDVFKRAMWL ADPTDPAEVT TRRRLRGHRR RLHEALAELR ECSRDDEVFW RSWATRPPAP
PARGSDATER QECLRSWVQA ALSRWPLPPG AKEAPAASLP HRLGAVAERI ARVLAEAGGD
LRHVAARARP GGPPAGPPGV RHAAGPPDAA GDSDAADEAE LLHNLVEALF PDGAQPSPAS
LLRRLLAVEV CQIAISGAPP EVEQEVLLQQ ISGFTPNSFG GPTTPAKIAG TELFWFGGFA
SRSWRVNDWI WGRLDGATRM VQTVLDPGRL RQLGLSAKDT HDQLRRLAVG GSYQTRLGEH
FDAQSEAILA ELSFLDDPEA RAEPLVATAL AVARRLHADI LTEELPRLAV AIEEDRKEGG
LPTAGARFLR VWTATPDPGL DELFAMFADA EIATESFAEA VSAGLLWRSA AVATRFAASL
FAPVPLAYTP MLQTLELAFD VVRKEAGSVL MLPAAVMRWA GRQTTARIRE HIGG
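
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how the two listings relate, the snippet below strips that whitespace, computes a GC fraction, and translates codons up to the first stop. It assumes that translation table 11 assigns the same amino acids to the 64 codons as the standard genetic code (the tables differ in permitted start codons); all helper names are hypothetical. Only the first three blocks of the gene sequence are used as input here.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_blocks = "ATGAGCCGCA GCGGCAACCA GCTCAATGTC"
print(translate(first_blocks))    # MSRSGNQLNV
print(gc_fraction(first_blocks))  # 0.6
```

The translated residues match the first block of the protein sequence. Passing the full gene sequence to the same functions would give the whole-gene GC fraction and translation.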