Gene Franean1_1999 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1999 
Symbol 
ID5670400 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2402593 
End bp2405163 
Gene Length2571 bp 
Protein Length856 aa 
Translation table11 
GC content77% 
IMG OID641240920 
Producthypothetical protein 
Protein accessionYP_001506342 
Protein GI158313834 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0638496 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGGAC GGCGGGCGGC GGCCCTGCGG AAACCGCGCG GGTCGGCCGA CAGCTACCTC 
GACTTCGTCC AGGGCCTGGA CGAAGCGGCC CTCACCGCCG TCCTGCGGGC CCGCCAGGAC
GTCCTCGAGG ACCCGCCGCG CGGAGTCGGT GAGCTCGTGC GCCGGCTCGG GGACGCGGAC
TCGATGCTCG CCGCCGTCAA CGACCTGGAC CGCGACGGGC TCCTGCTGTG CGACGCGGTC
ATGATGCTGG GCCCACCCGT CCCGCTGGAC CGGCTGGTGA TCCTGCTCGG CGGGTCCAGG
GACGGGATCC GCGCGGCGCT GCGCCCGGTC ACCCGGCGGG CGCTGCTCTG GGAGAGCGAC
GGCGTCGTCC ACGCCTTCGA GCCGTTCCGG CGGCTGTGGG ACGACGAGGT CGCGGTGTGG
CACCCGGCCG CGGAGCTCAT CCCGGCCATC TCCGTCACCG ACCTGCGCCG TGCCGCCCGC
GGGCTCGTTC CCGGGGCCCG GGTCACCTCG TCCACGACCT GGGAGCGGTC CGCCCGCGTC
GTCGGGGAGC TGATGGCCGA CCCGGACGGC GTCGTCGCCG CGGTGCAACG GCTGCCCCGG
CCGGCCCGCG ACCTGCTGGC CGAACTGGTC CGCGACCCGA CGGGTCTGGC GACCGGCGAC
GAGCCCGACG ACGCCCTCGG TACAGGTACA GGTGCCTTCG GTACGAACGG CTTCGATGGC
GGGGCCCGTG GGCGGCTCCC GGCCGGGTCG AACCCGGAGG AAGCGGCCGG GATACTCGTC
GAGTCCGGGC TGCTCCTGCT CGTCGACGGT GAGATCGAGG TGCCCCGGGA GGTCGTGACT
GTCCTGTGGG CGGCAGACCC GGAGGTCCGG CTGACCGGGC CGCCGAGCCT CGGCCCGGCG
GGCGCGGACC GCGCGGACGA GACGTTCGCC GCCGGGATCC AGGCCGCCGC CGGGCACGCC
CTGCGGTCGG TGGCGTCGCT GCTCGCCGAG GCGGAGCGGG CGCCGCTCGC CGCGCTGAAG
AAGGGCGGCA TCGGCACCCG GGAACGCGCC CGGCTGGCCC GGACCCTCGC GCTCACGGAG
GATGAGCTCC CGCTCTGGAT CGACGTCGCC CACGCCGCGG GGCTGCTGGC CCGCGAGGAG
GGCGGCTACG CGCCGACCGG CGAGTACCCC GGCTGGCGCG GGTCGGACGT CGGACGTCGC
TGGGCGGTGC TCGCCCTGGC CTGGTTCCTC CTCGACCAGT CCCCGACACA CCGTGAGATC
GACAGCGGCC GGGACCAGCC GCCGCCGCTC CCGCTCGCCT CCGGCGCGGG CCGGGTGCGC
CGGACGCTGC TCACCACGGC CAGCCCCGGC CGGTCGCTGG CCGCCGCGGG CAGCCACCTC
GAGTGGTTCC TGCCGCTGCA CGGCTACGAC CGGCCCCAGC TGGCGAACAA GCGCCGGGCC
GCCGTCCGGG AGGCCGAGCT GCTCGGCGTC GCCGCCGGGG ACACCGTCTC CGATCTCGGT
GTCGCGGTCT GCGAGGTCCT CGCGGCGGCG CCGCTCGACC CGACCGGGCA GATGCGGGCC
CACCAGTACG GCTCCCCGAC GCCCACGCTG GCCGGGTTCG TCACCGGAAC CCTCGGGCCG
CTGGTGGACG AGCTGGCCCG CCGGTGCGGC CCCGCCCTGC CGCGCGACGC TTCCACCATG
ATCCTGCAGT CCGATCTGAC CGCGACCGTG GCCGGCCAGC CCAACCACGC GATCGCCCGC
CTGCTCGCCG ACGCGGCCGT GTCGGAGTCG CGCGGCGCCG CCGGGACGTG GCGGTTCACC
TCCGCGAGCA TCCGCGGCGC GCTGGACGTC GGCTGGAGCG CGCAGGAGCT GCTCGACGAG
CTGCGCGCCG GCGCGGCGCA CGAGGTGCCT CAAGGGCTCG AGTACCTCGT GGCGGACGTC
GCCCGCCGCC ACGGCCACAT CCGCGCGCGG GCGGTGCGGG CCGCGGTCGT CGCGGACGAG
GCGACGATCA ACGAGATCCT GGCGACCCGG GCGCTGCGGT CGCTGGACCT GCGGCGACTG
GCACCGACCG TCGCGGCGTG CGGCGCGGAC GCCGAGGACG TGGCCGAACG CCTGCGCGCC
GCGGGCATGG CTCCGGTGGT CGAGGACGAG CACGGCACCG TCGTCATCCG CCGGCGGTCG
GGCGGCCAGC CGGCCTCGGG TCGCGACGCA GCCGCGGGGG CGGCTTCGGA GCGTGGCAGC
GCGGGGAGTG CGGGCGAGTC CGGAACGTGG GCGGCTCCGG CGGATGTGGC GCGGCGCATC
CTCGCGAACC GGGCCGGCGC CGGCGACGCC GCCCGGGCCG ACCCGCGCGT CGTACGCCAG
CTCGGCCGGC TGAACCCGCG GCTCACCCTC GCCGAGCGCC AGCTCCTCGC CGCCGCGCTC
AACCACGGCG AGATCGTGAC GATCATCTAC CGGGACCGGT TCGAACGGCG GACCAGCCGT
CTGATCGCCC CGGTGGAGCT GCTCGGCGGG CGTCTCGACT CCTGGTGCCA CCTCCGCGAG
GCGCGGCGCG AGTTCGCCGT CGCCCGCATC GAGGGGGTCA CACCCGGCTG A
 
Protein sequence
MAGRRAAALR KPRGSADSYL DFVQGLDEAA LTAVLRARQD VLEDPPRGVG ELVRRLGDAD 
SMLAAVNDLD RDGLLLCDAV MMLGPPVPLD RLVILLGGSR DGIRAALRPV TRRALLWESD
GVVHAFEPFR RLWDDEVAVW HPAAELIPAI SVTDLRRAAR GLVPGARVTS STTWERSARV
VGELMADPDG VVAAVQRLPR PARDLLAELV RDPTGLATGD EPDDALGTGT GAFGTNGFDG
GARGRLPAGS NPEEAAGILV ESGLLLLVDG EIEVPREVVT VLWAADPEVR LTGPPSLGPA
GADRADETFA AGIQAAAGHA LRSVASLLAE AERAPLAALK KGGIGTRERA RLARTLALTE
DELPLWIDVA HAAGLLAREE GGYAPTGEYP GWRGSDVGRR WAVLALAWFL LDQSPTHREI
DSGRDQPPPL PLASGAGRVR RTLLTTASPG RSLAAAGSHL EWFLPLHGYD RPQLANKRRA
AVREAELLGV AAGDTVSDLG VAVCEVLAAA PLDPTGQMRA HQYGSPTPTL AGFVTGTLGP
LVDELARRCG PALPRDASTM ILQSDLTATV AGQPNHAIAR LLADAAVSES RGAAGTWRFT
SASIRGALDV GWSAQELLDE LRAGAAHEVP QGLEYLVADV ARRHGHIRAR AVRAAVVADE
ATINEILATR ALRSLDLRRL APTVAACGAD AEDVAERLRA AGMAPVVEDE HGTVVIRRRS
GGQPASGRDA AAGAASERGS AGSAGESGTW AAPADVARRI LANRAGAGDA ARADPRVVRQ
LGRLNPRLTL AERQLLAAAL NHGEIVTIIY RDRFERRTSR LIAPVELLGG RLDSWCHLRE
ARREFAVARI EGVTPG