Gene Franean1_1551 details


Gene Information

Locus tag: Franean1_1551
Symbol:
ID: 5669954
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1854025
End bp: 1857204
Gene Length: 3180 bp
Protein Length: 1059 aa
Translation table: 11
GC content: 78%
IMG OID: 641240470
Product: zinc finger SWIM domain-containing protein
Protein accession: YP_001505896
Protein GI: 158313388
COG category:
COG ID:
TIGRFAM ID:
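The coordinates and lengths in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 1854025, 1857204

length_bp = gene_length_bp(start_bp, end_bp)
length_aa = protein_length_aa(length_bp)

print(length_bp)  # 3180
print(length_aa)  # 1059
```

Both results match the reported Gene Length (3180 bp) and Protein Length (1059 aa).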


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.712758
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTGGCCGC AGGCCTCGCT GTGCCTTCGA ACGCCGGTTC GAATGTGGGT GCGGGGCCGC 
GAGGACGGAA CGGAACTGTC GGACCCCGTA GGCAGAATCG CCGTCGTGTC GAACACCACC
AGTCCGCCCG TCGAGCGCGG GCAGGCGCTC GCCCTGGCGC CGGATCAGGC GTCCGTGAAG
GCGGGCGAGA GGCTCGCGGT CGCCGGGTCC TGGCCGCTGG CGGGCGGCGA CGCCGAGGCG
CTGTGGGGTG AGTGCAGGGG CAGCGGAAAG TCGCCCTACC GGGTCGTCGT GGCCCTCGCG
GATCATGCCT CGAAGTGCTC GTGCCCGAGC CGGAAGTTCC CCTGCAAGCA CGCCCTCGGC
CTGATGCTGC TCGCCGCGGC CGGCGGCGTG GCCGGTGCCG GCGGCGGGCG TCCGCCGTGG
GCCGCGGAAT GGCTCGACCG GCGGGTGGCG CGCTTCGCCG CGGCGGCGAG CGCGGGCACG
CGGGCCGCCA CCGGGCCTGA GTCCGAGCGC TCCGCGGCGG GGGCGGCCCG GCGCGAGGCG
AGCCGTGCGG CCAAGGTCGA CGCCGGTGTC GCGGAGCTGG CCCGCTGGCT GTCCGACCTG
GCCGGGGAGG GGCTCGGAGC CGCCCAGGCC AGGCCGGCCG ACTGGTGGCG GGCCGCCGCC
GCCCGCATGG TCGACGCGCA GGCACCCGGC CTCGCCGAGA TGATCCACGA GGCCGCGCAG
ATCGCCGGGT CGGCGGCCCG GCGGCCTGAC TGGCCGTCCC GGCTGGTCGA CCGTGTCGGG
CTGCTGCACC TGCTGTGCGA GGGCTGGGCC CGCCGCGCCG ACCTGCCCGC CGACGTCGTG
GCGGTCCTGC GCGACCGGAT CGGTTTCACC GTTCCCGTGG CCACTGTGCT CGCCGGCGAG
CACATCGTCG GCGAGGTCGA CGTGCTGGGG GCGCACGAGT TCGGCGCGGG GCGGGCGCGG
GGGCGCCGGC AGTGGCTACG TCTGGTCGAG TCGGGGCGGC TGGCGGTGCT CGTCGACTTC
GCGGTGAACG GCCAGGGGAT GCCACCGCCG CTGCCCCGGG GCTCGCGGGT CAGGGCCGAT
CTCGCGCCGT ATCCCGGGCG CCGCCCGGCG CGGGTGGCGC CCGGCGCGCC GGGCGGCGCC
GCGGCCGTCG TCGACCCCGT CGGCACAGAT CTCTTCGCGC CCACCTGGCG GGCCGCGCTG
GCCTGCGTCG CGTCGGCCCT GGCGGTCGAC CCGTTCGCGG CCGTCGTCGC GCTGACCGTC
CGCGGTGTCA CGGTGCTGCC GCCGGGCGCC GGGGCCGGTC CGCTGCCCAG GAGCGGTCCG
TGGCTGCTGC GGGACTCCGC CGGCGAGGCG TTGCCGCTCG CCGACGAGGC CGTGGCGCAG
TGGGGCTGGC ACATGCTCGC GGCCGGCGGG GGAGCGGGCC TCGACGTGGT CGCCGAATGG
GACTCGTTCA CCCTGACACC CCTGGCCGCC ACGCCGTCGC GCACCGCGCG GGCGGAGGAC
ATGCCCGGCC GGGCCCCCGT CCCGGTAGTC GCGGCACCGG TCGCCCCGGT GCCGACGCCG
GGGTGGACGG ACCTCGTCGA CGCCGCGGTG ATCGGCGTGA GCCGCCGCCC GTCGCCGGTG
ATTCCCGGGC TCCCTGATCC CGGCGGGCGT CCGGGCGAGG AGGAACGCCT GCTCCGGGTC
GCGGCGCTGG CCGCGGTGAC CAGGCGGGCC GGTCAGCTGG CCGCCGATGC CTCCGCGGTG
CCCGCGCCGC CTCCCGCCGC GGCGGATGAG CACCCACCGT GCCCCCCGAC GGCGGTCGTG
CCCGTTCCCA CCGAGGTGGG CGCCGCCGAG ACAGCCGAGC TCGAGGAGTG GCTGGACCTG
CTGGCCGAAG GAGGCTGGCG CCCGCCGGAC ACCCTGCTTC CCGCGCTGGT CGAGCTCGGC
CGGCGGTCGA CCGAGCTGCG CCCCCGCCTG CTACGGGTCC TCGGCCCGCG CGGCCGCTGG
TTCGCCGCGC TCCACCCGGA CGCCGGCTGG GCGGGGCCCG TCGGCGTCGC GGGCTGGCCG
AGCGCGAGCG CGCAGCAGCG CCGATCCCTC ATCGCCGCGC TGCGGCACAC CGACCCGGCG
GCCGCCGCCG AGCTGCTGCG CGGCGGCGCG GGCGAACCTC CCTTCGTCCC GTTCCGCCGG
GCCGGCGGAG CCGAGCGCCT CGCCTTCGTC CAGGCGCTGC GCACCGGCCT CGGCCCGTAC
GACGAGGAGC TGTTGGAGGC AGCCCTGGAC GACCGCCGGT CCGACGTGCG CGACGCGGCG
GTCCAGGTTC TGCTGGAACT TCCCGGTACC CGTTTCGCGG CGCGGGCCGC CGCCCGCACC
GAGCGGGCCT TCACGGTGCA TCGCGGCACC CTGCGCGTGC ACCCGCCGGC GGTGCTCACC
GACGAGATGG CGCGGGACGG CGTCTCGCCT GACGGGCCGT CCCGCGCCAG GGCGCAGGCC
GACGGGCCGG CCCGGATGCT GCTCGCGGAG ATCGCCCGGG TGGACCCGCG GCTGTGGCCG
GAGCGGACCG GCCTGTCCCC GCAGAAGTTC CTGAGCGCCA GGGCGCTCTG GGAGCCCGCG
CCGAGCTTCC CCGCGCTGAC CCTGGCGCCC TACCTCGTCG GCCCGGTCGT CCGGCACCGT
GACCCGGAAT GGGCGCTCGC GCTGATCCCG AAGGTCGAAC CGCGCTCGCA GGGGGTGCTG
ATCGGCTGCC TGCCGGAGCC CGACCGGCTG CGCGCGTTCG ACCTCGCCCT GAGCGGGACC
GGCCATGCCA CGCAGCGCTG GGGCCTGCTC CTCGGCAGAT CCGGCGGGCA CGGCCTCGGC
AACACCGCCC TGGGGAACCT CATCACGCTT CTCGCCGGGA TACCCGGGCC CTGGTCGGCG
GAATTCACCC GGGCGGCCGG CCCCGCCATC ACGGCCATCG TCACCGCACC GGTCGACGCA
CCGGGCTCCG ATCCCCAGCA GCGGGCAACG GCCCGCACGC TGCACGCCCG GGTGCGGGCG
CTGCTCGCCA ACCTCGCCTG GCGGGTGGAG CCGTACGTCG GCATCCCCGA CCTGGACCCG
TCCACCATGC CGGCCGAGAC GGTGTCCGGC TACCAGCGCC TCGCGGCCGT GCTCGCCCAG
CGGCGAGCCC GCCGCGACAC CCTGCTCTCC CGTTCGAGCT CGAAGAAAGG CTCGTCATGA
 
Protein sequence
MWPQASLCLR TPVRMWVRGR EDGTELSDPV GRIAVVSNTT SPPVERGQAL ALAPDQASVK 
AGERLAVAGS WPLAGGDAEA LWGECRGSGK SPYRVVVALA DHASKCSCPS RKFPCKHALG
LMLLAAAGGV AGAGGGRPPW AAEWLDRRVA RFAAAASAGT RAATGPESER SAAGAARREA
SRAAKVDAGV AELARWLSDL AGEGLGAAQA RPADWWRAAA ARMVDAQAPG LAEMIHEAAQ
IAGSAARRPD WPSRLVDRVG LLHLLCEGWA RRADLPADVV AVLRDRIGFT VPVATVLAGE
HIVGEVDVLG AHEFGAGRAR GRRQWLRLVE SGRLAVLVDF AVNGQGMPPP LPRGSRVRAD
LAPYPGRRPA RVAPGAPGGA AAVVDPVGTD LFAPTWRAAL ACVASALAVD PFAAVVALTV
RGVTVLPPGA GAGPLPRSGP WLLRDSAGEA LPLADEAVAQ WGWHMLAAGG GAGLDVVAEW
DSFTLTPLAA TPSRTARAED MPGRAPVPVV AAPVAPVPTP GWTDLVDAAV IGVSRRPSPV
IPGLPDPGGR PGEEERLLRV AALAAVTRRA GQLAADASAV PAPPPAAADE HPPCPPTAVV
PVPTEVGAAE TAELEEWLDL LAEGGWRPPD TLLPALVELG RRSTELRPRL LRVLGPRGRW
FAALHPDAGW AGPVGVAGWP SASAQQRRSL IAALRHTDPA AAAELLRGGA GEPPFVPFRR
AGGAERLAFV QALRTGLGPY DEELLEAALD DRRSDVRDAA VQVLLELPGT RFAARAAART
ERAFTVHRGT LRVHPPAVLT DEMARDGVSP DGPSRARAQA DGPARMLLAE IARVDPRLWP
ERTGLSPQKF LSARALWEPA PSFPALTLAP YLVGPVVRHR DPEWALALIP KVEPRSQGVL
IGCLPEPDRL RAFDLALSGT GHATQRWGLL LGRSGGHGLG NTALGNLITL LAGIPGPWSA
EFTRAAGPAI TAIVTAPVDA PGSDPQQRAT ARTLHARVRA LLANLAWRVE PYVGIPDLDP
STMPAETVSG YQRLAAVLAQ RRARRDTLLS RSSSKKGSS
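The protein sequence can be related back to the gene sequence by translating it codon by codon. The following is a minimal sketch, not the method used to produce this record: it builds the codon-to-amino-acid mapping by hand (translation table 11 uses the same assignments as the standard code and differs in the permitted start codons), strips the 10-base spacing used above, and translates two lines copied from the gene sequence. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Codon table in the usual TCAG ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, ignoring any trailing partial codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence above (each line holds 60 bases,
# so every line starts in frame).
first_line = "ATGTGGCCGC AGGCCTCGCT GTGCCTTCGA ACGCCGGTTC GAATGTGGGT GCGGGGCCGC"
last_line = "CGGCGAGCCC GCCGCGACAC CCTGCTCTCC CGTTCGAGCT CGAAGAAAGG CTCGTCATGA"

print(translate(first_line))  # MWPQASLCLRTPVRMWVRGR
print(translate(last_line))   # RRARRDTLLSRSSSKKGSS*
```

The two outputs agree with the start and end of the protein sequence; the final `*` is the TGA stop codon, which is not part of the 1059-residue protein. Applying `gc_fraction` to the full gene sequence gives a value that can be compared with the GC content reported in the Gene Information section.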