Gene Franean1_7063 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_7063 
Symbol 
ID5675373 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8618727 
End bp8620721 
Gene Length1995 bp 
Protein Length664 aa 
Translation table11 
GC content65% 
IMG OID641245908 
Productrestriction endonuclease 
Protein accessionYP_001511299 
Protein GI158318791 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCAGT CAGGTGGCGG TCAGTCGGAA TGGCAGCAGC GACGCGCCGC GGCTGCAGGT 
GCTGAGCGGG AGGCTGCGCG ACAACGTAAA CGCGAAGCGA CCGCGCGAGC AAAGCAGGAC
GAGCGCGAGC GGGAGCAGGC CCAGCGCCAG CAGTCGGTGG ACGACGACAA CGCTGCCGCA
GCCGCTCACA TCGCAGAGCT GGGCAGGAAC CTGCTACTCA ACGTCTTGGG CCTCCCTGCC
TTCACCGTGG CCGCGCTGGA GGTAGTACCG GAGCAGTCGA TCTTTCAGCC GGGATCGCTG
GCAGTCGCTG GTAGGGCGCC GGATTGGGAG CAATACAAGC CTCCGGCTCC TGGTCGGGCC
AGCCGGCTGG TGGGCGGGTC GGGTCGATAC CAGCGGGAGT TCGACGCGGC CCGAAGCAAG
TGGGAAGCCG ACACGGCCGA GTTCCAGCGG GTCGAGACCG ACCGGCTACG ACAACTCGCT
GCGGCTCGTA CCAGACACGG CCAGCAGGTG GCCGCGGCCC TCGACCGTGC CGCGGCGCAC
AACGCCCGCA TCGCGGCACA GTGGGCGGCG TGTCTGGACG GTGACCCGGA AGCGGTCGAG
TGGTTCGTCG GCCAAGCGCT CGCGGCCACG TCCTATCCTG ACGGCTTCCC CGTAGCACGC
AAGGTCGCCT ACCGGCCCCA GGAACACGAC ATCGTCATCG AGATCGAGTT CCCACGACGA
TCCGTGATCC CCGAAATCAG GGGATACAAG CTATTCAAGA CCGCACCCGA GGTCAGGCCG
GTAAAGTGGA AAGAGTCTGA AGTCAAGAAG CTATATGCGC AGCTCGTTGC CTGGATAACG
TTGCGGATAG TGCACGAGGT TTTCGAGGCT ACGAAGGCGC TTGACCTCAT CGAGGTTGTT
GTCTTCAACG GGACCGTCAT CGATGTAGCA CCCACGACCG GCAAAGATAC TCTCTACCAT
CTGGTAAGTC TCGAACCCGA ACGGTCCCTG TTCGAGGCGA GCCTTGAACT CGACCGGGTC
ACCGATCCGA TCGGATGCCT GCGCGAGCTG GGCGCGAAAG TCTCACCCAA CCCTTACGAC
CTAGAAGCCG TGAAACCCGT TGTCACGTTC GATCTCCGCC GATTCAGGCT CGCCAACGAT
GCGGCCGAAC TAGCCGATCT AGACTCCCGA CCGAACCTCA TGGAACTCAC CCCGAGCGAG
TTCGAGAAAC TCATCGAAAA ACTTCTCAAG GCCATGGGTA TCGAAGCGTA TCGCACCATC
GACTCCCGCG ACGATGGCAT AGACGTCGTC GCGACTAAAG ACGACATCAT CTTTGGTGGT
GTCTGCCTGG TACAGGCTAA ACGTACCAAG AACCGAGTCG AGCTCGGGAC GGTTCAAGCG
GTCGCTGGCT CCATGAACGA CCACAACGCC GCTACCCCCG ATGTTCAGCA GCAGTCGGTA
CAGGCCCGTG GCCGGTCGAA CAGCCGGCAG CGCCTCGGTG GCGGCCAGGG CATCAGCCGA
GGCGGGAATG CGCAGACCGT CAGCGTCCAG ATCGCCGAAG TAGGCGATCG AGGTGATGCT
GGGCAGCTCT GCGACGGTAA CGGCGGTCAC CGCCTCGTCG AGGCCTTCCT GCGCGCGACG
CTGGTCAGCC TGAGCCTCCA GCAACAGCTG CTGCCCATGG GCGACATCGC GAGGCCGCCG
GTCGACGGGA AGTGCGGTCC ACACCTTGCG GTCGCGGTCG CCGGGGCTCG CCTGCTCCAG
CTGAGTGCGC AGCGCCGAGG TCCGCTCGAG GGCAGTGTTG TGGACGGCGG CGTTCCGGTC
GCGGTCACGC CGAGCCGGGC TACGGCGCTG GCCCGGCCCT CGCGGTCAGC GCCCTCGGTG
CTGGCGAGCA ACCACTCGGC ACGGGCCACG ATCTCCGCCG GTCGGCCAGC GACCGCGGCG
CGGGCACGGG CTGCCACGGT CTCGGCTGAC CTGGCCTCCA ATCGAAGATC TTGCCCCACC
TCTACCGCCA GGTAG
 
Protein sequence
MAQSGGGQSE WQQRRAAAAG AEREAARQRK REATARAKQD EREREQAQRQ QSVDDDNAAA 
AAHIAELGRN LLLNVLGLPA FTVAALEVVP EQSIFQPGSL AVAGRAPDWE QYKPPAPGRA
SRLVGGSGRY QREFDAARSK WEADTAEFQR VETDRLRQLA AARTRHGQQV AAALDRAAAH
NARIAAQWAA CLDGDPEAVE WFVGQALAAT SYPDGFPVAR KVAYRPQEHD IVIEIEFPRR
SVIPEIRGYK LFKTAPEVRP VKWKESEVKK LYAQLVAWIT LRIVHEVFEA TKALDLIEVV
VFNGTVIDVA PTTGKDTLYH LVSLEPERSL FEASLELDRV TDPIGCLREL GAKVSPNPYD
LEAVKPVVTF DLRRFRLAND AAELADLDSR PNLMELTPSE FEKLIEKLLK AMGIEAYRTI
DSRDDGIDVV ATKDDIIFGG VCLVQAKRTK NRVELGTVQA VAGSMNDHNA ATPDVQQQSV
QARGRSNSRQ RLGGGQGISR GGNAQTVSVQ IAEVGDRGDA GQLCDGNGGH RLVEAFLRAT
LVSLSLQQQL LPMGDIARPP VDGKCGPHLA VAVAGARLLQ LSAQRRGPLE GSVVDGGVPV
AVTPSRATAL ARPSRSAPSV LASNHSARAT ISAGRPATAA RARAATVSAD LASNRRSCPT
STAR