Gene Franean1_5583 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5583 
Symbol 
ID5673911 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6767268 
End bp6769892 
Gene Length2625 bp 
Protein Length874 aa 
Translation table11 
GC content76% 
IMG OID641244437 
Productputative helicase 
Protein accessionYP_001509841 
Protein GI158317333 
COG category[R] General function prediction only 
COG ID[COG3973] Superfamily I DNA and RNA helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGACGA CACGTGACGG AGAGCTCGCC CGTGAGCAGG CCTACGTCGA CACCCTCTAC 
GGCCGGTTGG ACGAGGTCCG GGAGACCACC AAAAGGCAAC TCCGACAGGT GCTGCTCGAG
GCTGGCACCG GCACGCCGCA GTCGATCGTG GAACGCGACG TGTTCGCCGC GACGCATGCC
GACCGGCTGG CCCGGCTCGA CGCCGCCGAG GGGCGGCTGT GCTTCGGCGC GATGGACCAC
GCGGCCGGGG GGCGCACCTA CATCGGCCGG ATCGGGCTGT CCGACCAGGA GCAGGAGCCG
ATCCTGGTCG ACTGGCGGGC GCCCGTGGCC ACCGCGTTCT ACCAGGCGAC CATCGCCGAT
CCGCGTGGCC TGACCCGCCG CCGCCACCTG CGCACCCGCG GCCGGCGGGT CACCGGCCTC
GCCGACGACC CGCTCGACCC CCGCGCCTAT CTCGCGCAGG CCGGCGCCGC CGACGGGGCT
TCGGGGGCCG CCGAGGGCGA TGCGGCCCCC GAAGCCGGCG CGGGCGCCGA AGCCGGGTTC
GGCGCGACCG GGGACACGAT GCTCCTCGAG GCGCTGTCCG CCCCGCGCAC GGGCCGGATG
CACGACATCG TCTCGACGCT GCAGGCCGAA CAGGACAGGA TCATCCGGGC GGCGGCGAAC
GCGGTGCTGG TGGTCGACGG TGGGCCGGGC ACCGGCAAGA CCGCCGTCGC GCTGCACCGT
GCCGCATATC TCCTCTACAC CGATCGCGAC CGGCTCGTCC GGTCCGGGGT GCTCGTGGTC
GGCCCCAGCC CCGTCTTCCT CCGCTACATC GAGCAGGTCC TGCCCTCCCT CGGCGAGACC
GGAGTGGTGT TCGCGACCCC GGGACGGCTC TTCCCCGGCG TGGACGCGAC CGGCGAGGAC
CCGGTCGCCG CCGCGTCCCT CAAGGGCGAC GCACGGATGG CCGACGTCAT CGCCGGCGCG
GTCCGGGACC GCCAGCGGGC ACCCGGCCGA GGGGTGCGGA TCCGCCACGA CGAGCACGAC
CTGCACCTGG ACCGCGACAC CATCGTCCGG GCCAGGACTC GGGCCCGCCG CAGCCGCCGC
CCGCACAACT CCGCGCGCCG CGTGTTCATC CGCGAGCTGC TCGGCGCGTT GACGAACCAG
GTGGTCTCCC GGTTGCCCGG TGGCCTCTTC GAGCCCGAGG AGCGCTCGGA GATCACTTCA
GATCTGTGGG CGGACCCCGG GGTGCGGCGG GCCCTGAACG ACCTGTGGCC GCTGCTCACC
CCGGCCCGCC TGCTGGCCGA CCTGTACGCC TCGCAGGAGC TCCTCGCCCG GGCCGCCGGT
ACCCGGCTCA CCGCCGAGGA ACGCGCCCTG CTGCGGCGCG AGGGCCCGGC GGACGCGCCG
GCGCGGCGCT CCGGCGAAAC CCGCTCCGCC GAAACCGGCC GGATCCGCTG GACTCCGGCG
GACGTCGCAC TGCTGGACGA GGCCGCGGAG CTGCTCGGCG ACCCGGAGGA GGACGCCCGC
CTCGACGAGG CGCGCCGCGC CGCCGCCGAG CGCGCGGCCG AGCGGGAGTA CGCCCGCGGC
GTGCTGGAGA TGCTCGGCCT CGACGACCGG CTCGACGCCG ACACGGTCGC CGAGCGCTGG
ACCGCACCCC GCCAGCGCCG CGGCGCCGCC GAGTACGCGA CCGGGGACCG GACCTGGACG
TTCGGCCACC TCATCGTCGA CGAGGCCCAG GAGGTCTCGC CGATGCTGTG GCGCCTGCTT
TGGCGGCGCT GCCCGGGGCG CACCGCGACG CTCGTCGGGG ATCTCGCCCA GGCGGCCCGC
CCGCGGCCGC CGGAGAGCTG GGGCGAACTG CTGGGTCCGG CTGTCGGTGC CCACTACACC
GTCGAGCGGC TCACCGTGAA CTACCGGACC CCCAGCGAGA TCATGGATGT CGCGGCCGAC
GTGCTGACGG CGGCCGATCC CACGGCCGCG CCGCCCCTGT CGGTCCGCTC CGCCGGCCGG
CGCCCGGACG CCGTCCGGAT ACCATCCGCC GACGAGGCGC TCCTGCGGGG CGTGGTCGAC
GAGTCCGTGC GGGCCGCGGG CGAGGCGGCC GGGGGGCGGG TCGCCGTGAT CTGCCCTCCG
GGGCGGACCG CGGCGATCCG GGCCGCCCTG CGCGCTGCCG TGCCCCAGCT CGCGCTGCCG
AGGCAGCCCG ACGACCCGGA CGACCGCCCG ACCGGGACCC GGGAGCCGGC GGAGGCGGAC
CTGCTGGACG CGCCCGTCGC CGTCCTCACC GTCGCGGAGT CCAAGGGCCT CGAGTTCGAC
GCCGTGGTCC TCGTCGAACC GGCCGAGATC CTGGCCGGCC CGACCCGGGG CCTGGCGGAT
CTCTACGTGG CGCTGACCCG GGCCACCCGC GTGCTCACCG TGGTGCACAC CGGGGAGCTC
CCGGGCGTCC TGCATCGGAT GCCGGTGCGC GGTGGCCCGG CGACACCCGA GGCCGGCGCC
GCCCCGAGGG CGACCAGTCG CACCGAAGCC GAGGCCCGCA CCGCCCCCGC CCACACCGAC
ATCGATGACA CCGACACCGA TGACACCGAC ACCGGCGATG CCGACACCGA TGCCGGTCTC
GGTGACCCGG TGGTCGGAGG CCGCCAGCTC AACCTGCTGT GGTGA
 
Protein sequence
MPTTRDGELA REQAYVDTLY GRLDEVRETT KRQLRQVLLE AGTGTPQSIV ERDVFAATHA 
DRLARLDAAE GRLCFGAMDH AAGGRTYIGR IGLSDQEQEP ILVDWRAPVA TAFYQATIAD
PRGLTRRRHL RTRGRRVTGL ADDPLDPRAY LAQAGAADGA SGAAEGDAAP EAGAGAEAGF
GATGDTMLLE ALSAPRTGRM HDIVSTLQAE QDRIIRAAAN AVLVVDGGPG TGKTAVALHR
AAYLLYTDRD RLVRSGVLVV GPSPVFLRYI EQVLPSLGET GVVFATPGRL FPGVDATGED
PVAAASLKGD ARMADVIAGA VRDRQRAPGR GVRIRHDEHD LHLDRDTIVR ARTRARRSRR
PHNSARRVFI RELLGALTNQ VVSRLPGGLF EPEERSEITS DLWADPGVRR ALNDLWPLLT
PARLLADLYA SQELLARAAG TRLTAEERAL LRREGPADAP ARRSGETRSA ETGRIRWTPA
DVALLDEAAE LLGDPEEDAR LDEARRAAAE RAAEREYARG VLEMLGLDDR LDADTVAERW
TAPRQRRGAA EYATGDRTWT FGHLIVDEAQ EVSPMLWRLL WRRCPGRTAT LVGDLAQAAR
PRPPESWGEL LGPAVGAHYT VERLTVNYRT PSEIMDVAAD VLTAADPTAA PPLSVRSAGR
RPDAVRIPSA DEALLRGVVD ESVRAAGEAA GGRVAVICPP GRTAAIRAAL RAAVPQLALP
RQPDDPDDRP TGTREPAEAD LLDAPVAVLT VAESKGLEFD AVVLVEPAEI LAGPTRGLAD
LYVALTRATR VLTVVHTGEL PGVLHRMPVR GGPATPEAGA APRATSRTEA EARTAPAHTD
IDDTDTDDTD TGDADTDAGL GDPVVGGRQL NLLW