Gene Franean1_1332 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1332 
Symbol 
ID5669743 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1601904 
End bp1605104 
Gene Length3201 bp 
Protein Length1066 aa 
Translation table11 
GC content71% 
IMG OID641240263 
Productlantibiotic dehydratase domain-containing protein 
Protein accessionYP_001505690 
Protein GI158313182 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.591859 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACATCC GGGGCGGCCC GCTGGCCTGT GCGCCCCTCG ACGCCGCGCT TTTCCGTGCT 
GCCGTTATGT CGGACGAGTC AGCGAAGGCC CTGGCCGCAG GTTCCTGGCC GGGCGCGAAC
TGCGACGACG TGGACCGCTG GTGCACCTGG CTGGCCGACG TCTGGGCGCA GCCGGCCGTC
GCAGAGGCGA TCGATACCGC GAGTCCGGCG CTGGCTGTCG GCGTGGAGGA CGCGTGCCGG
GGATCCAGAC CGTCCGTGCG GCGGGTCCGG CGGCTGGGGT TGGCGGTGGC GCGCTATCTG
GTGCGGATGA GCCGACGTGC AACGCCATTC GGGTTGTTCG CCGGTGTGGC CCCACTGCGC
TTCGACGTGG AGCCCGTACT GCGGTGGACT GGAAGTGATC GCCCCTACCC GCGTGTGGAC
TCGGTGTGGC TGGCCGCGGT CGTCGCTCGG CTGGAGTCCA TGAGCAGTGT CCGTGCGGCG
CTGACCGTCG TCGCCAACGA CTTGGCCATG GTGCGCGGCG AGCGGCTTGT CGTTCCGTGG
CGCCCGCATG CACACCCGCC GACGGAGACC TTCCTTCCTG CCGAGGTCTC GGTCCGCCAC
GGCCGACCGG TTCAGGCTGC CGTGGCGTCG GCAGCGGCCC CCGTCCGATT TGTAGATCTT
GTCGATTCGG TGGTGGTCGC GCTCGGTCTC TCTCGGCCTG GCGTTGAGGA GATGCTGGCC
CAACTGCTGA CGTGCGGAGT GCTCGTGAGC AGCCTGCGGC CACCTGCGAC GTGCACGGAC
GGCGTTGCCC ACCTTCTCGA TGCGCTGAAC GGCATCGAGA ACACGCGCGT ACCGGCCGCC
ACCGACCGGA CTGACGACAG CACCAGCGAT CCCTCACACG GCCTTGTCGG TGATCTTCGG
ATGATCCAGG CGGAGCTAGG GGCGGCACGC GATGCGGACA GCTCGTTGGG ACGTCCACGT
CTGGCCGCCC TTCGTGAGCG GATGCGGGCG GTATGCGCGG TGGTCGAGCA GCCGCTCGCG
GTCGACCTGC GGTTGGACTG CACGGCGGTG TTGCCGGAAC TCGTGGCGGT CGAGGCCGCT
GCAACGGCGG GAGCACTGTT GCGACTGACC CCGCACCCGG CTGGCAACCC GTCCTGGCGG
GACTACCACC ACAGGTTCCT TGCCAAGTTC GGTGAATCGG CGCTGGTCCG AGTCGATCAG
CTGGTTGACC CGGTGATCGG CCTGGGCTAC CCGGCGCACT TCGGCGTCGC GGAACCACCG
CCTACCGGCG GTGTATCGCC TCGGGACGAG TACTTGCTGC AGCTCGCCCA GCGCGCGGCG
TTCGACGGCC AGCAGGAGGT CGTCCTGGAC GAGGCCGCTC TTGACACCAT GTCGGCGCAG
ACGGCTGGGT CCTGGCCAGA GCCGCATGCG GAGATATGCG TCGAGGTGCT GGCACCGTCG
ATGGCGGCCC TGTCCCAGGG CGCGTTCACC CTGCTGGTCA CCGGTGTCAG CCGCACCGGG
GCAGCGATGA GTGGCCGCTT CCTCGATCTG CTCCCCGAGG CTGACCGGCT CAGGATGATC
GATGTCTTCC GTCGGTTACC GCCGACGGCG GAAGGCGCGG TGCCGGTGCA GCTGTCGTTC
CCGCCAGCGC ATGCCCGGCT GGAGAACGTC ACCCGCACCC CCCACGTCCT GGCGGACTGG
GTCTCACTCG CCGAGCACCG CGACAGCCAG CGCGGCAGGC TGCCGTGGAC GGACCTGGCG
GTCACCGCGA ATGACGAACA GCTCCACCTG GTTTCGCTGT CCCGCGGTGC CGTCGTCGAG
CCGTTGTTGA CGAATGCCGC CGCCCGGCAG ACCTTCCCGC CGCTCGCGCG GCTGTTGCAC
GAGCTGCCCC GTGCCAGCAG CGGGGCCGTA GCCCCGTTCT CGTGGGGCGC GGCGAGCTGT
CTGCCGTTCC TGCCGAGGGT TCGTCATGGC CGCGCGGTTC TGGCGCCGGC CCGATGGCGA
ATCAACCCAT CCGATCTTCC GGGTCCAGGC GCGGGAGACC AGGAATGGGC CGACGCGCTC
GCACGGTTGC GGGAGCGGTC AGGACTGCCC CGCTGGGTGA GCACCGGCCG GGCGGACGTA
CGGCTGCGCC TGGATCTCGA CGAGCGGATG GATCGCGATC TGCTGCGTGC CGAACTCGAC
CGCGCCGGCG CTCTGACCGC CCTCGAGGCT CCGGGCCCGG ACGACTATGG CTGGGCCGAC
GGCCGAGCCC ACGAGATCGT CGTCCCGGTG GCGACCACCG CCGCCCCGCG ACCCGCGCCG
GCCATCGTGA CGACGCCCGG CCCGATGCCA GTGAGCGACG CGACCTCCGG GATCCTGCCG
GGATCGAGGG TGCTCTTCGC GAAGCTGTAC TGCCATCCGG ACGTCGTCTC CACCATCCTC
ACCGACCATC TGCCCGCGCT GCTCGACGCC TGGGGCGTCC CTCCGCAGTG GTGGTTCATC
CGATACCGCG ACCCCGCCTC ACACCTACGC CTGCGACTGC ACGTCGCGCC CGACGCCGAC
GGCGTCACCG ACAGCTACGG ACGAGCGGCA GCTCGGGTCG GTGCATGGGC GGAGCAGCTG
CGGGCACGAC GTCTCATCGG CGATCTCGTG CTCGACACCT ACCACCCCGA GACAGCCCGC
TACGGCGACG GCGAAGCACT CGTCGCAGCC GAGCACCTGT TCGCCGCCGA CTCGGCCGCC
GTCGTCATCC AGTTGGCCGC CCAGAACGCG AACCGGGCAC TTCACCCTGC GGCGTTGACC
GCAGTGAGCA TGGTCGACCT GACGTGTGCG TTGCTCGGCG GCAGACAGAC GGGGATGCGG
TGGCTGCTGG AAAACCGGCG GGCCTCCGGA GCACCCACCC AGCGGCCGGT GCTGCGCCAG
GCCATCGCGC TGTCCCGCAC ATGTGATGAC GAGCCCGCGC CCAACGCCGC GATCCCGCAG
CTCCCGGCGC CGTTACGGGC CGTGTGGGGT GTGCGGCGCC GGGCGGCCGA GCGGTATGCG
GCCCAGCTCG CCGCGTTGCC GGGCCTGCCC GCCAGCAGCG AGGTACTGGG ATCACTGCTG
CACCTGCACT ACGTCCGCTC TCATGGCATC GACTCGTCAG CGGAGCGCAC CTGCCACCGT
CTCGCCCGCG CGGTCGCGCT CGCCTGGCAC AACACCGGCC AGCACCCCCC TCTCGACCTC
GCCAGAGCGG AAGGCCCATG A
 
Protein sequence
MDIRGGPLAC APLDAALFRA AVMSDESAKA LAAGSWPGAN CDDVDRWCTW LADVWAQPAV 
AEAIDTASPA LAVGVEDACR GSRPSVRRVR RLGLAVARYL VRMSRRATPF GLFAGVAPLR
FDVEPVLRWT GSDRPYPRVD SVWLAAVVAR LESMSSVRAA LTVVANDLAM VRGERLVVPW
RPHAHPPTET FLPAEVSVRH GRPVQAAVAS AAAPVRFVDL VDSVVVALGL SRPGVEEMLA
QLLTCGVLVS SLRPPATCTD GVAHLLDALN GIENTRVPAA TDRTDDSTSD PSHGLVGDLR
MIQAELGAAR DADSSLGRPR LAALRERMRA VCAVVEQPLA VDLRLDCTAV LPELVAVEAA
ATAGALLRLT PHPAGNPSWR DYHHRFLAKF GESALVRVDQ LVDPVIGLGY PAHFGVAEPP
PTGGVSPRDE YLLQLAQRAA FDGQQEVVLD EAALDTMSAQ TAGSWPEPHA EICVEVLAPS
MAALSQGAFT LLVTGVSRTG AAMSGRFLDL LPEADRLRMI DVFRRLPPTA EGAVPVQLSF
PPAHARLENV TRTPHVLADW VSLAEHRDSQ RGRLPWTDLA VTANDEQLHL VSLSRGAVVE
PLLTNAAARQ TFPPLARLLH ELPRASSGAV APFSWGAASC LPFLPRVRHG RAVLAPARWR
INPSDLPGPG AGDQEWADAL ARLRERSGLP RWVSTGRADV RLRLDLDERM DRDLLRAELD
RAGALTALEA PGPDDYGWAD GRAHEIVVPV ATTAAPRPAP AIVTTPGPMP VSDATSGILP
GSRVLFAKLY CHPDVVSTIL TDHLPALLDA WGVPPQWWFI RYRDPASHLR LRLHVAPDAD
GVTDSYGRAA ARVGAWAEQL RARRLIGDLV LDTYHPETAR YGDGEALVAA EHLFAADSAA
VVIQLAAQNA NRALHPAALT AVSMVDLTCA LLGGRQTGMR WLLENRRASG APTQRPVLRQ
AIALSRTCDD EPAPNAAIPQ LPAPLRAVWG VRRRAAERYA AQLAALPGLP ASSEVLGSLL
HLHYVRSHGI DSSAERTCHR LARAVALAWH NTGQHPPLDL ARAEGP