Gene Franean1_2799 details


Gene Information

Locus tag: Franean1_2799
Symbol:
ID: 5671188
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3311180
End bp: 3314245
Gene Length: 3066 bp
Protein Length: 1021 aa
Translation table: 11
GC content: 70%
IMG OID: 641241708
Product: lantibiotic dehydratase domain-containing protein
Protein accession: YP_001507128
Protein GI: 158314620
COG category:
COG ID:
TIGRFAM ID:
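
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the listed gene length) and that the gene length includes one terminal stop codon (the listed gene sequence ends in TGA). The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 3311180
END_BP = 3314245


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, "bp")                   # compare with the Gene Length field
print(protein_length(length_bp), "aa")   # compare with the Protein Length field
```

Under these assumptions both printed values agree with the Gene Length and Protein Length fields of the record.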


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCGACG TGGAGGTGCC GCTGTACCGC CATGCTGGCG GTGCGATGCT GCGGGCGGCG 
GTCCTTCCGC TGTCCCAGCA GCCGGAGGAC TGGCCGACGT TGTCCGATCC GGACTCGTGC
CGGTCGTGGC TACGCGCCGT ATGGGCGCTG CCCGGGTTTG CCGATGCGAT CCGCTACGCG
AGCGGATCAT TCGCCGCTCA GGTCGAAGCT GTTCTCGACG ACCAGGCTGC GGGCGGCAAG
CAGGTTCGAC GGGTGACGTT GGCAGTCGTC CGCTACCTGC TGCGTTCCCT GGGACGGCCG
ACTCCGTTCG GCTGGTTCGC CGGTGTCGCC GGGGTCCGTA TCGGCGATGA CGTGCGTGCG
CGCTGGGGAT CGGCCCACCG GCTGGTCTTG CGGGCGGACA CGCTGTGGTT GGACGACGTT
GTCGAGCGGC TGGAGATGCT GCCCGACCTG CTCGCCCATC TGGACGTCAT GGCCTGCGAC
CTGTTGGTCG AACGCGGCGA TCGGATCGAG ATGCCTCGCG GCCCGGGGCG GGTGACGGTC
CGTAACACGG CGGTGATGCG TCTCGTCCGT AACCTGGCGG CCAAGCCGAT CCCGTTCCAC
CTCCTGCTCG ACCAGGTCGC CCTCGCGTTT CCTGCCGCCC CTGCCACGGC TGTCCGCCGA
GTGCTCGGTG ACCTGATCAC TCAAGGCATC CTGATCACGG GTCTTCGTGC GCCTATGACG
GTGGCTGACC CGCTGACGCA TGTGATCGCG GTGGTCGAGG GTGCCATGAT CGGCGATGGC
GAGGCTGTGG CGGCCGTGCT GGGTGATCTC CGCGCCGCGC AGCAGGTGAT CCGGACGCAC
AACGACGACC ACGTCGAGCC GGGCACGCAG GCACGGCTTC GGGAGGAGGC ATCGGAACGG
ATGCGTCGCC TGTCGCCGGC CGGCCGGACA TCGCTGGCCG GTGACCTGCA CCTCGACTGC
GACATCGCTG TTCCCACCGA CCTCGCCGAC GAGATGGCCC ACACGGTGGG TGCCTTGCTG
CGGCTCACGT GCCAGCCACG TCCGGACCAG GGGTGGAACG ACTGGTGCCG CGAGTTCTGG
GACCGCTACG GCACGGGAGC ACTCGTCCCC GTCCTAGACG CGGTGCATCC CGATACGGGC
ATCGGCTGGC CGGCTGGCTT CCCGGGCAGC ATGCTCGCCG AACCCGAGGA CACGGTCAGC
CACCGCGACC AAGAGCTGCT GCGCCTCGCC TGGGACACGG TTACCGCCGG ACATCACGAA
CTCGTGCTCA CCGACACGCT CATCGCCACG ATCACAGCGG ACAAGCCGGT GGACCCACGG
TGGATTCCGC CGCACGTCGA GCTGGGAGCC CGTGTCCACG CCACCAGCGT CCAGGCACTG
GCGGCGGGGG ACTACACGTT CGGCGTCCAT CCTGCGTGGG CGTTCGGCAC GCTCACCTGC
CGGTTCGGCG CCGCCGTGGG CCGTGCCGGC CTGGACACCG TCTTCGCCGC CGCGCCTTCG
GCTGTCGAGG GCGCGCTGCC AGTCCAGATG TCGTTCCCGC CACTGTTTCC GCACTCGGAG
AACGTCTCCC GGGTGCCCCG CTGGCTGCCC CAGGTGCTGC CGGTCGGTGA GCACCGGGCG
GACGAGTCGA CCGTGATCCG CCTGGATGAT CTCGGGATCG TCGCGGTGGC CGACGGGCTT
CACCTGGTCA GCATCTCGCG GCGCCAGGTC CTGGAACCGC AGGTCTTCCA CGCGCTCGCG
CTGCGCAAGC AGGCGCCGCC GCTGGCACGG TTCCTCGCCA CGCTGACCCG GGGCTTTCTC
GCCCGTTTCA CCGAGTTCGA CTGGGGACCG CTGGCCACCG GCCTGCCGCA CACGCCTCGG
GTGCGTTACC GGCGCGCGAT CCTGTCACCC GCGACGTGGC GGATCAACAC CGCGGACCAC
ACCGCACTAC GCGCCGACGG CCACTCCTGG GAGGAGGCAT TCGCCCGGTG GCGGCAGCGA
GGGGCCTGCC CGGACATCGT GGAGCTCCAT GATGACCACC GGTCGCTGCG CCTGGATCTC
ACCGTGGACG CGCACCTGGC GATCCTGCGC GAGCACCTGG ACAAGCACGG CCGCGCGACT
CTGACCGAGA CCGACTCGGT CGAGGACACG GGATGGATGG GCGGCCACAT CCACGAGATC
GTCCTGCCGT TCGTCCGCGC GGTACCGGCG GCGCCGAACC TCGTCACCGG CGCCCTGCCG
CTCGTGACCA GCGCGAACGC CGCACACCGA CCGGCCTCAC CGCACGGCTC CTGGCTCTAC
ACGCAGGTCT TCACCCATCC CGAACGTCTC GACGACATCC TCCGCACGCA CCTGCCCCGG
CTGCTTGACC TGCTCGACGG AGACCGGTCG TTTTGGTTCG CCCGTTACCG CAGCGTCCGT
GAGACCGACC ATCTGCGGCT GCGGATCCGC ACGGCCGGCC AGGAGGAGTA CGCCGCGGTG
GCCTGTGCGG TCGGCCAGTG GGGGCAGCAG CTCTGCGACG CGGGCGCGGC GTCCCGGCTG
ACCCTGGCGA CCTACCATCC CGAGATCGGC CGTTACGGCA GCGGGGCCGC GATGGACGCC
GCCGAAGCGG TGTTCGCGGC CGACTCGCAC GCGGTCGCGA CCGCACTCCA ACTCCCGACG
CCACTCGCCG TCCATCCGAT GGCGCTCGTC GCCATCGGCA TGGTGGACAT CGCCGACGGT
TTCCACTCCG ACCCCGTCCA CGCGAACACC TGGCTGCTGG AGCACCTCGC CGCCAAAGCC
ACGCCCGGCG CGGATCGGAC CGTCACCGAG CAGGTCACCC GCTGGGCAGC CCGAAGGACC
CTGCCGGGCG ATACATCCCT GCCCGCCGCC CTCGTGCAGA CGTGGCAGGC TCGCCGGGAA
GCGTTGATCC GCTATCGGCT CGCGCTCCCC GGGAACGCCG ACGCCGACCA GGTTCTGTCC
GCGCTCCTGC ACATGCATCA CAACCGCGCC AGACTCATCG ATCGTGCTGA CGAGGCCACC
TGCCGGCGCC TGGCCCGGCA GATCGCCCTC ACCCGACGCG CACACACCAC GGCGGACAGC
CCGTGA
 
Protein sequence
MVDVEVPLYR HAGGAMLRAA VLPLSQQPED WPTLSDPDSC RSWLRAVWAL PGFADAIRYA 
SGSFAAQVEA VLDDQAAGGK QVRRVTLAVV RYLLRSLGRP TPFGWFAGVA GVRIGDDVRA
RWGSAHRLVL RADTLWLDDV VERLEMLPDL LAHLDVMACD LLVERGDRIE MPRGPGRVTV
RNTAVMRLVR NLAAKPIPFH LLLDQVALAF PAAPATAVRR VLGDLITQGI LITGLRAPMT
VADPLTHVIA VVEGAMIGDG EAVAAVLGDL RAAQQVIRTH NDDHVEPGTQ ARLREEASER
MRRLSPAGRT SLAGDLHLDC DIAVPTDLAD EMAHTVGALL RLTCQPRPDQ GWNDWCREFW
DRYGTGALVP VLDAVHPDTG IGWPAGFPGS MLAEPEDTVS HRDQELLRLA WDTVTAGHHE
LVLTDTLIAT ITADKPVDPR WIPPHVELGA RVHATSVQAL AAGDYTFGVH PAWAFGTLTC
RFGAAVGRAG LDTVFAAAPS AVEGALPVQM SFPPLFPHSE NVSRVPRWLP QVLPVGEHRA
DESTVIRLDD LGIVAVADGL HLVSISRRQV LEPQVFHALA LRKQAPPLAR FLATLTRGFL
ARFTEFDWGP LATGLPHTPR VRYRRAILSP ATWRINTADH TALRADGHSW EEAFARWRQR
GACPDIVELH DDHRSLRLDL TVDAHLAILR EHLDKHGRAT LTETDSVEDT GWMGGHIHEI
VLPFVRAVPA APNLVTGALP LVTSANAAHR PASPHGSWLY TQVFTHPERL DDILRTHLPR
LLDLLDGDRS FWFARYRSVR ETDHLRLRIR TAGQEEYAAV ACAVGQWGQQ LCDAGAASRL
TLATYHPEIG RYGSGAAMDA AEAVFAADSH AVATALQLPT PLAVHPMALV AIGMVDIADG
FHSDPVHANT WLLEHLAAKA TPGADRTVTE QVTRWAARRT LPGDTSLPAA LVQTWQARRE
ALIRYRLALP GNADADQVLS ALLHMHHNRA RLIDRADEAT CRRLARQIAL TRRAHTTADS
P
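
The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any downstream use. The following is a minimal sketch of that cleanup together with a GC-fraction calculation; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned DNA sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    return sum(1 for base in seq if base in "GC") / len(seq)


# First line of the gene sequence listed above, used as sample input.
first_line = "ATGGTCGACG TGGAGGTGCC GCTGTACCGC CATGCTGGCG GTGCGATGCT GCGGGCGGCG"

seq = clean_sequence(first_line)
print(len(seq))                     # number of bases on that line
print(round(gc_fraction(seq), 2))   # GC fraction of that line only
```

Applying the same two functions to the full gene sequence should give a length and GC percentage that can be compared with the Gene Length and GC content fields.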