Gene Franean1_7305 details


Gene Information

Locus tag: Franean1_7305
Symbol:
ID: 5675606
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 8927975
End bp: 8929828
Gene Length: 1854 bp
Protein Length: 617 aa
Translation table: 11
GC content: 71%
IMG OID: 641246142
Product: dihydroxy-acid dehydratase
Protein accession: YP_001511530
Protein GI: 158319022
COG category: [E] Amino acid transport and metabolism
              [G] Carbohydrate transport and metabolism
COG ID: [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase
TIGRFAM ID: [TIGR00110] dihydroxy-acid dehydratase
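
The coordinate and length fields above agree with one another, as the following minimal sketch shows. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and is not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1   # inclusive interval
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values taken from the record above.
print(cds_lengths(8927975, 8929828))  # (1854, 617)
```

The result matches the reported Gene Length (1854 bp) and Protein Length (617 aa).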


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.150096
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCGCTT TGCGTTCCCG CACCACCACC CACGGTCGAA ACATGGCCGG TGCCCGCGCC 
CTGTGGCGGG CGACCGGCAT GACCGACGAC GACTTCGGCA AGCCGATCGT CGCGATCGCG
AACAGCTTCA CGCAGTTCGT TCCCGGCCAT GTCCACCTGC GGGACCTCGG GAAGATCGTC
GCGGACGCGG TGGCCGGGTC GGGCGGGGTG GCCAAGGAGT TCAACACGAT CGCCGTCGAC
GACGGCATCG CGATGGGCCA CGGCGGCATG CTGTACTCCC TGCCATCCCG GGAGATCATC
GCGGACAGCG TCGAGTACAT GGTCAACGCC CACTGCGCCG ATGCGCTGGT CTGCATCTCG
AACTGTGACA AGATCACCCC GGGGATGCTG ATCGCCGCGC TGCGGCTCAA CATCCCGACC
GTCTTCGTCT CCGGCGGCGC GATGGAATCC GGTCACGCCG TCGTCACCGG CGGGATCGTG
CGGTCGCGGC TCGACCTGAT CGACGCGATG ACCGCGGCCG TCAACCCCGA CGTCAGCGAC
GCCGACCTGG ACACCATCGA GCGGTCCGCC TGCCCCACCT GCGGCTCCTG CTCCGGCATG
TTCACGGCCA ACTCGATGAA CTGCCTGACC GAGGCGCTCG GCCTGGCGCT GCCGGGCAAC
GGCTCGACGC TGGCCACCGC GGCGGCCCGG CGCGGCCTGT TCGTCGAGGC CGGTCGCCTG
GTCGTCGACC TGGCCCGCCG GTACTACGAG AAGGACGACG AGGCCGTCCT ACCCCGCTCG
ATCGCGAGCG CGGCGGCGTT CCGCAACGCG TTCGCGGTGG ACGTCGCCAT GGGCGGTTCG
ACGAACACCG TCCTGCACCT GCTCGCCGCC GCCGTCGAGG CCGGGGTCGA GGTGACCCTC
GACGACATCG ACCAGGTCTC CCGCTCGGTG GCCTGCCTGT GCAAGGTGGC GCCCAGCTCG
ACCGACTACT ACATGGAAGA CGTCCACCGG GCCGGCGGGA TCCCGGCGAT CCTCGGTGAG
CTCGACCGCG GCGGTCTGGT GGACCCGAAC GTGCACAGCG TGCACGCGGC GAGCCTGCGC
GAGTTCCTCG ACCGCTGGGA CGTGCGCGGG GCGGACCCGT CCCCGGACGC GATCGAGCTG
TTCCACGCCG CGCCCGGCGG CGTGCGCACC GTCGAACCGT TCGGCTCGAC GAACCGCTGG
GACACCCTCG ACACCGACGC GAAGAACGGC TGCATCCGCT CGGTCGAGCA CGCCTACTCG
GCCGACGGCG GCCTGGCCGT GCTGCGCGGC AACCTGGCCC CCGACGGCGC CGTGGTGAAG
ACGGCCGGCG TCGACGAGAG CCAGTGGACG TTCCGCGGGC CCGCGCTGGT CGTCGAGAGC
CAGGAGGCCG CGGTCGACGC GATCCTGAAC AAGGTCGTCA AGGCGGGCGA CGTGATCATC
GTCCGGTATG AGGGCCCCCG CGGTGGGCCC GGCATGCAGG AGATGCTCTA CCCGACGGCG
TTCCTCAAGG GCCGCGGCCT CGGGCCGAAG TGCGCGCTGA TCACCGATGG CCGCTTCTCC
GGTGGCAGCT CGGGCCTGTC GATCGGCCAC GTCTCCCCGG AGGCGGCGCA CGGTGGCCCG
ATCGCGCTCG TCCGGGACGG TGATCTCGTC GAGATCGACA TCCCGCGGCG GCGGATCGAC
CTGCTGGTGC CGGACGCCGA GCTCGCCGCG CGGCGGGCCG AGATCGAGGC GAACGGCGGC
TACCACCCGG CGAACAGGGA GCGTGTCGTG TCGGCCGCGC TGCGCGCCTA CGCGGCCATG
GCGACGTCCG CCTCGACCGG TGCCGCCCGT GACGTCCGGC TCATCACGGG ATGA
 
Protein sequence
MPALRSRTTT HGRNMAGARA LWRATGMTDD DFGKPIVAIA NSFTQFVPGH VHLRDLGKIV 
ADAVAGSGGV AKEFNTIAVD DGIAMGHGGM LYSLPSREII ADSVEYMVNA HCADALVCIS
NCDKITPGML IAALRLNIPT VFVSGGAMES GHAVVTGGIV RSRLDLIDAM TAAVNPDVSD
ADLDTIERSA CPTCGSCSGM FTANSMNCLT EALGLALPGN GSTLATAAAR RGLFVEAGRL
VVDLARRYYE KDDEAVLPRS IASAAAFRNA FAVDVAMGGS TNTVLHLLAA AVEAGVEVTL
DDIDQVSRSV ACLCKVAPSS TDYYMEDVHR AGGIPAILGE LDRGGLVDPN VHSVHAASLR
EFLDRWDVRG ADPSPDAIEL FHAAPGGVRT VEPFGSTNRW DTLDTDAKNG CIRSVEHAYS
ADGGLAVLRG NLAPDGAVVK TAGVDESQWT FRGPALVVES QEAAVDAILN KVVKAGDVII
VRYEGPRGGP GMQEMLYPTA FLKGRGLGPK CALITDGRFS GGSSGLSIGH VSPEAAHGGP
IALVRDGDLV EIDIPRRRID LLVPDAELAA RRAEIEANGG YHPANRERVV SAALRAYAAM
ATSASTGAAR DVRLITG
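
As a rough illustration of how the gene and protein sequences relate, the sketch below translates a nucleotide sequence and computes its GC fraction in plain Python. It is a sketch under stated assumptions: the codon-to-amino-acid mapping is the standard one (shared by NCBI translation tables 1 and 11 for non-start codons), translation stops at the first stop codon, and the helper names `translate` and `gc_fraction` are hypothetical. Only the first 30 bases of the gene sequence above are used in the example.

```python
from itertools import product

# Standard codon table, bases ordered T, C, A, G (NCBI convention).
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(_BASES, repeat=3), _AMINO)}


def clean(seq: str) -> str:
    """Remove whitespace (the page groups bases in blocks of 10) and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
prefix = "ATGCCCGCTT TGCGTTCCCG CACCACCACC"
print(translate(prefix))  # MPALRSRTTT
```

The translated prefix matches the first ten residues of the protein sequence. Pasting the full gene sequence into `translate` and `gc_fraction` would allow a comparison with the listed protein sequence and the reported GC content.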