Gene Francci3_4515 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4515 
Symbol 
ID3907492 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp5389611 
End bp5391464 
Gene Length1854 bp 
Protein Length617 aa 
Translation table11 
GC content71% 
IMG OID637881848 
Productdihydroxy-acid dehydratase 
Protein accessionYP_483590 
Protein GI86743190 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.133337 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGCCC TGCGCTCCCG CACCACCACC CACGGCCGGA ACATGGCCGG CGCCCGCGCC 
CTGTGGCGCG CGACCGGGAT GACCGACGAC GACTTTGGCA AGCCGATCGT CGCCGTGGCT
AACAGCTTCA CCGAGTTCGT CCCGGGGCAT GTGCACTTGC GTAACCTTGG CTCGCTGGTG
GCCGGGGCGG TGGCCGAGGC CGGTGGGGTG GCGCGCGAGT TCAACACCAT TGCCGTGGAC
GACGGCATCG CGATGGGGCA TGGCGGGATG CTCTACTCGC TGCCGTCCCG CGAGCTCATC
GCCGACAGCG TCGAGTACAT GGTGAACGCC CACTGCGCCG ACGCCCTGGT CTGCATCTCC
AACTGCGACA AGATCACCCC GGGGATGCTG CTCGCGGCAC TGCGGCTGAA CATCCCGACC
GTGTTCGTCT CCGGCGGAGC GATGGAGTCG GGCAACGCGG TCATCTCCGG CGGCACGGCT
CGGTCCAGGC TGGACCTCAT CACCGCGATG TCGGCGGCGG TCAACCCGGA CGTCTCGGAC
GGCGACCTGT CGACGATCGA GCGTTCGGCC TGCCCGACGT GTGGATCCTG CTCCGGCATG
TTCACGGCGA ACTCGATGAA CTGCCTGACC GAGGCGATCG GGCTGTCCCT GCCCGGCAAC
GGGTCGACCC TGGCCACCGC CGCCGCCCGC CGTGAGCTGT TCGTCGAGGC CGGGCGCCTC
GTGGTCGACC TGGCGCGGCG CTATTACGAG AAGGACGACG AGGCGGTCCT GCCCCGGTCG
ATCGCGACCG CCGCCGCCTT CCGTAACGCC TTCGCGGTCG ACGTGGCGAT GGGCGGCTCG
ACGAACACCG TGCTGCATCT GTTGGCCGCC GCCGTCGAGG CCGGCGTTGA CGTCACCCTC
GCCGACATCG ACCAGATCTC CCGCACCGTC CCCTGCCTGT GCAAGGTGGC GCCGAGCTCC
ACCCGTTACT ACATGGAGGA CGTCCACCGG GCGGGCGGCA TCCCCGCGAT CCTCGGCGAG
CTCGACCGGG CCGGGCTGCT CGACCCGGAC CCGCACACGG TGCACTCCGC GAGCCTGCGC
GAGTTCCTCG ACCGCTGGGA CGTCCGCGGC CCGAGCCCCT CGCCGGACGC GATCGAGCTG
TTCCACGCGG CGCCGGGTGG CGTGCGCACG ATCGAGCCGT TCAGCTCCAC CAATCGGTGG
GACACCCTTG ACACCGACGC CAGGGACGGT TGCATCCGTT CGGTCGAGCA CGCCTACTCC
GCCGAGGGTG GGCTCGCGGT GCTGTTCGGC AACCTGGCCG TCGAGGGCGC CGTCGTGAAG
ACGGCCGGTG TGGACGAGGG CCAGTGGACC TTCCGCGGCC CGGCGCTCGT GGTCGAGAGC
CAGGAAGAAG CGGTCGACGC CATCCTCACC GGGCGGGTCA AGGCCGGGAA TGTGATCATC
GTCCGCTACG AGGGCCCTCG CGGCGGTCCG GGGATGCAGG AGATGCTCTA CCCCACCGCG
TTCCTCAAGG GCCGCGGTCT CGGCCCGAAG TGCGCCCTGA TCACCGACGG CCGGTTCTCC
GGCGGGAGCT CCGGACTGTC GATCGGTCAC GTCTCCCCGG AGGCGGCCCA CGGCGGGACG
ATCGCCCTGG TCCGCGACGG GGACATCATC GAGATCGACA TCCCGGCCCG CCGGCTGGAG
CTCGTGGTCT CCGACGAGGA GCTGGCGAGC CGACGCGCGG CGCTGGAGGC GGCCGGCGGC
TACCGTCCCA CCGGGCGGGA ACGGCCGGTG TCCATGGCGC TGCGGGCCTA TGCGGCGATG
GCGACCTCGG CCTCCACCGG TGCCGCGCGC GACGTCGGTC TGCTCGGCGG CTGA
 
Protein sequence
MPALRSRTTT HGRNMAGARA LWRATGMTDD DFGKPIVAVA NSFTEFVPGH VHLRNLGSLV 
AGAVAEAGGV AREFNTIAVD DGIAMGHGGM LYSLPSRELI ADSVEYMVNA HCADALVCIS
NCDKITPGML LAALRLNIPT VFVSGGAMES GNAVISGGTA RSRLDLITAM SAAVNPDVSD
GDLSTIERSA CPTCGSCSGM FTANSMNCLT EAIGLSLPGN GSTLATAAAR RELFVEAGRL
VVDLARRYYE KDDEAVLPRS IATAAAFRNA FAVDVAMGGS TNTVLHLLAA AVEAGVDVTL
ADIDQISRTV PCLCKVAPSS TRYYMEDVHR AGGIPAILGE LDRAGLLDPD PHTVHSASLR
EFLDRWDVRG PSPSPDAIEL FHAAPGGVRT IEPFSSTNRW DTLDTDARDG CIRSVEHAYS
AEGGLAVLFG NLAVEGAVVK TAGVDEGQWT FRGPALVVES QEEAVDAILT GRVKAGNVII
VRYEGPRGGP GMQEMLYPTA FLKGRGLGPK CALITDGRFS GGSSGLSIGH VSPEAAHGGT
IALVRDGDII EIDIPARRLE LVVSDEELAS RRAALEAAGG YRPTGRERPV SMALRAYAAM
ATSASTGAAR DVGLLGG