Gene Franean1_2220 details

Gene Information

Locus tag: Franean1_2220
Symbol:
ID: 5670619
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 2655941
End bp: 2657224
Gene Length: 1284 bp
Protein Length: 427 aa
Translation table: 11
GC content: 74%
IMG OID: 641241140
Product: hydroxyglutarate oxidase
Protein accession: YP_001506561
Protein GI: 158314053
COG category: [R] General function prediction only
COG ID: [COG0579] Predicted dehydrogenase
TIGRFAM ID:
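
The coordinate and length fields above are consistent with each other, which can be checked with a minimal sketch. It assumes 1-based inclusive coordinates and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2655941
END_BP = 2657224


def gene_length(start, end):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len):
    """Encoded residues, assuming the last codon is a stop codon."""
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1284 427
```

Both results match the Gene Length and Protein Length fields listed above.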


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.531505
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0413594
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCGGAGC GGATCGCGGT CATCGGCGGC GGGATCCTCG GGCTCGCGGT GGCACGCAGG 
CTCGGGCAGG TCGTGCCCGG ATCGACGGTC ACCGTGTTCG AGAAGGAGCA TGACGTCGCC
CAGCACCAGA CGGGGCGCAA CAGCGGCGTC GTCCACGCGG GCCTCTACTA CAAGCCGGGC
TCGCTGAAGG CGACGCTGTG CCGCCGCGGC GTCGGCCTGC TGCGCGAGTA CTGCGAGGCC
CGCGGCATCC GCTACGAGGA GTGCGGCAAG GTCGTCGTGG CCGTCGACGA CAGCGAGCTG
GGCCGGCTCG ACGACATCGC GCAGCGGGCG ACGGCCAACG GCGTGCCCGA CACCCGCATG
CTTGACGCCA CCGAGCTGCG CACGATCGAG CCGCACGCCC GCGGGGTCGC CGCGCTGCAC
TCCCCGACCA CGGCGATCGT GGACTACCCG GCCGTCGCCC GGGCGCTGCG CGCGGACATC
CTGGACGCGG GCGGCGCGGT GCGCACCGGC GCCGAGGTGA TCGGCGTGGA CGACGGCCCG
GCGGGCGTCC GGCTGCGGCT GCGGGTACGC GGCTCCGCGC CGGTCGCGCC GAACGGGAAC
CACCACACGG CGGCGGTCGA CGGCGGGACG GTCCGGGTGG TGTCGGAGTC GGTCGGCCCG
TTCGACCGGC TGATCTCCTG CGCCGGCCTG CACTCCGACG AGGTCGCCGC GCTCACCGGA
GAGGACAGCT CGCCGCGGAT CATCCCGTTC CGCGGCGACT ACTGGCTGCT GCGCCCCGAG
CGGCGCAACC TCGTCCGTGG CCTGATCTAC CCGGTGCCCG ACCCGCGCTA CCCGTTCCTG
GGCATCCACC TCACCAAGCG GGTGGACGGG GAGATCCTCG TCGGCCCCAA CGCCGTGCTC
GCCACCGCCC GCGAGGGCTA CACGGTGGGC ACCGTCGACC GGGGTGACCT GCGGCAGACC
CTGTCCTGGC CGGGCTTCCA GAAGATGGCT AAGACGCACT GGCGCACCGG CGCCAAGGAG
ATACTGCGCA CCGCCAGCAG GCGCGCGTTC GTCGCCGAGG CCCGACGCTA CGTTCCCGAG
CTGCGCACCG CGGACGTCGT GCGGGGCCCG GCGGGAGTGC GGGCCCAGGC CGTCGCGCGG
GACGGCAGCC TGGTCGACGA CTTCGTCCTC GCCGTGCGCG GACGGGTCGT CCACGTGCGC
AACGCCCCGT CCCCGGGTGC GACCGCGTCG CTGGCGATCG CGGAGCACAT CGTCGCCGAC
GCGGTACCCG AACGCACGTC CTGA
 
Protein sequence
MAERIAVIGG GILGLAVARR LGQVVPGSTV TVFEKEHDVA QHQTGRNSGV VHAGLYYKPG 
SLKATLCRRG VGLLREYCEA RGIRYEECGK VVVAVDDSEL GRLDDIAQRA TANGVPDTRM
LDATELRTIE PHARGVAALH SPTTAIVDYP AVARALRADI LDAGGAVRTG AEVIGVDDGP
AGVRLRLRVR GSAPVAPNGN HHTAAVDGGT VRVVSESVGP FDRLISCAGL HSDEVAALTG
EDSSPRIIPF RGDYWLLRPE RRNLVRGLIY PVPDPRYPFL GIHLTKRVDG EILVGPNAVL
ATAREGYTVG TVDRGDLRQT LSWPGFQKMA KTHWRTGAKE ILRTASRRAF VAEARRYVPE
LRTADVVRGP AGVRAQAVAR DGSLVDDFVL AVRGRVVHVR NAPSPGATAS LAIAEHIVAD
AVPERTS
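
The relationship between the gene and protein sequences can be illustrated with a short, dependency-free sketch. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which codons may initiate translation, so the first codon (GTG here) is read as methionine. The helper names `clean`, `gc_content`, and `translate` are hypothetical, and the first codon is treated as an initiator without being validated.

```python
# Codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate a CDS, stopping at the first stop codon.

    Assumption: the sequence begins at an initiation codon, so the
    first codon is emitted as M regardless of its usual meaning.
    """
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append("M" if pos == 0 else amino_acid)
    return "".join(residues)


# First line of the gene sequence from this record.
first_line = ("GTGGCGGAGC GGATCGCGGT CATCGGCGGC "
              "GGGATCCTCG GGCTCGCGGT GGCACGCAGG")
print(translate(first_line))  # MAERIAVIGGGILGLAVARR
```

The output matches the first 20 residues of the protein sequence above. Passing the full gene sequence to `gc_content` would give a value to compare against the GC content field.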