Gene Francci3_1888 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1888 
Symbol 
ID3906837 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2219726 
End bp2221009 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content70% 
IMG OID637879226 
Producthydroxyglutarate oxidase 
Protein accessionYP_480993 
Protein GI86740593 
COG category[R] General function prediction only 
COG ID[COG0579] Predicted dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.452346 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTAGGTC GCATCGCGGT GATCGGCGGT GGAATTCTCG GGCTGGCGGT CGCCCGTCGG 
CTCGGGCAGG TCGAACCGGG ATCGGCCGTG ACGGTGTTCG AGAAGGAACA GGACATCGCC
CGGCATCAGA CCGGCCGCAA CAGCGGCGTC GTCCACGCCG GTCTCTACTA CGTCCCAGGT
TCGCTCAAAG CGATCCTGTG CCGCCGTGGG GTCGGGCTGC TGCGGGAGTT CTGCGCCACG
CACCGTATCC GGTACGACGA GTGCGGCAAG ATCGTGGTCG CCGTCGACAA CAGCGAGCTC
GAACGGTTGG CCGAGATCGA GAAGCGGGCG ACGGCGAACG GCGTACCGCG GACCCGGATG
CTCGACGCCG ACGAGCTGCG GTCCGTCGAA CCCCACGCCC GCGGGGTCGC CGCCCTGCAC
TCACCCACCA CCGCCATCGT CGACTATCCC GGGGTCGCGC GGGCCCTGCG TACCGAGATC
GTCGCCGCTG GCGGGGCCGT GCGCACCGGA GCCGAAGTGA TCGGGGTGGA GGATCGTCCA
GCCGGCGTCC ACCTTCGCCT CACGGTCGCC GGAACCGCGC GCGCGCGGCC CAACGGCAAT
CACGAGATGG CGTCCCGCAA CGGTGGCCGG GTCGCCGTCG TCTCCGAAAG CGTCGGGCCG
TTCGACCTGC TGATCTCCTG TGCCGGGCTG CAGTCGGATC TGGTGGCGAC ACTGACCGGC
GAGGACGCCT CCCCGCAGAT CGTTCCCTTC CGGGGCGACT ACTGGCTGCT GCGCCCCGAG
CGGCGTGGCC TCGTCCATGG GCTGATCTAT CCGGTTCCGG ATCCGCGATA TCCCTTCCTC
GGTATCCATC TCACCAAGCG CATCGACGGG GAGATCCTGG TCGGGCCGAA CGCGGTGCTG
GCCACTGCCC GAGAGGGGTA CACGGTCGGC ACCGTCCAGG GCGCTGACCT GCGACGGACG
CTCGCCTGGC CGGGGTTCCA CAAGATGGCG AAGACCCACT GGAAGACCGG CGCCAAGGAG
ATGCTGCGCA CGGCGAGCAA GCGGGCCTTC GTCGCCGAGG CCCGGCGCTA TGTCCCCGAG
CTGCGGGCCA CCGACGTGGT TCGTGGCCCC GCGGGAGTCC GGGCCCAGGC CGTCGCCCGC
GACGGTAGCC TCGTCGACGA CTTCGTCCTG TCCCACAACG GGCGGGTCGT GCATGTCCGT
AACGCGCCAT CTCCCGGCGC GACGGCGTCG TTGGCGATCG CTGAGCACAT CGTCAGCAAA
ATCGTTCCTG AACGGGCCAG CTGA
 
Protein sequence
MVGRIAVIGG GILGLAVARR LGQVEPGSAV TVFEKEQDIA RHQTGRNSGV VHAGLYYVPG 
SLKAILCRRG VGLLREFCAT HRIRYDECGK IVVAVDNSEL ERLAEIEKRA TANGVPRTRM
LDADELRSVE PHARGVAALH SPTTAIVDYP GVARALRTEI VAAGGAVRTG AEVIGVEDRP
AGVHLRLTVA GTARARPNGN HEMASRNGGR VAVVSESVGP FDLLISCAGL QSDLVATLTG
EDASPQIVPF RGDYWLLRPE RRGLVHGLIY PVPDPRYPFL GIHLTKRIDG EILVGPNAVL
ATAREGYTVG TVQGADLRRT LAWPGFHKMA KTHWKTGAKE MLRTASKRAF VAEARRYVPE
LRATDVVRGP AGVRAQAVAR DGSLVDDFVL SHNGRVVHVR NAPSPGATAS LAIAEHIVSK
IVPERAS