Gene Francci3_0983 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0983 
Symbol 
ID3905839 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1160487 
End bp1161740 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content70% 
IMG OID637878317 
Productoxidoreductase, molybdopterin binding 
Protein accessionYP_480096 
Protein GI86739696 
COG category[R] General function prediction only 
COG ID[COG2041] Sulfite oxidase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.729039 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGACAGT TGCCCCGCTG GTTGCCCCGC CGGTTACCTC GTCTGCTGAC CGAAAAGCTG 
CCGGAACCAC CCCCGGTCCT GCGGCGCGGC CCCCTGCGCG AAAACGCATT CCCCAGTCGG
CTGCACGATC CACGGATGGC GGCTCTCCTC GGAGTCTGGC TCGGGATCGC CTTCGGAACA
TGCTTTCTGA CCGGGCTGGT ATCGCATTTC ATGCAGCATC CGCCGTCCTG GCTGGAATGG
CCTTCCCGAC CCGCCTGGCT GTACCGGGTG ACCCAGGGCC TGCACGTGAC CACCGGACTC
GCGAGCGTCC CACTGCTGCT GGCGAAGCTG TGGACGGTGT ATCCGCTGTT GTGGGAATGG
CCCCCGATCC GTTCCGTCGC CCATGCGGTG GAGCGGATGA CGATCGTGCC GTTGATCGCA
GGGGCGATCT TCCAGCTCAG CACGGGCATC GCGAACATCG CGCAGTGGTA CCGGTTCCAC
TTCTTCTTCA CCGTGACCCA TTACTGGGTC GCCTGGATCA CGATCGGCGC GCTGGTCCTC
CACATCGTCG CGAAGCTCAC CGTCATCCGG GCGAATGTGG GCCGTCGCCG CCATGAGCGC
GCGCACCTGG CCGCCGCCAC CATGGCCACC ACCACCATGG CCGGCGCCGC CCCGGGGACG
GGCGGTCTGA CGCGGCGCGG CCTCGGCCTC GCGACGGGGA CCGCCGCCGG GGTGATCGTG
GCGACGACCG CCGGTCAGTC GGTTCCGGGC CTGGCCCGGC TGGACCTGCT CGCGCCGCGG
CGGCCCGACA TCGGCCCGCA GGGGCTCCCG GTGAACCGCA CGGCGCGAGC CGCGCGGGTC
ACCTCCCTCG CCCGTGACCC CGGGTACCGG CTGGAGGTGG TCGGTCCGCG CCGGGTGGTG
TACACCATCG AGGAGCTGCA CGCGCTGCGT CGGTACAGCT CGAAGCTGCC GATCACCTGC
GTCGAGGGAT GGGCCGCGGA CGCCACCTGG CACGGGCCGC GGCTGCGGAA TCTACTGGAC
GCCGCCATGA TCCCCGCCGA CGCGACGGTC CGGGTCGAGT CCCTCGAGGC CCGCGGGGGC
TACCGGGCAA GCGACGTCAA CCCCTCGCAC GCCCGGGACT CGTTGACCCT GCTCGCGACC
GGAGTGAACG GCGCCGACCT CGACCTCGAT CACGGTTATC CGGCCCGGCT CATCGCGCCG
AACCGGCCCG GGGTCCTCCA GACGAAATGG GTACATCGGG TGGTGATGGT ATGA
 
Protein sequence
MRQLPRWLPR RLPRLLTEKL PEPPPVLRRG PLRENAFPSR LHDPRMAALL GVWLGIAFGT 
CFLTGLVSHF MQHPPSWLEW PSRPAWLYRV TQGLHVTTGL ASVPLLLAKL WTVYPLLWEW
PPIRSVAHAV ERMTIVPLIA GAIFQLSTGI ANIAQWYRFH FFFTVTHYWV AWITIGALVL
HIVAKLTVIR ANVGRRRHER AHLAAATMAT TTMAGAAPGT GGLTRRGLGL ATGTAAGVIV
ATTAGQSVPG LARLDLLAPR RPDIGPQGLP VNRTARAARV TSLARDPGYR LEVVGPRRVV
YTIEELHALR RYSSKLPITC VEGWAADATW HGPRLRNLLD AAMIPADATV RVESLEARGG
YRASDVNPSH ARDSLTLLAT GVNGADLDLD HGYPARLIAP NRPGVLQTKW VHRVVMV