Gene Francci3_2603 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2603 
Symbol 
ID3906509 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3071746 
End bp3073308 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content74% 
IMG OID637879928 
Productcobyric acid synthase 
Protein accessionYP_481694 
Protein GI86741294 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1492] Cobyric acid synthase 
TIGRFAM ID[TIGR00313] cobyric acid synthase CobQ 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.478422 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.158438 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGGGG GACTGCTGGT CGCCGGGACC GCGTCGGATG CCGGCAAGAG CGTGCTGACC 
GCGGGGATCT GCCGGTGGCT GGCGCGGGAG GGGGTGCGGG TCGCCCCGTT CAAGGCGCAG
AACATGGCGT TGAACTCGGC GGTCACCGCC GATGGTGCGG AGATCGGCCG GGCCCAGGCG
ATGCAGGCGG CGGCGGCCGG TGTCGAACCG GAGGCGGCGA TGAACCCGGT GCTGCTCAAA
CCGGGGGGCC AGCGGCACAG CCAGCTCGTT GTGCTGGGCC GCCCGGTCGC CGAGGTCGAC
GCGCTCGGCT ACCGCCCGTA CAAGGAACGG CTGGCCGCGA TTGTCCTGGA GTGCCTGGAC
GACCTGCGCG GCCGGTTCGA CGCGGTGATC TGCGAGGGGG CCGGTTCTCC GGCGGAGATC
AACCTGCGTT CGACCGACAT CGCCAACATG GGCCTGGCGC GCGCCGCGAA CCTGCCGGTG
ATCGTGGTCG GCGACATCGA CAAGGGCGGG GTCTTCGCCG CCCTGTTCGG CACGCTGGCC
CTGCTCGATG CGGCCGACCA GGCGTTGGTT GCCGGCTGGG TGATCAATCG GTTCCGTGGC
GACGCCCGAC TGCTCGAACC CGGACTGCGC CAGATCGAAC GGCTCACCGG CCGGCCGGTG
CACGGCGTCG TCCCCTGGAA GGCGGGGTTG TGGCTGGACG TCGAGGACTC CCTCGACCTC
GCTGCCTTCC CCGACGCCGA GCCCTGTCCC GACGCCGAGC CCTGTCCCGA CGCCGAGCCC
TGTCCCGAGG CGCGGCCTGC CTCGCACGGC GGTCGGCGGG AGGTGCTGCG GGTCGCCGTC
ATCCGGCTGC CCCGGCTGTC GAACGTGACC GACATCGACG CGTTGCGCGT CGAGCCCGGG
GTCGCGGTGC GCCTGGCCAC CCGACCGGAC GAGCTCGCCG ACGCCGACCT CGTGATCCTG
CCGGGCACCC GTTCCACCGT CGAGGACCTG CGCTGGCTGC GTCGCCGTGG TCTCGCCGCG
GCCCTCGCCG AACGCGCCGC CGCGGCCCGT CCGGTGCTGG GTATCTGTGG CGGCTACCAG
ATCCTCGGCC GTCGCATCCG TGACGACGTC GAATCGGGTG CGGGCGAGGT CGATGGTCTC
GGCCTGCTCC CGGTCATCAC CACGTTCGAC CCGGTGAAGC TGCTCGGTCG GCGCGCGGCC
ACCGATGCCG CCGGCCGACC GCTGACCGGC TACGAGATCC GGCACGGGCG GCTGACCGTC
GAGGAGCATC CGGACAGCGC GCCGTTCGCC GCGGACGGGG TGCGCGTCGG CGCGGTCGCC
GGCACGAGCT GGCACGGGGT GCTGGAGAAC GACGCGTTCC GCCGCGCCTA TCTCGCCGAC
GTGGCCACGG CCGCGGGGCG TTCGTTCGTC CCGGCGTTCA CGTGCTTCGC CGATGCTCGG
CAGCGCCGCC TCGACGCCCT CGGTGACCTC GTCGCCGACC ATCTCGACAC AGGCGCCCTG
CGCCGCCTGC TCGCCGAGGG CACACCCGCC GGCCTGCCGT TCGTCCCCCC CGGCGCATCC
TGA
 
Protein sequence
MSGGLLVAGT ASDAGKSVLT AGICRWLARE GVRVAPFKAQ NMALNSAVTA DGAEIGRAQA 
MQAAAAGVEP EAAMNPVLLK PGGQRHSQLV VLGRPVAEVD ALGYRPYKER LAAIVLECLD
DLRGRFDAVI CEGAGSPAEI NLRSTDIANM GLARAANLPV IVVGDIDKGG VFAALFGTLA
LLDAADQALV AGWVINRFRG DARLLEPGLR QIERLTGRPV HGVVPWKAGL WLDVEDSLDL
AAFPDAEPCP DAEPCPDAEP CPEARPASHG GRREVLRVAV IRLPRLSNVT DIDALRVEPG
VAVRLATRPD ELADADLVIL PGTRSTVEDL RWLRRRGLAA ALAERAAAAR PVLGICGGYQ
ILGRRIRDDV ESGAGEVDGL GLLPVITTFD PVKLLGRRAA TDAAGRPLTG YEIRHGRLTV
EEHPDSAPFA ADGVRVGAVA GTSWHGVLEN DAFRRAYLAD VATAAGRSFV PAFTCFADAR
QRRLDALGDL VADHLDTGAL RRLLAEGTPA GLPFVPPGAS