Gene Francci3_3763 details

Gene Information

Locus tag: Francci3_3763
Symbol:
ID: 3906047
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4512016
End bp: 4513092
Gene Length: 1077 bp
Protein Length: 358 aa
Translation table: 11
GC content: 70%
IMG OID: 637881089
Product: biotin synthase
Protein accession: YP_482843
Protein GI: 86742443
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0502] Biotin synthase and related enzymes
TIGRFAM ID: [TIGR00433] biotin synthetase
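
The Start bp, End bp, Gene Length and Protein Length fields above can be checked against one another with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record but are not stated by it, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4512016
end_bp = 4513092

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be the stop codon,
# which does not contribute an amino acid.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, codon_count, remainder, protein_length)
```

With these inputs the gene length comes out at 1077 bp and the protein length at 358 aa, matching the listed fields.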


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.47987
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.559931
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACTGCAT CCGTGACCAC GTCCGTGACC GCGCACCCCA CCGCACCTGC ACCTGCGTCG 
CAGCGGCCGC CGGCGGACCA GGACGGTGCC GACATCCTCA CCGTGGCACG GCGCGAGGTC
CTCGATGGCG GACGGGGGCT GGACGAGGCC GGCGTGCTCG CCGTGCTCCG GCTCCCGGAC
GAGACGCTGA CCGATCTGCT CGCCCTGGCC CACGAGGTGC GGATGCGCTG GTGTGGTCCG
GAGGTCGAGG TGGAGGGGAT CGTCAGCCTC AAGACCGGCG GATGCCCGGA AGATTGTCAC
TTCTGCTCGC AGTCCGGCAA GTTCGACTCG CCGGTGCGGT CCGCCTGGCT GGACGTGCCC
TCGCTCGTCG ACGCCGCCCG GCAGACCGCG GCGACAGGCG CCACCGAGTT CTGCATCGTC
GCGGCCGTGC GGGGCCCGGA CGCCCGGCTC ATGGCGCAGG TGCGGGAGGG GGTCGCCGCC
ATCCGTGCGG CGGTCGACAT CAACGTCGCC TGCTCGCTGG GCATGCTGAC CTCCGAGCAG
GTCGACGAAC TCACGGCGAT GGGTGTGCAC CGTTACAACC ACAATCTGGA GACGGCCCGC
TCGCACTTCC CGAACGTGGT CACCACCCAC AGTTGGGAGG AGCGGTGGGA GACCTGTGAG
ATGGTGCGGG CCGCGGGGAT GGAGCTGTGC TGCGGCGCCA TTCTGGGCGT CGGCGAGAGC
CTCGAGCAGC GTGCCGAGCT CGCCACCCAG CTTGCGGCCC TGGAGCCCGA CGAGGTTCCG
CTGAACTTCC TCAACCCGCG GCCGGGAACG CCCTTCGGGG ATCTTCCGCT GGTCGAGCCG
CGTGACGCGC TGCGCGCGAT CGCGGCGTTC CGCCTCGCCA TGCCGCGCAC GATCCTGCGC
TACTCCGGCG GACGCGAGAT CACCCTGGGC GATCTCGACG TGCAGGGGAT GCTCGGTGGC
ATCAACGCCA TGATCGTCGG AAACTATCTG ACGACGCTGG GCCGCTCCGC GGAGGCCGAC
CTGAAGATGC TGGCCGAGCT GAGCATGCCG ATCAAGTCCC TGCAGGCCAC TCTCTAA
 
Protein sequence
MTASVTTSVT AHPTAPAPAS QRPPADQDGA DILTVARREV LDGGRGLDEA GVLAVLRLPD 
ETLTDLLALA HEVRMRWCGP EVEVEGIVSL KTGGCPEDCH FCSQSGKFDS PVRSAWLDVP
SLVDAARQTA ATGATEFCIV AAVRGPDARL MAQVREGVAA IRAAVDINVA CSLGMLTSEQ
VDELTAMGVH RYNHNLETAR SHFPNVVTTH SWEERWETCE MVRAAGMELC CGAILGVGES
LEQRAELATQ LAALEPDEVP LNFLNPRPGT PFGDLPLVEP RDALRAIAAF RLAMPRTILR
YSGGREITLG DLDVQGMLGG INAMIVGNYL TTLGRSAEAD LKMLAELSMP IKSLQATL
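
The gene and protein sequences above are related through translation table 11, and the GC content field is derived from the gene sequence. The following is a minimal sketch of both calculations in plain Python, not the method the database itself uses. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. The codon table is the standard amino-acid assignment, which table 11 shares, and an initiator codon such as GTG in the first position is read as M. Only the first 60 bases of the gene sequence are used as a demonstration.

```python
# Standard codon assignments in TCAG order; table 11 uses the same
# amino-acid assignments and differs in its permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiator codons of translation table 11; as the first codon they code for M.
TABLE_11_STARTS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Remove the spaces and line breaks used in the record's layout."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in TABLE_11_STARTS:
            aa = "M"  # initiator codon, read as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence in this record.
first_line = "GTGACTGCAT CCGTGACCAC GTCCGTGACC GCGCACCCCA CCGCACCTGC ACCTGCGTCG"
peptide = translate(first_line)
print(peptide)
```

The 20 residues produced from these 60 bases agree with the first 20 residues of the protein sequence listed above (MTASVTTSVT AHPTAPAPAS). Passing the full gene sequence to `gc_content` and `translate` in the same way would allow the GC content and Protein Length fields to be cross-checked.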