Gene Francci3_4187 details


Gene Information

Locus tag: Francci3_4187
Symbol:
ID: 3907152
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4998542
End bp: 4999633
Gene Length: 1092 bp
Protein Length: 363 aa
Translation table: 11
GC content: 68%
IMG OID: 637881515
Product: biotin synthase
Protein accession: YP_483264
Protein GI: 86742864
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0502] Biotin synthase and related enzymes
TIGRFAM ID: [TIGR00433] biotin synthetase
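
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the coordinates are 1-based and inclusive and that the gene length includes the stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(4998542, 4999633)
print(length_bp, "bp")                  # 1092 bp
print(protein_length(length_bp), "aa")  # 363 aa
```

Under these assumptions, both values agree with the Gene Length and Protein Length fields of the record.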


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGACA TGGACCTGTC AGCCACTCTG AACAGCCTTG TCAGCAAGGG GGTGAGTGGT 
CAGGCTCCGA CCCGGGACGA GGCCCTGGCC GTCCTGCGCA GTGACGACGA CGATCTCTTG
GACGTCGTCG CCGCCGCCTA CCGGCTCCGG CGGAGGTACT TCGGCAGGCG TGTCAAGCTG
AACTTTCTGG TGAACCTCAA GAGCGGACTC TGTCCGGAGG ACTGTTCCTA TTGCTCGCAG
CGGCTCGGTT CGAACACGGG AATCCTGAAG TACACCTGGC TCAAGCCCGA GGAGGCCGCC
GCGACGGCCG GCGCCGGCAT CTCGGGCGGT GCCCGCCGGG TGTGCCTGGT CGCGAGCGGC
CGCGGGCCGA CGGACCGGGA CGTCGACCGC GTGGCGGACA CGATCGGCGC GATCAAGACC
GCGCATCCGG ACGTGGAGGT GTGCGCGTGC CTCGGCCTGC TGTCCGACGG GCAGGCCGCA
CAGCTGCGGG CGGCCGGTGC GGACGCCTAC AACCACAACC TGAACACGGC CGGTGAGAAG
TACGCAGACA TCTGCACGAC GCACACCTAC AACGACCGGG TCGACACGGT GCAGGAAGCC
AGGCACGCCG GCCTCTCACC CTGCTCGGGT ATCATCGCCG GCATGGGGGA GAGCGACGAG
GACCTCGTCG ACGTCGCCTT CGCGCTGCGC GAGCTCGCCC CGGACTCCAT CCCGGTCAAC
TTCCTCATGC CATTCGAGGG CACGCCCCTG GGGGCGGAAT GGAACCTCAA CCCCCGGCAG
TGCCTGCGCA TTCTCGCCAT GGTCCGGTTC GTCAACCCCA CGGCCGAGGT GCGGCTCTCG
GGCGGCCGGG AGATTCATCT CGGCTCGATG CAGCCCCTCG CCCTCTCGGT GGTGAACTCC
ATCTTCCTTG GTGACTACCT GACCAGTGAG GGTCAGGAGG GCCACCAGGA CCTGAAGATG
ATCGCCGAGG CGGGATTCAC GGTGGAAGGC CTCAACACCG ACGCCGAGGC GGCGCTGGCC
ATGGGCGCGG GCCTGGAGCG GGTCGCGCTA CGTCAGCGCG GTGCCGGCAC CGACCTGCCG
CCCAACGCCT GA
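
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks have to be removed before it can be analysed. The following is a minimal sketch with hypothetical helper names (`clean_sequence`, `gc_percent`). It shows one way to join the blocks and compute a GC percentage that can be compared with the GC content field of the record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Only the first line of the gene sequence is used here, as an illustration;
# paste the full listing to obtain the value for the whole gene.
first_line = "ATGACGGACA TGGACCTGTC AGCCACTCTG AACAGCCTTG TCAGCAAGGG GGTGAGTGGT"
seq = clean_sequence(first_line)
print(len(seq), "bases, GC = %.1f%%" % gc_percent(seq))
```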
 
Protein sequence
MTDMDLSATL NSLVSKGVSG QAPTRDEALA VLRSDDDDLL DVVAAAYRLR RRYFGRRVKL 
NFLVNLKSGL CPEDCSYCSQ RLGSNTGILK YTWLKPEEAA ATAGAGISGG ARRVCLVASG
RGPTDRDVDR VADTIGAIKT AHPDVEVCAC LGLLSDGQAA QLRAAGADAY NHNLNTAGEK
YADICTTHTY NDRVDTVQEA RHAGLSPCSG IIAGMGESDE DLVDVAFALR ELAPDSIPVN
FLMPFEGTPL GAEWNLNPRQ CLRILAMVRF VNPTAEVRLS GGREIHLGSM QPLALSVVNS
IFLGDYLTSE GQEGHQDLKM IAEAGFTVEG LNTDAEAALA MGAGLERVAL RQRGAGTDLP
PNA
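
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an illustration, not the database's own pipeline. It builds the codon table from the NCBI amino acid string for translation table 11, which assigns the same amino acids as the standard code and differs only in the permitted start codons. The function name `translate` is hypothetical, and alternative start codons are not handled, which does not matter here because the gene begins with ATG.

```python
from itertools import product

# NCBI codon order: each base position cycles through T, C, A, G,
# with the third position varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence and its final 12 bases.
print(translate("ATGACGGACA TGGACCTGTC AGCCACTCTG"))  # MTDMDLSATL
print(translate("CCCAACGCCT GA"))                     # PNA
```

The two printed fragments match the first ten and last three residues of the protein sequence listed above.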