Gene Franean1_0967 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0967 
Symbol 
ID5669381 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1129596 
End bp1130681 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content71% 
IMG OID641239895 
Productbiotin synthase 
Protein accessionYP_001505329 
Protein GI158312821 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0502] Biotin synthase and related enzymes 
TIGRFAM ID[TIGR00433] biotin synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00736488 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.128814 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGCAC CTGTGACCGC GCCTACGACC ATGCCCGCGC AGACTCCCCC GACCGTCGAG 
ACCGACGTCG TCACCGAGGC GCCCCGACCC CTCGACGACG ACATTCTCGC CCGTGCCCGC
CGACAGGTGC TCGACGAGGG GCGCGGCCTC GACGAACAGG ACGTGCTCGC GGTCCTCCAG
CTGCCGGACG AGGCACTGGG CGACCTGCTC GCGCTCGCCC ACGAGGTGCG GCTGCGCTGG
TGCGGGCCGG AGGTCGAGGT TGAGGGCATC ATCAGCCTCA AGACCGGCGG CTGCCCCGAG
GACTGCCACT TCTGCTCCCA GTCCGGGCGC TTCGACTCCC CCGTGCGCTC CGCCTGGCTG
GACGTCCCGT CCCTGGTCGA GGCGGCGAAG GCGACGGCGG CCACCGGCGC CACCGAGTTC
TGCATCGTCG CCGCGGTCCG CGGCCCCGAC CAGCGGCTGA TGGCGCAGAT CCGCGAGGGC
GTGGCGGCGA TCCGGGAGGC CGTCGACATC AACGTCGCCT GCTCGCTCGG CATGCTCACC
CAGGAGCAGG TGGACGAGCT CGCCGGCCTG GGCGTGCACC GCTACAACCA CAACCTGGAG
ACGGCGCGCT CGCACTTCCC GAAGGTGGTC ACCACCCACA GCTGGGAGGA GCGCTGGGAG
ACCTGCGAGC TCGTCCGCGC CGCCGGGATG GAGCTGTGCT GCGGCGCGAT CATCGGTGTG
GGCGAGTCCC TCGAACAGCG CGCCGAGCTG GCCGCCCAGC TCGCCGCTCT GGAGCCGGAC
GAGGTTCCGC TGAACTTCCT CAACCCGCGA CCCGGCACCC CGTTCGGTGA CCTGCCCGCG
GTGGACTCAC GCGAGGCCCT GCGCACCATC GCCGCGTTCC GGCTGGCGCT GCCCCGCACG
ATCCTGCGCT ACGCCGGCGG GCGCGAGATC ACGCTGGGCG ACCTGGATGT CCAGGGAATG
CTCGGCGGCA TCAACGCGGT GATCGTTGGG AACTACCTGA CCACGCTCGG CAAGAATCCG
GAGAGCGACC TGGCCATGCT CACCGAGCTG CGGATGCCGA TCAAGTCCCT GCAGGCCACG
CTCTAG
 
Protein sequence
MTAPVTAPTT MPAQTPPTVE TDVVTEAPRP LDDDILARAR RQVLDEGRGL DEQDVLAVLQ 
LPDEALGDLL ALAHEVRLRW CGPEVEVEGI ISLKTGGCPE DCHFCSQSGR FDSPVRSAWL
DVPSLVEAAK ATAATGATEF CIVAAVRGPD QRLMAQIREG VAAIREAVDI NVACSLGMLT
QEQVDELAGL GVHRYNHNLE TARSHFPKVV TTHSWEERWE TCELVRAAGM ELCCGAIIGV
GESLEQRAEL AAQLAALEPD EVPLNFLNPR PGTPFGDLPA VDSREALRTI AAFRLALPRT
ILRYAGGREI TLGDLDVQGM LGGINAVIVG NYLTTLGKNP ESDLAMLTEL RMPIKSLQAT
L