Gene Noca_3415 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_3415 
Symbol 
ID4598213 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp3616735 
End bp3617859 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content69% 
IMG OID639778021 
Productbiotin synthase 
Protein accessionYP_924602 
Protein GI119717637 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0502] Biotin synthase and related enzymes 
TIGRFAM ID[TIGR00433] biotin synthetase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.371377 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGACAT CCTTCGATCA CCTGGCGGAC CGCATCCTTG CCGGAGGTGA CGCGACGCCC 
GCCGACGCGT TGGCGGTGCT ACGCGCCGAC GAGAAGGACC TGCTCCACGT GGTTGCGGCA
GCGGGTCGGC TGCGCCGCGC GCGCTTCGGC AACACGGTGA AGGTCAACTA CCTGGTGAAC
CTGAAGTCCG GGCTCTGTCC GGAGGACTGC CATTACTGCA GCCAGGCGCT GGGATCCCGG
GCGCCGATCC TCAAGTACAA CTGGCTCTCG TCCGAGGAGG TCCTGGAGCA GGCCGGTGCC
GGCCTGCGAG GCGGGGCGAC GCGGGTGTGC CTGGTGTCCT CGGGCCGTGG CCCGTCGGAC
CGGGACGTGG ACCGGGTCGC AGCGATGGCC CAGGAACTGA AGGGTGAGCA GCCCGGCGTC
GAGATCTGCG CCTGTCTAGG GTTGCTGAAG GACGGGCAGG CCGAGCGGCT CCGGGCAGCC
GGAGTGGACG CCTACAACCA CAACATCAAC ACCGCCGAAT CCCACCACGA CACCATTGTC
TCGACCCACT CCTACTCCGA TCGAGTGGAC ACCATCGAGA AGGCGGCGGC CGCTGGGCTC
TCGCCGTGCT CGGGATTGAT CGCCGGACTC GGCGAGACCG ACGAGCAGCT GGTCGAGGCG
CTGTTCGCGC TCAAGGCTCT GGGCGCGGAC TCGATCCCGG TGAACTTCCT GATGCCGTTC
GACGGCACCC CCAGCGAGCG CACTTTCGAG CTCACGCCGA TCCGGTGCGT GCAGATCCTG
GCGATGACAC GATTCGTGTG TCCCGATACC GAGATCCGCA TCGCCGGCGG CCGCGAGATG
CACCTGCGGT CGCTGCAGGC CCTCGCCCTG CATGTCGCGA ACTCCATCTT CCTCGGCGAC
TACCTCACTT CCGAGGGCCA GGACGCGCGC GCCGACCTGG AGATGCTGCG CGACAACGGG
TTCGCCATCC TCGGCGCGGA GGCCAAACCC GCCGGCACGG CCACTGCGGC CCACCGCGCC
CAGACAGCCC ACGACATTGC CGGCGGCACC TCCGTTGCGG GGTCCGCCCC GGATCCGGCG
ATCCGCCGCC GTGGCGCCGG AACCGACGTG CCGGCCAACG CGTGA
 
Protein sequence
MQTSFDHLAD RILAGGDATP ADALAVLRAD EKDLLHVVAA AGRLRRARFG NTVKVNYLVN 
LKSGLCPEDC HYCSQALGSR APILKYNWLS SEEVLEQAGA GLRGGATRVC LVSSGRGPSD
RDVDRVAAMA QELKGEQPGV EICACLGLLK DGQAERLRAA GVDAYNHNIN TAESHHDTIV
STHSYSDRVD TIEKAAAAGL SPCSGLIAGL GETDEQLVEA LFALKALGAD SIPVNFLMPF
DGTPSERTFE LTPIRCVQIL AMTRFVCPDT EIRIAGGREM HLRSLQALAL HVANSIFLGD
YLTSEGQDAR ADLEMLRDNG FAILGAEAKP AGTATAAHRA QTAHDIAGGT SVAGSAPDPA
IRRRGAGTDV PANA