Gene Ndas_3768 details


Gene Information

Locus tag: Ndas_3768
Symbol:
ID: 9247637
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4526634
End bp: 4527929
Gene Length: 1296 bp
Protein Length: 431 aa
Translation table: 11
GC content: 76%
IMG OID:
Product: F420 biosynthesis protein FbiB, C-terminal domain protein
Protein accession: YP_003681672
Protein GI: 297562698
COG category:
COG ID:
TIGRFAM ID:
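
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4526634, 4527929)
print(length)                  # 1296
print(protein_length(length))  # 431
```

Both values agree with the Gene Length and Protein Length fields listed in the record.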


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCACCG AACTGCGCGT GCGCGGGCTC GGCGGCATCC CCGAGGTGCG GTCGGGCGAC 
GACCTGTCGG CCCTGGTCGC CGAGGCGGTG GCCGCCTCCG GGACCGCGCT GGCCGACGGG
GACATCCTCG TGGTCACCTC CAAGATCGTG AGCAAGGCCG AGGGCCGCGC CCGCCGTATG
GACCGGGAGG AGGCCGTCGC CTCCGAAACC GTCCGGGTGG TGGCCCGCAC GGGCAGGACC
GGGATCATGC AGACCCGGCA GGGGCTGGTC ATGGCCGCCG CGGGGGTGGA CGCCTCCAAC
GTCGAGCCCG GCACCGTGCT GCTGCTGCCC GAGGACTCCG ACGCCTCCGC CCGCGCGCTG
CGCGCCGGGC TGCGCGAGCG CCTGGGCGTG AACGTCGGTG TGATCGTGTC CGACACCTTC
GGGCGGCCGT GGCGGATCGG GCAGACCGAC GTGGCCATCG GCGCCGCCGG GCTGTCCGCC
GTGCAGGACC TGCGCGGCAC CGCCGACACC CACGGCAACC TCATGGAGGC GACCGTCAAC
GCGGTGGCCG ACGAGATCGC CGGCGCGGGC GAACTGGTCA AGGGCAAGAC GAACGGTGTC
CCGGTCGCGG TGGTCAGCGG CCTGGGCGGC CTGGTCACCG AGGAGGACGG TCCTGGGGCG
GCGGTGCTGA TCCGCACGCC CGAGGCCGAC CTGTTCCGGT ACGGGTCGCG GGACGTGGTC
CCGGCCCGGC GGACGGTGCG CTCGTTCACC GACGCGCCGG TGGATCCGGC GGCGGTGCGG
CGGGCGGTGG CCGCGGCGGT GACGGCCCCC GCGCCGCACC ACACCACGCC GTGGCGCTTC
GTGCTGGTGG AGTCGGCGCA CACGCGCAAG CGGCTGCTGG ACGCGATGCT CGCGGCGTGG
GTGGCGGACC TGCGCGGTGA CGGGTTCAGC GAGGAGCAGA TCGCCCGGCG CACGCGGCGC
GGCGACGTGC TGCGCGGCGC GCCGTACCTG GTGGTGCCCT TCCTGGTGGC CGACGGGGCG
CACCCCTACC CGGACGCGCG CCGGGCCGAC GCCGAGCGGT CGATGTTCCT GGTGGCGATG
GGCGCCGGGG TGCAGAACCT GCTGGTGGGG CTGGCCGTGG AGGGGCTGGG GTCGGCCTGG
GTCTCCTCGA CCATGTTCTG CGCGGACGTG GTGCGCGAGG TGCTGGAGGT GCCCGAGGAG
TGGCGGCCGA TGGGCGCGGT GGCCGTGGGG CACGCGGCCG AGGCGCCGAA GGAGAGGCCG
CCGCGCGACC CGGAGGACTT CGTCGCCGTG CGTTAG
 
Protein sequence
MSTELRVRGL GGIPEVRSGD DLSALVAEAV AASGTALADG DILVVTSKIV SKAEGRARRM 
DREEAVASET VRVVARTGRT GIMQTRQGLV MAAAGVDASN VEPGTVLLLP EDSDASARAL
RAGLRERLGV NVGVIVSDTF GRPWRIGQTD VAIGAAGLSA VQDLRGTADT HGNLMEATVN
AVADEIAGAG ELVKGKTNGV PVAVVSGLGG LVTEEDGPGA AVLIRTPEAD LFRYGSRDVV
PARRTVRSFT DAPVDPAAVR RAVAAAVTAP APHHTTPWRF VLVESAHTRK RLLDAMLAAW
VADLRGDGFS EEQIARRTRR GDVLRGAPYL VVPFLVADGA HPYPDARRAD AERSMFLVAM
GAGVQNLLVG LAVEGLGSAW VSSTMFCADV VREVLEVPEE WRPMGAVAVG HAAEAPKERP
PRDPEDFVAV R
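
The sequences above are printed in space-separated groups of ten, so they need to be joined before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned and its GC content computed; the helper names are hypothetical, and only the first two groups of the gene sequence are used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two groups of the gene sequence, used here only as sample input.
first_groups = clean_sequence("GTGAGCACCG AACTGCGCGT")
print(len(first_groups))         # 20
print(gc_content(first_groups))  # 0.65
```

Applying the same two functions to the full gene sequence gives a value that can be compared with the GC content field of the record. Note that the gene begins with GTG while the protein begins with M: under translation table 11, GTG is one of the permitted start codons and is read as methionine at initiation.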